Skip to content

mansh7763/mansh7763.github.io

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

21 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸš€ Himanshu Kumar - AI/ML Engineer Portfolio

Portfolio Preview GitHub Pages HTML CSS JavaScript

🌐 View Live Portfolio

A modern, interactive portfolio showcasing AI/ML projects, research, and professional experience

πŸ‘¨β€πŸ’» About Me

I'm Himanshu Kumar, a passionate AI/ML Engineer and final-year Computer Science student at IIIT Nagpur. I specialize in:

  • 🧠 Machine Learning & Deep Learning
  • πŸ‘οΈ Computer Vision & Vision-Language Models
  • πŸ”€ Natural Language Processing
  • πŸ“Š Data Science & Analytics
  • πŸ€– Generative AI & RLHF

🎯 Currently Seeking

  • ML Engineer positions
  • AI Engineer roles
  • Data Scientist opportunities
  • Research Internships

🌟 Portfolio Highlights

πŸ“Š Quick Stats

  • πŸŽ“ CGPA: 8.35/10
  • 🏒 Internships: 3 (IIT Madras, IIT Guwahati, ProCohat Technologies pvt ltd)
  • πŸ“œ NVIDIA Certifications: 5
  • πŸ“ Publications: 1 (ArXiv)
  • πŸ† Competition Wins: 2x Runner-up

πŸ”¬ Research & Publications

  • NanoVLM: Compact Vision-Language Models (ArXiv 2025)
  • Multimodal AI: Image Captioning with ViT + GPT-2/BERT
  • Audio Generation: MusicGen with RLHF for soundscape creation

πŸ’Ό Professional Experience

🎡 AI Intern - Generative AI & RLHF | IIT Madras (May 2025 - July 2025)

  • Developed soundscape music generation using MusicGen
  • Built web-based human feedback collection platform
  • Implemented RLHF pipeline for audio quality alignment

πŸ‘οΈ AI Intern | IIT Guwahati (Nov 2024 - Mar 2025)

  • Built multimodal image captioning VLM
  • Implemented Bottom-Up Top-Down attention mechanism
  • Trained on Flickr30k dataset with ViT encoder

πŸ’» Developer Intern | ProCohat Technology (Jun 2024 - Aug 2024)

  • Developed MultiPDF Chatting RAG Application
  • Integrated Supabase for document storage and retrieval
  • Enhanced personalized document management system

πŸ—οΈ Featured Projects

πŸ”¬ NanoVLM: Tiny Multimodal Vision Language Model

  • Achievement: 10x smaller than existing small VLMs
  • Performance: 39.84/50 creativity score, ROUGE-1 < 0.5
  • Tech Stack: PyTorch, Vision Transformer, GPT-2, Computer Vision
  • Links: Paper | GitHub

πŸ“° Sankshipt: News Summarizer Application

  • Features: Multi-language support (10 Indian languages)
  • Capabilities: Topic-based article retrieval and summarization
  • Tech Stack: NLP, Machine Learning, Flask, Text Processing
  • Links: GitHub

❀️ MultiLabel Sentiment Analysis

  • Challenge: 9-label emotion classification with class imbalance
  • Achievement: 88% accuracy using weighted loss
  • Tech Stack: DistilBERT, Transformers, PyTorch
  • Links: GitHub

πŸ› οΈ Technical Skills

🧠 Machine Learning

Supervised Learning β€’ Unsupervised Learning β€’ Deep Learning
Computer Vision β€’ NLP β€’ Vision-Language Models
Large Language Models β€’ RAG Systems

πŸ’» Programming Languages

Python β€’ C β€’ C++ β€’ SQL

πŸ”§ Frameworks & Libraries

PyTorch β€’ TensorFlow β€’ NumPy β€’ Pandas
Scikit-learn β€’ Flask β€’ FastAPI β€’ Seaborn

☁️ Tools & Platforms

GitHub β€’ Supabase β€’ CUDA β€’ Docker β€’ Jupyter

πŸ† Achievements & Certifications

πŸ₯‰ Competition Results

  • 2nd Runner-up: AI Hackathon, Jagriti, IIT BHU (700+ participants)
  • 2nd Runner-up: Enigma, Codefest, IIT BHU (1600+ participants)

πŸ“œ NVIDIA Deep Learning Institute Certifications

  • βœ… Fundamentals of Accelerated Computing with CUDA C/C++ (March 8, 2025)
  • βœ… Fundamentals of Deep Learning (February 16, 2025)
  • βœ… Building Transformer-Based NLP Applications (October 15, 2024)
  • βœ… Applications of AI for Anomaly Detection (November 9, 2024)

πŸŽ“ Education

Bachelor of Technology in Computer Science and Engineering
Indian Institute of Information Technology, Nagpur | CGPA: 8.35
2022 - Present (Final Year)

πŸ“š Key Coursework

  • Mathematics: Linear Algebra, Calculus, Discrete Math, Statistics
  • Computer Science: DSA, DBMS, OS, Software Engineering, Networks
  • AI Specialization: ML, DL, NLP, Computer Vision, Conversational AI

πŸ“ž Contact Information

Email LinkedIn GitHub Scholar

🌐 Portfolio Features

✨ Interactive Design

  • 🎨 Modern gradient backgrounds with glassmorphism effects
  • 🎭 Smooth animations and hover effects
  • πŸ“± Fully responsive design for all devices
  • ⚑ Fast loading and optimized performance

🎯 Key Sections

  • 🏠 Hero Section: Eye-catching introduction with call-to-action
  • πŸ‘€ About Me: Professional stats and personal description
  • πŸ’Ό Experience: Interactive timeline of internships
  • πŸ—οΈ Projects: Detailed showcase with tech stacks
  • πŸ› οΈ Skills: Categorized technical expertise
  • πŸ“œ Certifications: All credentials and achievements
  • πŸ“ž Contact: Professional contact information

πŸ”§ Technical Implementation

  • Frontend: Vanilla HTML5, CSS3, JavaScript ES6
  • Styling: Custom CSS with advanced animations
  • Deployment: GitHub Pages (Free hosting)
  • Performance: Optimized for fast loading
  • SEO: Semantic HTML structure

πŸ“ˆ Future Enhancements

  • Add dark/light theme toggle
  • Integrate blog section for technical articles
  • Add project filtering and search functionality
  • Include testimonials and recommendations
  • Implement contact form with backend
  • Add Google Analytics for visitor tracking

🀝 Let's Connect!

I'm actively seeking opportunities in AI/ML Engineering and Data Science. If you're interested in collaborating or have opportunities to discuss, feel free to reach out!


⭐ If you find this portfolio inspiring, please give it a star!

Built with ❀️ for the AI/ML community

Β© 2025 Himanshu Kumar. All rights reserved.

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages