LinkedIn | Gmail | Hugging Face | LeetCode | HackerRank

📍 Birmingham, AL | 🤖 AI/ML Engineer | 🚀 Open-Source Contributor


🧑‍💻 About Me

I design and deploy production-grade AI systems powered by LLMs, deep learning, and computer vision. My work focuses on building reliable backend services, optimizing model inference, and delivering high-performance AI applications in production environments.

I enjoy working across the full stack, from fine-tuning models to building backend platforms and deploying cloud-native services.

class HarshaVardhanMannem:
    def __init__(self):
        self.location     = "Birmingham, AL 📍"
        self.role         = "AI/ML Software Engineer"
        self.focus        = ["LLMs", "RAG", "Deep Learning", "Computer Vision"]
        self.backend      = ["FastAPI", "Django", "NestJS"]
        self.cloud        = ["AWS", "Azure", "Docker", "GPU Deployment"]
        self.currently    = "Building production LLM systems 🚀"

    def greet(self):
        return "Always building and exploring practical AI systems ⭐️"

🔭 What I Work On

| Area | Details |
|---|---|
| 🧠 LLM Systems | Integration, fine-tuning, and inference optimization |
| 🔍 RAG Pipelines | Retrieval-Augmented Generation & AI microservices |
| 🛠️ Backend Dev | FastAPI, Django, NestJS REST APIs |
| ☁️ Cloud & Infra | AWS, Azure, Docker, CI/CD, GPU-accelerated deployment |
| 📦 Open Source | Publishing & optimizing NLP and generative AI models |

🧪 ML Engineering Experience

| Discipline | Technologies & Practices |
|---|---|
| 🏋️ Model Training | PyTorch, Transformers fine-tuning, LoRA/QLoRA, PEFT, custom training loops |
| 📊 Data Pipelines | ETL workflows, data preprocessing, feature engineering, HuggingFace Datasets |
| 🔬 Experiment Tracking | MLflow, Weights & Biases, reproducible training runs |
| ⚙️ Model Optimization | ONNX export, TensorRT, quantization (INT8/FP16), pruning |
| 🚀 Model Serving | FastAPI inference endpoints, TorchServe, Triton Inference Server |
| 🤖 Agentic AI | Multi-agent pipelines with LangGraph, CrewAI, Google ADK |
| 📈 MLOps | CI/CD for ML, model versioning, A/B testing, drift monitoring |
| 🖼️ Computer Vision | Object detection, image classification, segmentation (PyTorch + OpenCV) |
| 🧪 Test Automation | Selenium, Playwright, Cypress for UI, E2E, and API test automation |

🛠️ Tech Stack

Languages

Python, Java, JavaScript, TypeScript, SQL

AI / ML & Agents

PyTorch, Hugging Face, LangChain, LangGraph, OpenAI, CrewAI, Google ADK, MLflow, scikit-learn

Backend & Web

FastAPI, Django, NestJS, Spring Boot, Node.js, Next.js

Cloud & Infrastructure

AWS, Azure, Docker, GitHub Actions, Prometheus, Grafana

Testing & Automation

Selenium, Playwright, Cypress


📊 GitHub Stats

[Images: GitHub Streak, Stars, Contribution Graph]

🏆 Certifications

| Certification | Issuer |
|---|---|
| 🏅 AWS Certified Machine Learning Engineer – Associate | Amazon Web Services |
| 🏅 Oracle Cloud Generative AI Professional | Oracle |
| 🏅 NVIDIA Deep Learning Institute | NVIDIA |

🎯 Current Focus

🔹 Production LLM Systems 🔹 Scalable Backend Architecture
🔹 High-Performance Inference 🔹 AI-Powered Applications


⭐️ Always building and exploring practical AI systems

Pinned

  1. LLM-powered-Research-Papers-QA-Bot- (Public)

    Research paper QA bot using RAG agents with an LLM

    Python 1

  2. Multi-agent-restaurant-analytics (Public)

    Ask questions, get answers. Natural language analytics for restaurant POS data. Multi-agent LangGraph system converts queries to SQL with deterministic validation. Built for production reliability.

    Python

  3. Product-Recommendation-System (Public)

    An AI-powered product recommendation chatbot that leverages RAG technology with LangChain to provide intelligent product suggestions based on user queries. Built with Flask and featuring a modern g…

    CSS

  4. Real-Time-Crypto-Stream (Public)

    A scalable cryptocurrency data ingestion system that streams real-time market data from TradingView using advanced web automation, featuring intelligent resource allocation, automated validation, a…

    TypeScript
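
The Multi-agent-restaurant-analytics entry above mentions converting queries to SQL "with deterministic validation". As a rough illustration of what such a check might look like (this is not code from that repository), the sketch below applies fixed rules to a generated query before it is run. The `orders` schema, the `validate_sql` name, and the specific rules are all assumptions made for the example, and SQLite stands in for whatever database the project actually uses.

```python
import re
import sqlite3
from typing import Tuple

# Hypothetical schema for illustration only; the real POS schema is not shown here.
SCHEMA = """
CREATE TABLE orders (id INTEGER PRIMARY KEY, item TEXT, total REAL, ordered_at TEXT);
"""


def validate_sql(query: str) -> Tuple[bool, str]:
    """Return (ok, reason) using fixed rules only; no model call is involved."""
    stripped = query.strip().rstrip(";").strip()

    # Rule 1: a single statement only. This is deliberately conservative:
    # a semicolon inside a string literal is rejected as well.
    if ";" in stripped:
        return False, "multiple statements"

    # Rule 2: read-only queries only.
    if not re.match(r"(?i)^select\b", stripped):
        return False, "only SELECT queries are allowed"

    # Rule 3: the query must compile against the known schema.
    # EXPLAIN prepares the statement without running the query itself.
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(SCHEMA)
        conn.execute("EXPLAIN " + stripped)
    except sqlite3.Error as exc:
        return False, f"does not compile: {exc}"
    finally:
        conn.close()
    return True, "ok"


print(validate_sql("SELECT item, SUM(total) FROM orders GROUP BY item"))
print(validate_sql("DROP TABLE orders"))  # (False, 'only SELECT queries are allowed')
```

Because every rule is a plain check rather than a model judgment, the same query always gets the same verdict, which is the property "deterministic" suggests here.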