I design and deploy production-grade AI systems powered by LLMs, deep learning, and computer vision. My work focuses on building reliable backend services, optimizing model inference, and delivering high-performance AI applications.
I enjoy working across the full stack, from fine-tuning models to building backend platforms and deploying cloud-native services.

```python
class HarshaVardhanMannem:
    def __init__(self):
        self.location = "Birmingham, AL"
        self.role = "AI/ML Software Engineer"
        self.focus = ["LLMs", "RAG", "Deep Learning", "Computer Vision"]
        self.backend = ["FastAPI", "Django", "NestJS"]
        self.cloud = ["AWS", "Azure", "Docker", "GPU Deployment"]
        self.currently = "Building production LLM systems"

    def greet(self):
        return "Always building and exploring practical AI systems"
```

| Area | Details |
|---|---|
| LLM Systems | Integration, fine-tuning, and inference optimization |
| RAG Pipelines | Retrieval-Augmented Generation & AI microservices |
| Backend Dev | FastAPI, Django, NestJS REST APIs |
| Cloud & Infra | AWS, Azure, Docker, CI/CD, GPU-accelerated deployment |
| Open Source | Publishing & optimizing NLP and generative AI models |

| Discipline | Technologies & Practices |
|---|---|
| Model Training | PyTorch, Transformers fine-tuning, LoRA/QLoRA, PEFT, custom training loops |
| Data Pipelines | ETL workflows, data preprocessing, feature engineering, HuggingFace Datasets |
| Experiment Tracking | MLflow, Weights & Biases, reproducible training runs |
| Model Optimization | ONNX export, TensorRT, quantization (INT8/FP16), pruning |
| Model Serving | FastAPI inference endpoints, TorchServe, Triton Inference Server |
| Agentic AI | Multi-agent pipelines with LangGraph, CrewAI, Google ADK |
| MLOps | CI/CD for ML, model versioning, A/B testing, drift monitoring |
| Computer Vision | Object detection, image classification, segmentation (PyTorch + OpenCV) |
| Test Automation | Selenium, Playwright, Cypress for UI, E2E, and API test automation |

| Certification | Issuer |
|---|---|
| AWS Certified Machine Learning Engineer - Associate | Amazon Web Services |
| Oracle Cloud Generative AI Professional | Oracle |
| NVIDIA Deep Learning Institute | NVIDIA |

- Production LLM Systems
- Scalable Backend Architecture
- High-Performance Inference
- AI-Powered Applications


