Engineering Leader • AI/ML Systems • Cloud + MLOps • Kubernetes Platforms
I build AI-first products and scalable platforms. Current interests: LLM apps, GenAI pipelines, practical MLOps, and cloud infrastructure for GPU workloads.
- LLM apps: RAG, tool use, agents, evaluation, prompt and retrieval strategies
- GenAI: diffusion fine-tuning, multimodal pipelines, inference optimization
- MLOps: training pipelines, versioning, CI/CD for ML, monitoring and drift
- Platform: Kubernetes, containers, queues, caching, observability, infra automation
- Cloud: AWS and GCP, security-first architecture, cost-aware design
- Speaker on AI infrastructure and scaling topics
- Startup competition judge (AI/ML track)
- Occasional academic contributor and mentoring
|
|
|
|
|
|
|
|
|



