A serverless, AI-powered blog generation system featuring:
- Amazon Bedrock foundation models for AI text generation
- AWS Lambda for serverless orchestration
- Amazon S3 for persistent blog storage
- API Gateway for secure, HTTPS-based client access
- Fine-grained IAM roles for secure Bedrock and S3 integration
π Tech: AWS Lambda, Amazon Bedrock, API Gateway, Amazon S3, Python
π Design: Serverless, cloud-native, scalable, production-ready
A production-grade, end-to-end abstractive text summarization system featuring:
- Full fine-tuning of Google Pegasus LLM for domain-specific summarization.
- Modular, MLOps-ready pipelines for ingestion, validation, transformation, training, evaluation, and prediction.
- FastAPI backend with Jinja2-powered web UI.
- DVC dataset versioning, MLflow experiment tracking, and centralized logging.
- Local + AWS S3 artifact storage with YAML-driven configuration.
π Tech: FastAPI, Hugging Face Transformers, Google Pegasus, DVC, MLflow, AWS S3, YAML
π Design: Modular, reproducible, cloud-ready, production-grade
A fully modular, production-grade ML pipeline for phishing detection with:
- FastAPI for API serving (
/train,/predict) - Async training using Celery + Redis
- Workflow versioning with DVC
- Hyperparameter tuning via Optuna
- Experiment tracking using MLflow + DagsHub
- Schema + drift validation, preprocessing, and artifact management
- CI/CD with GitHub Actions, auto-sync to AWS S3
π Tech: FastAPI, Scikit-learn, Celery, MLflow, Optuna, AWS, MongoDB, Docker
π Design: Modular, YAML-driven, CI/CD-enabled
A reusable and scalable machine learning pipeline to predict student scores:
- PostgreSQL-based ingestion with table auto-creation from YAML schema
- Configurable preprocessing (scaling, encoding, imputing, column operations)
- Full pipeline: ingestion β validation β transformation β training β evaluation β prediction
- Optuna-powered hyperparameter tuning with MLflow tracking
- Centralized logging with AWS S3 and local backups
- Artifact versioning using DVC
π Tech: Scikit-learn, Optuna, PostgreSQL, MLflow, DVC, AWS, YAML
π Design: Production-grade, modular, reusable, data-driven, CI/CD-enabled
A Dockerized regression pipeline with a simple web interface for real-time prediction:
- Built using ElasticNet Regression
- Flask frontend for user input and result display
- Modular pipeline: ingestion β validation β transformation β training β prediction
- Structured with reusable entity-based config classes
- MLflow-based experiment logging & reproducible config
π Tech: Flask, Scikit-learn, MLflow, Docker, YAML
π Design: Lightweight, extensible, production-ready
πΉ Data Scientist / CAE Engineer β General Motors (via TCS)
Sep 2019 β Aug 2023 Β· Bangalore, India
- Reduced EV battery module mass by 11% via ML-driven simulation optimization
- Automated FEA workflows and built predictive models for unseen load conditions
- Developed Kriging-based optimization and mentored junior engineers in ML-integrated engineering
πΉ Structural Analyst β Johnson & Johnson MedTech (via TCS)
Feb 2017 β Aug 2019 Β· Kolkata, India
- Optimized design cycles for surgical devices using FE analysis and statistical tools
- Built automation scripts for structural simulation and data extraction
- Supported design for novel uterine tumor excision device using biomechanical modeling
-
π Iβm currently working on
Production-grade AI APIs with FastAPI + Celery + MLflow for scalable ML deployment. -
π€ Iβm looking to collaborate on
Applied ML projects, MLOps tooling, or LLM use cases in healthcare or finance. -
π§ Iβm looking for help with
Model serving at scale and efficient Kubernetes-based deployment. -
π± Iβm currently learning
LLM fine-tuning, Kubeflow pipelines, and advanced ML monitoring strategies. -
π¬ Ask me about
MLOps pipelines, Optimization, Automation, or real-world AI deployment. -
β‘ Fun fact
My first ML pipeline was trained entirely on FEA simulation dataβno labeled dataset, just raw physics!
