I'm a Master's student in Data Analytics Engineering at Northeastern University, passionate about building scalable data infrastructure and deploying intelligent systems. Currently seeking full-time opportunities in Data Engineering, Analytics, or MLOps (Available August 2025).
- π Currently working on GenAI applications and MLOps pipelines
- π± Learning advanced cloud architectures and real-time data processing
- π― Looking to collaborate on open-source data engineering projects
- π¬ Ask me about data pipelines, ML deployment, and cloud infrastructure
- π Based in Boston, MA | Open to relocate
- β‘ Fun fact: I reduced documentation failures by 70% with a single API integration!
GenAI Chatbot with RAG Architecture
- Built using Mistral AI, LangChain, FAISS for intelligent document retrieval
- Orchestrated with Prefect, tracked with MLFlow, deployed on GCP Cloud Run
- Complete CI/CD pipeline with GitHub Actions and Docker
- Tech Stack: Python, Flask, Streamlit, GCP, Docker, MLflow
Advanced Analytics & Machine Learning
- Improved model accuracy by 80% through comprehensive data cleaning
- Implemented K-Means clustering for customer segmentation
- Delivered actionable marketing insights for targeted campaigns
- Tech Stack: Python, Pandas, Scikit-learn, Matplotlib
Database Design & Big Data Processing
- Designed scalable ERD models for complex supply chain management
- Implemented MongoDB MapReduce for large-scale data aggregation
- Created interactive crop visualization dashboards
- Tech Stack: MySQL, MongoDB, Python, Data Modeling
π§ Data Engineer Co-op @ Bayer (June 2024 β Dec 2024)
- Developed centralized API for 50+ gRPC services using GoLang
- Reduced documentation failures by 70% through automation
- Deployed on GKE with Helm and implemented CI/CD with GitHub Actions
π Data Analytics Intern @ Brane Enterprises (Dec 2022 β June 2023)
- Built heart disease detection model with 90% accuracy
- Processed 10K+ ECG records with real-time data analysis
- Collaborated in Agile environments with cross-functional teams
- π Building scalable data pipelines for real-time processing
- π€ Exploring advanced GenAI applications and LLM optimization
- βοΈ Mastering cloud-native architectures and serverless computing
- π Contributing to open-source data engineering tools
- π Reduced manual processes by 60% through intelligent automation
- π Improved model performance by 25% with advanced analytics
- π§ Successfully deployed production-ready MLOps pipelines
- π Built dashboards serving enterprise-level data insights

