Aayush Agarwal phantom2810

Aayush Agarwal

AI Researcher & Engineer | MS Computer Engineering @ NYU

👨‍💻 About Me

AI Researcher and Engineer specializing in privacy-preserving machine learning, multimodal AI, and production-scale ML systems. Currently pursuing MS in Computer Engineering at NYU with focus on federated learning, secure computation, and healthcare AI applications.

Core Expertise: MLOps • Computer Vision • NLP • Privacy-Preserving ML • Distributed Systems

🔬 Research & Focus Areas

🔐 Privacy-Preserving ML

Secure multi-party computation (MPC)
Federated learning systems
Differential privacy frameworks

🏥 Medical AI

Automated pathology report generation
ViLT-based multimodal models
MIMIC-CXR dataset processing

🛡️ LLM Security

Adversarial jailbreak detection
Robustness testing frameworks
Real-time vulnerability assessment

⚡ MLOps & Infrastructure

Distributed training pipelines
Cloud-scale ML systems
CI/CD automation for ML workflows

💼 Professional Experience

🔐 Software Engineer Intern @ OpenMined

📍 New York, NY | 📅 August 2025 – Present

🛡️ Designing a privacy-preserving distributed data aggregation framework using secure multi-party computation (MPC) and additive secret sharing for federated, privacy-compliant LLM training and evaluation
🏗️ Developed a two-tier network architecture (heavy/light nodes) with threshold-based secure aggregation, integrating SQL, Oracle RDBMS, and data warehouse systems into privacy-first pipelines
📝 Co-authored research paper submitted to ICLR 2026 on secure distributed computation for federated learning

🔬 AI Researcher @ NYU

📍 New York, NY | 📅 Current

🧠 Leading multimodal AI research for medical imaging and automated pathology report generation
⚡ Built PyTorch pipelines reducing preprocessing time by 20% on NYU HPC systems
🏆 Demonstrated superior performance of ViLT over MedCLIP in multimodal alignment tasks

🌐 Software Engineer (AI) @ Google Chronicle

📍 India

🛡️ Integrated anomaly detection algorithms into Google Chronicle SIEM serving 500K+ users
📊 Deployed 10+ ML-enhanced log parsers, boosting data usability by 30%
⏱️ Reduced manual log management by 30% and alert resolution time by 40%

🏢 Additional Experience

🔬 IBM Research | Research Intern

📊 Developed differential privacy frameworks achieving 35% privacy enhancement
🔒 Preserved 90% data utility while maintaining strong privacy guarantees

🤖 Tech Mahindra | AI Engineer

💬 Built NLP chatbots and computer vision solutions
📈 Achieved 25% accuracy improvement in production models

🛠️ Technical Skills

Languages & Core

AI/ML Frameworks

MLOps & DevOps

Cloud Platforms

Data & Analytics

🌟 Featured Projects

🚀 Movie Recommendation System with MLOps

Complete MLOps pipeline with distributed training on GPU infrastructure. Provisioned resources on Chameleon Cloud with IaC, implemented multi-GPU training with hyperparameter optimization, and established CI/CD pipeline using GitHub Actions and Argo Workflows.

🔒 Jailbreak Simulator - Adversarial Testing for LLMs

Modular framework for adversarial testing of GPT models with real-time vulnerability detection. Reduced jailbreak success rates through iterative optimization and automated testing pipelines.

🏆 RAG-Powered Compliance Navigator

🏅 Amazon Sambhav Hackathon Finalist • 5,000+ Active Users

RAG-powered application for export compliance with vector database semantic search and natural language querying of regulatory data.

🖼️ AI-Driven Image Categorization

Cloud-deployed image classification system with automated categorization for large-scale processing and production-ready computer vision pipeline.

� Additional Projects

Project	Tech Stack	Impact
🔒 IBM Differential Privacy	IBM Privacy Lib, Python	35% privacy enhancement with 90% data utility preservation
🔍 MSMARCO Search Engine	Information Retrieval	Efficient search algorithms and ranking mechanisms
📝 Cover Letter Generator	LangChain, NLP	Automated personalized content generation
☸️ Kubernetes ML Deployment	Kubernetes, MLOps	Scalable ML infrastructure for production
🧠 Deep Learning Research	Deep Learning	Advanced neural network architectures
💻 Algorithm Solutions	Python, Algorithms	Optimized competitive programming solutions

🎓 Education

MS Computer Engineering | New York University | 2023 - 2025
Focus: Artificial Intelligence, Machine Learning, Distributed Systems

BTech Computer Engineering | Vishwakarma Institute of Technology | 2018 - 2022
Core CS Fundamentals

🏆 Achievements & Publications

Achievements

🥇 Finalist - Amazon Sambhav Hackathon 2024
🎯 Best Attending Team - Hack NYU
📜 Best Paper Award - IEEE Pune 2022

Publications & Research

📄 IEEE Conference Paper: "Vehicle Characteristics Recognition by Appearance" (2022)
🔬 ICLR 2026 Submission: "Secure Distributed Computation for Federated Learning" (Co-author)
🔒 Patent: "CONCRETE GAN: Hybrid AI-based Data Generation Model" (Patent No. 202221001110A)

📊 GitHub Stats

📫 Contact

I'm always interested in collaborating on innovative AI projects, discussing research ideas, or exploring opportunities in privacy-preserving ML and distributed systems.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly