Skip to content
View phantom2810's full-sized avatar
  • New York

Highlights

  • Pro

Block or report phantom2810

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
phantom2810/README.md

Aayush Agarwal

AI Researcher & Engineer | MS Computer Engineering @ NYU

LinkedIn Email GitHub

profile views

πŸ‘¨β€πŸ’» About Me

AI Researcher and Engineer specializing in privacy-preserving machine learning, multimodal AI, and production-scale ML systems. Currently pursuing MS in Computer Engineering at NYU with focus on federated learning, secure computation, and healthcare AI applications.

Core Expertise: MLOps β€’ Computer Vision β€’ NLP β€’ Privacy-Preserving ML β€’ Distributed Systems


πŸ”¬ Research & Focus Areas

πŸ” Privacy-Preserving ML

  • Secure multi-party computation (MPC)
  • Federated learning systems
  • Differential privacy frameworks

πŸ₯ Medical AI

  • Automated pathology report generation
  • ViLT-based multimodal models
  • MIMIC-CXR dataset processing

πŸ›‘οΈ LLM Security

  • Adversarial jailbreak detection
  • Robustness testing frameworks
  • Real-time vulnerability assessment

⚑ MLOps & Infrastructure

  • Distributed training pipelines
  • Cloud-scale ML systems
  • CI/CD automation for ML workflows

πŸ’Ό Professional Experience

πŸ” Software Engineer Intern @ OpenMined

πŸ“ New York, NY | πŸ“… August 2025 – Present

  • πŸ›‘οΈ Designing a privacy-preserving distributed data aggregation framework using secure multi-party computation (MPC) and additive secret sharing for federated, privacy-compliant LLM training and evaluation
  • πŸ—οΈ Developed a two-tier network architecture (heavy/light nodes) with threshold-based secure aggregation, integrating SQL, Oracle RDBMS, and data warehouse systems into privacy-first pipelines
  • πŸ“ Co-authored research paper submitted to ICLR 2026 on secure distributed computation for federated learning

πŸ”¬ AI Researcher @ NYU

πŸ“ New York, NY | πŸ“… Current

  • 🧠 Leading multimodal AI research for medical imaging and automated pathology report generation
  • ⚑ Built PyTorch pipelines reducing preprocessing time by 20% on NYU HPC systems
  • πŸ† Demonstrated superior performance of ViLT over MedCLIP in multimodal alignment tasks

🌐 Software Engineer (AI) @ Google Chronicle

πŸ“ India

  • πŸ›‘οΈ Integrated anomaly detection algorithms into Google Chronicle SIEM serving 500K+ users
  • πŸ“Š Deployed 10+ ML-enhanced log parsers, boosting data usability by 30%
  • ⏱️ Reduced manual log management by 30% and alert resolution time by 40%


🏒 Additional Experience

πŸ”¬ IBM Research | Research Intern

  • πŸ“Š Developed differential privacy frameworks achieving 35% privacy enhancement
  • πŸ”’ Preserved 90% data utility while maintaining strong privacy guarantees

πŸ€– Tech Mahindra | AI Engineer

  • πŸ’¬ Built NLP chatbots and computer vision solutions
  • πŸ“ˆ Achieved 25% accuracy improvement in production models

πŸ› οΈ Technical Skills

Languages & Core
Python C++ Java SQL

AI/ML Frameworks
PyTorch TensorFlow Scikit--learn Hugging Face LangChain

MLOps & DevOps
Docker Kubernetes MLflow Airflow Ray

Cloud Platforms
AWS Google Cloud Azure

Data & Analytics
Pandas NumPy Tableau


🌟 Featured Projects

Python PyTorch MLflow Docker

Complete MLOps pipeline with distributed training on GPU infrastructure. Provisioned resources on Chameleon Cloud with IaC, implemented multi-GPU training with hyperparameter optimization, and established CI/CD pipeline using GitHub Actions and Argo Workflows.


PyTorch Hugging Face OpenAI

Modular framework for adversarial testing of GPT models with real-time vulnerability detection. Reduced jailbreak success rates through iterative optimization and automated testing pipelines.


LangChain FAISS Flask

πŸ… Amazon Sambhav Hackathon Finalist β€’ 5,000+ Active Users

RAG-powered application for export compliance with vector database semantic search and natural language querying of regulatory data.


TensorFlow OpenCV AWS

Cloud-deployed image classification system with automated categorization for large-scale processing and production-ready computer vision pipeline.


οΏ½ Additional Projects

Project Tech Stack Impact
πŸ”’ IBM Differential Privacy IBM Privacy Lib, Python 35% privacy enhancement with 90% data utility preservation
πŸ” MSMARCO Search Engine Information Retrieval Efficient search algorithms and ranking mechanisms
πŸ“ Cover Letter Generator LangChain, NLP Automated personalized content generation
☸️ Kubernetes ML Deployment Kubernetes, MLOps Scalable ML infrastructure for production
🧠 Deep Learning Research Deep Learning Advanced neural network architectures
πŸ’» Algorithm Solutions Python, Algorithms Optimized competitive programming solutions

πŸŽ“ Education

MS Computer Engineering | New York University | 2023 - 2025
Focus: Artificial Intelligence, Machine Learning, Distributed Systems

BTech Computer Engineering | Vishwakarma Institute of Technology | 2018 - 2022
Core CS Fundamentals


πŸ† Achievements & Publications

Achievements

  • πŸ₯‡ Finalist - Amazon Sambhav Hackathon 2024
  • 🎯 Best Attending Team - Hack NYU
  • πŸ“œ Best Paper Award - IEEE Pune 2022

Publications & Research

  • πŸ“„ IEEE Conference Paper: "Vehicle Characteristics Recognition by Appearance" (2022)
  • πŸ”¬ ICLR 2026 Submission: "Secure Distributed Computation for Federated Learning" (Co-author)
  • πŸ”’ Patent: "CONCRETE GAN: Hybrid AI-based Data Generation Model" (Patent No. 202221001110A)

πŸ“Š GitHub Stats


πŸ“« Contact

I'm always interested in collaborating on innovative AI projects, discussing research ideas, or exploring opportunities in privacy-preserving ML and distributed systems.

LinkedIn Email GitHub

Pinned Loading

  1. movie-recommendation_system_MLOps movie-recommendation_system_MLOps Public

    Python 1

  2. DeepakSingh260/Real-Time-News-Recommnedation-System DeepakSingh260/Real-Time-News-Recommnedation-System Public

    Python

  3. Jailbreak-Simulator-Adversarial-Testing-for-LLMs Jailbreak-Simulator-Adversarial-Testing-for-LLMs Public

    Jupyter Notebook 1

  4. rag-powered-compliance-and-incentive-navigator rag-powered-compliance-and-incentive-navigator Public

    Python 1

  5. IBM-Differential-Privacy IBM-Differential-Privacy Public

    CSS 1

  6. Efficient-Search-Engine-Development-Using-the-MSMARCO-Dataset Efficient-Search-Engine-Development-Using-the-MSMARCO-Dataset Public

    Forked from AdityaC19/Efficient-Search-Engine-Development-Using-the-MSMARCO-Dataset

    Jupyter Notebook 1