Skip to content
View revoker3661's full-sized avatar
  • Boisar

Highlights

  • Pro

Block or report revoker3661

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
revoker3661/README.md

Hello , I'm Pappu Sunil Yadav

๐Ÿ“ India ย  | ย  ๐Ÿ’ผ Open to Remote GenAI / ML Opportunities


๐Ÿš€ About Me

I am a GenAI & Applied Machine Learning Engineer focused on building real-world, production-grade AI systems rather than demo-only models.

My core expertise lies in LLMs, Retrieval-Augmented Generation (RAG), multimodal AI, and document intelligence, with a strong emphasis on system architecture โ€” how data flows from raw sources to reliable, grounded AI outputs.

I enjoy working at the intersection of ML engineering, GenAI system design, and practical deployment, especially in high-impact domains like healthcare and medical AI.


๐Ÿง  Featured Projects

๐Ÿ”น Multimodal Clinical RAG Assistant (Medical Text + Image)

What it is:
A doctor-assistive multimodal AI system that jointly understands medical text, tables, diagrams, and patient images to generate clinically grounded explanations using a multimodal LLM.

  • Architecture: Dual-encoder retrieval using MiniLM for medical text/tables/JSONL entries and OpenCLIP ViT-B/32 for diagrams and patient images.
  • Retrieval: Hybrid multimodal search with ChromaDB, enabling cross-verification between symptoms, diagnosis text, and visual evidence.
  • Reasoning: Dynamic prompt composition feeding retrieved text + images into IDEFICS2-8B (4-bit).
  • Deployment: Interactive Streamlit UI displaying matched visuals, source text, diagnosis, causes, and treatments in real time.

Impact:
โ€ข Achieved 100% text retrieval accuracy and near-perfect image retrieval.
โ€ข Significantly reduced hallucinations compared to text-only RAG systems.

๐ŸŽฅ Demo: YouTube Channel
๐Ÿ”— Repo: GitHub


๐Ÿ”น AI Document Intelligence Pipeline (PDF โ†’ Structured JSONL)

Designed a high-accuracy document AI pipeline to convert unstructured medical textbooks into structured JSONL datasets optimized for RAG and multimodal LLMs.

  • Implemented document layout segmentation using Detectron2 + PubLayNet.
  • Engineered dual-stage table verification with Microsoft Table Transformer (DETR).
  • Integrated PaddleOCR for extracting dense labels from scanned diagrams.
  • Generated spatially grounded metadata including bounding boxes and captions.

Results:
โ€ข Processed 500+ pages per book, generating 40,000+ JSONL entries.
โ€ข Achieved 95% table extraction accuracy.
โ€ข Orchestrated on NVIDIA L4 Cloud GPU.

๐Ÿ”— GitHub


๐Ÿ”น Pneumonia Detection Using Deep Learning (VGG19)

Built an automated pneumonia detection system using chest X-ray images to assist radiologists in early and accurate diagnosis.

  • Applied image preprocessing and augmentation using ImageDataGenerator.
  • Developed a transfer-learning pipeline with VGG19 and custom dense layers.
  • Used EarlyStopping and ReduceLROnPlateau to prevent overfitting.

Performance: Achieved 92โ€“97% accuracy for NORMAL vs PNEUMONIA classification.


๐Ÿ”น Personalized Medicine Recommendation System

Developed a symptom-based medical recommendation system that predicts diseases and provides actionable healthcare guidance.

  • Trained an SVC classifier achieving 100% test accuracy.
  • Built a complete ML pipeline including feature encoding and model comparison.
  • Integrated medical knowledge modules for medications, diets, precautions, and workouts.
  • Delivered as a full Flask web application with dynamic UI.

๐Ÿ”น Other Projects

  • Breast Cancer Classification (Neural Network)
  • Heart Disease Prediction (Logistic Regression)
  • Route Optimization System (A*) โ€” Hackathon 2025
  • Women Safety App โ€” Android + Firebase
  • Steganography Encryption System
  • RASA Tour Chatbot

๐Ÿ› ๏ธ Tech Stack

Languages: Python, SQL, C/C++
ML / DL: TensorFlow, PyTorch, Scikit-learn
GenAI: Hugging Face, LangChain, LangGraph, LLMs, RAG
Computer Vision & OCR: OpenCV, Detectron2, PaddleOCR
Vector Databases: ChromaDB
Deployment: Flask, Streamlit, Docker
Infrastructure: NVIDIA L4 GPU


๐Ÿ“ซ Letโ€™s Connect

๐Ÿ“ง Email: yadavpappu3661@gmail.com
๐Ÿ’ผ LinkedIn: linkedin.com/in/pappu-yadav-3319ab289
๐Ÿ’ป GitHub: github.com/revoker3661
๐ŸŽฅ YouTube: youtube.com/@pappuyadav-js3pq


โญ๏ธ Thanks for visiting โ€” letโ€™s build impactful GenAI systems together!

Pinned Loading

  1. AI-Document-Intelligence-Pipeline-PDF-Structured-JSONL-for-Multimodal-RAG- AI-Document-Intelligence-Pipeline-PDF-Structured-JSONL-for-Multimodal-RAG- Public

    A robust, production-grade pipeline converting complex Medical PDFs into structured, RAG-ready JSONL datasets. Features smart table merging, multimodal extraction, and dynamic layout analysis usingโ€ฆ

    Python

  2. Multimodal-Clinical-RAG-Assistant-Medical-Text-Image-Retrieval-System- Multimodal-Clinical-RAG-Assistant-Medical-Text-Image-Retrieval-System- Public

    A doctor-assistive AI system that interprets medical knowledge and patient images simultaneously. It utilizes a Dual-Encoder architecture to cross-reference textbook theory with visual pathology, gโ€ฆ

    Python

  3. medical-rag-assistant medical-rag-assistant Public

    An AI Clinical Decision Support assistant using RAG to provide evidence-based answers from medical textbooks. Built with LangChain, Streamlit, and Ollama.

    Jupyter Notebook

  4. tour-based-chatbot tour-based-chatbot Public