Skip to content
View RajKumaar123's full-sized avatar

Block or report RajKumaar123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
rajkumaar123/README.md

๐Ÿ‘‹ Hello, I'm Raj Kumar

๐ŸŽฏ AI/ML Engineer | LLM & Generative AI Specialist
๐Ÿ” Building real-world solutions with Deep Learning, NLP, and Computer Vision

๐ŸŒ LinkedIn | โœ๏ธ Medium | ๐Ÿ“ง er.rajkumaar@gmail.com


๐Ÿง  Professional Summary โ€“ AI/ML, LLMs, and Generative AI

AI/ML Engineer with 14+ years of experience in banking, analytics, and enterprise-grade automation. Iโ€™ve spent the last 6+ years solving real-world problems using deep learning, LLMs, and computer vision โ€” from smart document QA and chatbot systems to secure image generation and anomaly detection.

Specialized in building explainable, ethical, and inclusive AI systems with a strong focus on NLP for low-resource Indian languages and scalable deployment using open-source tools.


๐Ÿ”ง Core Competencies

  • ๐Ÿค– LLMs & Generative AI

    • End-to-end chatbot and document QA system development using Hugging Face Transformers, LangChain, and custom LLMs
    • Retrieval-Augmented Generation (RAG) pipelines using FAISS, Pinecone, and embedding stores
    • Prompt engineering for zero-shot, instruction-based, and multilingual LLM applications
    • Low-resource language model development (e.g., Angika GPT) for regional translation and content generation
    • Scalable deployment of LLM APIs with FastAPI and Flask
  • ๐Ÿ“š Natural Language Processing (NLP)

    • NER, Sentiment Analysis, Semantic Search, and Text Classification using transformer-based architectures
    • Translation systems for multilingual and low-resource Indian languages
    • Fine-tuned models for summarization, question answering, and domain-specific content generation
    • Custom NLP pipelines built using Hugging Face, spaCy, and NLTK
  • ๐Ÿ“ˆ Machine Learning & Forecasting

    • Forecasting and risk modeling using ARIMA, XGBoost, and LSTM
    • Ensemble learning, advanced feature engineering, and hyperparameter optimization
    • Model interpretability and fairness via Explainable AI techniques (SHAP, LIME)
    • Full lifecycle ML pipeline implementation with scikit-learn and PyTorch
  • ๐ŸŽฅ Computer Vision & Generative AI

    • Object detection, segmentation, OCR, and surveillance-based anomaly detection using YOLO, SAM, SegFormer, and OpenCV
    • Built generative pipelines for Text-to-Image, Image-to-Image, and Image-to-Video using Diffusion Models, DCGANs, and ControlNet
    • Face image encryption and security through GAN-based adversarial training
    • Applied and experimented with Large Vision Models (LVMs) like Florence and SegFormer
  • โš™๏ธ MLOps & Deployment

    • Designed and deployed modular ML APIs using Flask, FastAPI, and Streamlit
    • Model packaging with Pickle, Hugging Face Hub, and CI/CD pipelines via GitHub and GitLab
    • Experience with cloud-agnostic and secure on-prem enterprise deployments
    • Version control, reproducibility, and lifecycle tracking using Git-based workflows
  • ๐Ÿ“Š Analytics & Data Engineering

    • Data preprocessing, transformation, and integration from SQL, MySQL, and DB2
    • Advanced analytics including regression, hypothesis testing, and A/B testing
    • Visualization using Matplotlib, Seaborn, Plotly, and Power BI for decision support
  • ๐Ÿ‘ฅ Leadership & Agile Delivery

    • Agile/Scrum-based project management with JIRA and Confluence
    • Sprint planning, roadmap building, stakeholder communication
    • Led globally distributed teams across delivery lifecycles from PoC to production
    • Mentored engineers on Python, ML, NLP, and deployment standards
    • Translating business goals into scalable and interpretable AI solutions

๐Ÿ“œ Certifications

  • Certified SAFeยฎ Scrum Master
    Issued by Scaled Agile, Inc. | Issued March 2025
    Credential ID: 48725598-4060

๐ŸŽ“ Education

Degree Institution Year Grade
M. Tech (Data & Computational Science) IIT Jodhpur 2023 7.74 CGPA
B. Tech (CSE) Rajasthan Technical University 2010 68%

๐Ÿš€ AI/ML & NLP Project Portfolio

Delivered enterprise-grade AI/ML solutions across banking, finance, and language tech using LLMs, Computer Vision, and Deep Learning.

Flask + LangChain powered RAG chatbot with multi-doc upload, image-aware retrieval, session memory, and a clean Markdown chat UI using ChromaDB and Gemini.

A computer vision-based alert system to detect heat spikes in thermal or surveillance videos using OpenCV. Designed for industrial safety and anomaly detection with real-time alert logging and visualization.

A modular computer vision pipeline that reads driving scene videos and overlays category-specific annotations (bounding boxes and labels) from CSV files. Built using OpenCV and Python for rapid prototyping, visual validation, and AI model support in video analytics workflows.

A computer vision project that detects vehicles, estimates speed, flags over-speeding, and generates alert frames โ€” built with YOLOv8, OpenCV, and Jupyter.

Developed a personalized ML-based lending solution for early-stage retailers with 2โ€“3 years of business history. Utilized multiple models to evaluate creditworthiness and integrated Explainable AI (XAI) for transparent, data-driven decision-making. Improved loan approval accuracy while reducing default risks.

Developed a neural machine translation (NMT) model to translate Angikaโ€”an endangered Indian languageโ€”into English. Used an encoder-decoder LSTM architecture in TensorFlow Keras. Aimed at preserving linguistic heritage and enabling accessibility for low-resource languages. Supports cultural conservation through AI-driven translation.

๐Ÿ”น Multilingual Content Generation (Angika GPT)

Leveraging my expertise in Natural Language Processing and Generative AI, I spearheaded the development of Angika GPT, (www.angikaGPT.com) a custom Language Model designed to address the challenges of low-resource language translation and content generation, with a particular focus on Angika. This initiative involved utilizing Hugging Face Transformers to build and fine-tune a specialized LLM, actively contributing to digital preservation efforts by enabling content creation and translation in Angika. My role encompassed all aspects of the project, from model selection and training to performance evaluation and optimization, ultimately fostering the development of inclusive language technology and promoting multilingual NLP capabilities.

๐Ÿ”น JIRA Estimation & Risk Prediction

ML model predicting story points, impacted components, and risk levels using historical JIRA data and NLP โ€” improved sprint planning by 20%.

๐Ÿ”น Smart Invoice Query Assistant (RAG + LangChain)

Real-time assistant using LangChain + RAG for invoice-related queries. Integrated document loaders, chunking, embeddings, and vector search for fast, human-like responses. Included intent detection and adaptive learning.

๐Ÿ”น OCR Automation for Finance

Improved invoice/cheque OCR pipeline using DL for handwritten and printed text โ€” reduced manual processing by 25%.

๐Ÿ”น Real-Time Anomaly Monitoring Dashboard

Built a predictive alert dashboard with real-time updates, anomaly detection, and visual trends โ€” cut incident response time by 30%.


๐Ÿ“š Research Papers & Presentations

  • ๐Ÿง  Explainable Sentiment Analysis for Financial News โ€“ ICIEM 2023
  • ๐ŸŒ NMT for Endangered Languages (Angika) โ€“ IIT Patna, 2023
  • ๐Ÿ’Š Generative AI for Drug Recommendations โ€“ SHADG 2024 (IIT Kharagpur)
  • ๐Ÿ“Š LLMs & DPT Models for Sentiment Analysis โ€“ NIT Patna 2024
  • ๐Ÿ” Facial Image Encryption with DCGANs โ€“ M.Tech Project, IIT Jodhpur 2023

๐Ÿง‘โ€๐Ÿ’ผ Leadership Snapshot

  • Led cross-functional AI/ML teams across geographies for complex delivery programs
  • Acted as Agile Scrum Master โ€” driving velocity, alignment, and outcomes
  • Mentored early-career engineers in Python, ML, NLP, and deployment workflows
  • Modernized legacy platforms by embedding AI/ML pipelines
  • Defined roadmaps and aligned data science goals to business strategies

โ€œAI is not just about intelligence โ€” it's about accessibility, ethics, and trust-driven innovation.โ€
โ€” Raj Kumar

Pinned Loading

  1. Angika-LowResource-Translator Angika-LowResource-Translator Public

    Angika-LowResource-Translator is a neural machine translation model designed to translate between Angika and English. Built for low-resource language support, it leverages fine-tuned transformers tโ€ฆ

    Jupyter Notebook

  2. SmartLendXAI-AI-Powered-Lending-Decisions-for-Early-Stage-Retailers SmartLendXAI-AI-Powered-Lending-Decisions-for-Early-Stage-Retailers Public

    SmartLendXAI is an AI-based loan approval prediction system for early-stage retailers with 2โ€“3 years of business history. It uses multiple ML algorithms to assess credit risk and integrates Explainโ€ฆ

    Jupyter Notebook

  3. Thermal-Video-Red-Zone-Detection-and-Alerting-System Thermal-Video-Red-Zone-Detection-and-Alerting-System Public

    This project checks thermal or surveillance videos to find areas with too much red color, which can mean heat or fire. It marks those areas, saves alert times, and shows a graph of how red changes โ€ฆ

    Jupyter Notebook

  4. Video-Data-Processing-with-Python-and-OpenCV Video-Data-Processing-with-Python-and-OpenCV Public

    Video-Data-Processing-with-Python-and-OpenCV is a modular project for automated annotation and visualization of video data. It reads video frame-by-frame, overlays bounding boxes and category labelโ€ฆ

    Jupyter Notebook