Skip to content
View mittalchauhan's full-sized avatar

Block or report mittalchauhan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mittalchauhan/README.md

🧠 Data Science Research & Strategic Portfolio

Welcome to my centralized research hub. This repository contains specialized projects focusing on Multi-Modal Risk Fusion, Transformer Architectures, Explainable AI (XAI), and Strategic Market Analytics.

My work focuses on bridging foundational machine learning research with actionable corporate strategy, emphasizing high-fidelity diagnostics and evidence-based decision-making.


Project Strategic Matrix

Rank Project Name Research Category Primary Tech Stack Repository Link
1 Sentinel Gold Multi-Modal Risk Fusion Spark, Kafka, GPT-2 View Repo
2 ELS-Pulse Heuristic NLP & Brand Audit Ensemble ML, Streamlit View Repo
3 InsuraPulse Explainable AI (XAI) SHAP, Scikit-Learn View Repo
4 QuantPro Signal Processing yfinance, Pandas View Repo
5 MoviePulse Latent Factor Modeling SVD, NLP, TF-IDF View Repo
6 RetailPulse Association Theory Apriori, FP-Growth View Repo
7 Consumer Audit Consensus Methodology Flask, Scikit-Learn View Repo
8 LLM Foundations Transformer Theory PyTorch, Tiktoken View Repo
9 Airline Fleet Planning Strategic CAPEX Analysis Power BI, Power Query View Repo
10 Shopping Trends Behavioral Economics Power BI, DAX View Repo

🔬 Selected Research Highlights

🛡️ Sentinel Gold: Cyber-Physical Fusion

  • Strategic Problem: Modern security suffers from disconnected digital threat feeds and physical surveillance silos.
  • Research Solution: A fusion engine correlating high-velocity Kafka indicators with live CCTV telemetry via PySpark.
  • Outcome: Leverages GPT-2 to generate automated executive summaries, bridging the gap between technical telemetry and management-level reporting.

🧠 ELS-Pulse: Sarcasm-Aware Brand Intelligence

  • Strategic Problem: Linguistic nuances like sarcasm cause "sentiment flipping," leading to high error rates in automated brand audits.
  • Research Solution: A Voting Ensemble (87.1% accuracy) utilizing custom sarcasm-aware heuristics to identify linguistic nuances.
  • Outcome: Provides high-fidelity sentiment auditing through Neural Probability Heatmapping and TF-IDF feature significance analysis.

📊 InsuraPulse: Explainable AI for Risk

  • Strategic Problem: The "Black Box" nature of complex regression models prevents actuarial transparency and stakeholder trust.
  • Research Solution: Applied SHAP (SHapley Additive exPlanations) to isolate and interpret non-linear drivers in medical premiums.
  • Outcome: Delivers transparent, auditable models by normalizing skewed residual patterns through Log-Linear transformations.

📈 QuantPro: Quantitative Signal Processing

  • Strategic Problem: Financial research requires low-latency pipelines capable of extracting signals from high-velocity data.
  • Research Solution: A robust pipeline utilizing real-time yfinance data to monitor market volatility and price-action correlations.

🧪 LLM-Foundations: Transformer Deconstruction

  • Strategic Problem: Verifying the mathematical mechanics, convergence stability, and latent organization of modern LLM architectures.
  • Research Solution: A ground-up PyTorch implementation covering BPE tokenization, Multi-head Self-Attention, and Layer Normalization.
  • Validation: Verified through mathematical monitoring of Perplexity and Loss, ensuring foundational architectural stability.

🛠️ Technical Competencies

  • Modeling: Transformer Architecture, Matrix Factorization (SVD), Ensemble Learning, BPE Tokenization.
  • Statistics: Explainable AI (SHAP), Z-Score/Lift Analysis, Neural Probability Mapping, Feature Significance (TF-IDF).
  • Engineering: Apache Spark, Kafka, Real-time APIs (yfinance), Power Query (M/ETL), Flask.

🔗 Connect & Collaborate

Pinned Loading

  1. mittalchauhan mittalchauhan Public

  2. SENTINEL-GOLD-Multi-Modal-Cyber-Physical-Threat-Fusion-Engine SENTINEL-GOLD-Multi-Modal-Cyber-Physical-Threat-Fusion-Engine Public

    SENTINEL GOLD 🛡️: multi-modal Cyber-Physical Threat Fusion Engine

    Python

  3. ELS-PULSE-SARCASM-AWARE-SENTIMENT-TERMINAL-AND-BRAND-CASE-STUDIES ELS-PULSE-SARCASM-AWARE-SENTIMENT-TERMINAL-AND-BRAND-CASE-STUDIES Public

    Sarcasm-aware sentiment analysis system using ensemble machine learning for real-time brand monitoring, competitor intelligence, and explainable NLP diagnostics.

    Python

  4. QUANTPRO-REAL-TIME-STOCK-INTELLIGENCE-PIPELINE QUANTPRO-REAL-TIME-STOCK-INTELLIGENCE-PIPELINE Public

    An Institutional-grade Real-Time Stock Intelligence Pipeline. Features live equity tracking, Monte Carlo price projections, Risk Matrix analytics and Neural Engine drift monitoring for quantitative…

    JavaScript

  5. INSURA-PULSE-DYNAMIC-PREMIUM-REGRESSION-CASE-STUDY INSURA-PULSE-DYNAMIC-PREMIUM-REGRESSION-CASE-STUDY Public

    INSURA-PULSE: A high-performance Insurance premium regression suite featuring a multi-engine terminal and outlier diagnostics along with real-time MAE benchmarking

    Python

  6. LLM-Foundations-GPT-From-Scratch LLM-Foundations-GPT-From-Scratch Public

    Foundational GPT-style Large Language Model (LLM) pipeline

    Python