Skip to content

rashidraihan/Hybrid-Explainable-Framework-for-Stroke-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Hybrid Explainable Framework for Stroke Prediction

Hybrid Explainable Framework for Stroke Prediction is an advanced AI-powered system for early stroke prediction that combines machine learning, deep learning, and explainable AI techniques. This comprehensive framework enhances predictive accuracy while ensuring clinical interpretability for healthcare applications.

⚠️ Notice: This project is currently under publication. Only the web-based interface and essential components are included for demonstration purposes. Full model architecture, dataset preprocessing scripts, and training configurations will be released post-publication.

πŸ“Š Data Analysis & Exploratory Data Analysis

Class Distribution Analysis

target_distribution

The dataset exhibits significant class imbalance with only 4.9% stroke-positive cases, necessitating advanced resampling techniques for robust model training.

Univariate Distributions by Stroke Status

age_distribution_by_stroke

Age Analysis: Stroke-positive individuals show distinct age distribution patterns, with higher risk observed in older demographic groups.

bmi_distribution_by_stroke

BMI Analysis: Body Mass Index distributions reveal subtle differences between stroke and non-stroke cases, informing feature engineering strategies.

avg_glucose_level_distribution_by_stroke

Glucose Level Analysis: Average glucose levels demonstrate significant variations between stroke-positive and negative cases, highlighting its importance as a clinical predictor.

πŸ“ˆ Model Performance Evaluation

Comprehensive Model Comparison

metrics_grid

Multi-faceted evaluation across 12 machine learning algorithms demonstrating performance trade-offs across accuracy, F1-score, precision, recall, ROC-AUC, and PR-AUC metrics.

ROC Curve Analysis

image

Receiver Operating Characteristic curves showing strong discriminative power across all evaluated models, with consistent performance across different classification thresholds and excellent area under curve values.

πŸ”¬ Explainable AI (XAI)

Integrated Feature Importance Analysis

shap_lime_tabnet_fn_approx_final_labeled_quantile

Combined SHAP and LIME analysis revealing key clinical features contributing to stroke prediction, with age and glucose levels emerging as dominant risk factors in model decision-making.

πŸ—οΈ Technical Architecture

End-to-End Prediction Pipeline

Our framework implements a comprehensive workflow:

  1. Data Preprocessing: Handling class imbalance, missing values, and feature normalization
  2. Feature Engineering: Clinical feature transformation and selection
  3. Hybrid Modeling: Machine learning and deep learning ensemble approaches
  4. Model Interpretation: Explainable AI techniques for clinical transparency
  5. Performance Validation: Comprehensive evaluation across multiple metrics

πŸ”§ Key Features

  • πŸ€– Hybrid AI Approach: Combines traditional machine learning and modern deep learning models
  • πŸ” Explainable Predictions: Transparent feature importance for clinical trust
  • πŸ”„ Robust Preprocessing: Advanced handling of class imbalance and data quality
  • πŸ“Š Comprehensive Evaluation: Multi-metric assessment across diverse algorithms
  • πŸ₯ Clinical Relevance: Domain-informed feature selection and interpretation
  • ⚑ Scalable Architecture: Modular design for healthcare integration

πŸ› οΈ Technical Stack

  • Programming: Python 3.8+
  • Machine Learning: Scikit-learn, XGBoost, LightGBM
  • Deep Learning: PyTorch-based architectures
  • Explainable AI: SHAP, LIME for model interpretability
  • Visualization: Matplotlib, Seaborn, Plotly
  • Data Processing: Pandas, NumPy for efficient data manipulation
  • Model Optimization: Advanced hyperparameter tuning techniques

πŸ“ Repository Structure

Hybrid-Stroke-Prediction/
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ data_preprocessing/       # Data cleaning and preparation pipelines
β”‚   β”œβ”€β”€ feature_engineering/      # Feature selection and transformation
β”‚   β”œβ”€β”€ models & evaluation/      # Model implementations & performance assessment
β”œβ”€β”€ config/                       # Model and experiment configurations
β”œβ”€β”€ tests/                        # Comprehensive test suites
β”œβ”€β”€ requirements.txt              # Project dependencies
└── README.md                     # Project documentation

🎯 Research Contributions

1. Clinical AI Integration

Bridges the gap between high-performance AI models and clinical practicality through interpretable and actionable predictions.

2. Advanced Modeling Strategy

Implementation of both traditional machine learning algorithms and modern deep learning architectures for comprehensive predictive performance.

3. Explainable Healthcare AI

Transparent model reasoning enabling clinical validation and trust in AI-assisted decision making.

4. Robust Evaluation Framework

Multi-dimensional assessment across accuracy, sensitivity, specificity, and clinical relevance metrics.

πŸ“Š Performance Highlights

  • Strong Predictive Performance: Comprehensive model evaluation demonstrating reliable stroke prediction capabilities across multiple algorithms
  • Clinical Interpretability: Transparent feature importance analysis aligning with medical domain knowledge
  • Robust Generalization: Consistent performance across different validation strategies and data splits
  • Scalable Architecture: Modular design suitable for integration with healthcare systems

πŸ”¬ Methodology Overview

Our systematic approach encompasses:

  1. Comprehensive Data Analysis: In-depth exploratory data analysis to understand feature distributions and relationships
  2. Advanced Feature Engineering: Domain-informed transformations and selection techniques
  3. Diverse Model Development: Implementation of multiple machine learning and deep learning approaches
  4. Rigorous Evaluation: Multi-faceted assessment including performance metrics and model interpretability
  5. Clinical Validation: Framework designed for healthcare professional review and practical application

Β© 2025 Raihan Rashid. All rights reserved.

About

AI-powered system for early stroke prediction that combines machine learning, deep learning, and explainable AI techniques. This comprehensive framework enhances predictive accuracy while ensuring clinical interpretability and fairness in healthcare applications.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors