Skip to content

AnkushOO7/loan-default-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

11 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🏦 Loan Default Prediction

A machine learning project to predict whether a client will default on a loan based on their financial and demographic data. Built for a hackathon using real-world NBFC (Non-Banking Financial Company) data.


πŸ” Problem Statement

The goal is to develop a classification model that predicts the probability of a customer defaulting on a loan. This is a critical task for financial institutions to reduce risk and improve credit decision-making.


πŸ“Š Dataset Overview

The dataset includes features such as:

  • Customer demographics (e.g., age, gender, education)
  • Loan characteristics (e.g., loan amount, tenure, interest rate)
  • Financial indicators (e.g., income, credit history)

⚠️ Dataset not included due to policy. If you are a recruiter or reviewer, please contact me or use synthetic data for testing.


🧠 Project Workflow

  1. Data Preprocessing

    • Handling missing values
    • Label encoding / one-hot encoding
    • Scaling numerical features
  2. Exploratory Data Analysis (EDA)

    • Distribution analysis
    • Correlation heatmaps
    • Class balance check
  3. Model Building

    • XGBoost Classifier with tuning
    • 5-Fold Stratified Cross-Validation
  4. Model Evaluation

    • Accuracy, Precision, Recall, F1-Score
    • ROC-AUC Score
    • Confusion Matrix

πŸ“ˆ Model Performance

Metric Score
Cross-Validated Accuracy XX%
ROC-AUC Score XX%
Precision / Recall / F1 XX% / XX% / XX%

Final model: XGBClassifier(n_estimators=200, learning_rate=0.05, max_depth=4)


πŸ“¦ Tech Stack

  • Python 3.x
  • Pandas, NumPy
  • Scikit-learn
  • XGBoost
  • Matplotlib / Seaborn

πŸš€ Future Improvements

  • Address class imbalance (SMOTE, ADASYN, or scale_pos_weight)
  • Hyperparameter tuning (GridSearch / Optuna)
  • Feature importance analysis
  • Streamlit dashboard for predictions

🧾 License

This project is open-source under the MIT License.


πŸ™‹β€β™‚οΈ Author

Ankush Waghmare πŸ“§ ankush.waghmare3804@gmail.com

About

The goal of the problem is to predict whether a client will default on the loan payment or not.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors