CREDIT CARD DEFAULT PREDICTION AND BEHAVIOUR SCORING USING MACHINE LEARNING

Overview: This project builds a machine learning system that identifies high-risk credit card customers from financial, transactional, and credit-bureau data. It covers data exploration, missing-value imputation, feature engineering, class-imbalance handling, and a comparison of multiple models, ending with a tuned ensemble classifier for flagging potential defaults or non-compliance cases.

Key Steps:

Data Understanding & EDA

Explored 140K+ records across on-us, transaction, and bureau attributes.
Identified missing values, outliers, and key correlated behavioral features.
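
The checks above can be sketched roughly as follows. This is only an illustration on synthetic data: the column names (`credit_limit`, `txn_count`, `default_flag`), the helper `eda_summary`, and the 1.5 × IQR outlier rule are assumptions, not taken from the project notebook.

```python
import numpy as np
import pandas as pd

def eda_summary(df: pd.DataFrame, target: str) -> pd.DataFrame:
    """Per-feature missing rate, IQR outlier rate, and correlation with the target."""
    features = df.drop(columns=[target]).select_dtypes(include="number")
    q1, q3 = features.quantile(0.25), features.quantile(0.75)
    iqr = q3 - q1
    # Flag values lying more than 1.5 * IQR outside the quartiles (assumed rule).
    outlier = features.lt(q1 - 1.5 * iqr, axis=1) | features.gt(q3 + 1.5 * iqr, axis=1)
    summary = pd.DataFrame({
        "missing_rate": features.isna().mean(),
        "outlier_rate": outlier.mean(),
        "corr_with_target": features.corrwith(df[target]),
    })
    # Strongest absolute correlations first.
    return summary.sort_values("corr_with_target", key=np.abs, ascending=False)

# Synthetic stand-in data; the real dataset has 140K+ records.
rng = np.random.default_rng(0)
demo = pd.DataFrame({
    "credit_limit": rng.normal(50_000, 10_000, 500),
    "txn_count": rng.poisson(20, 500).astype(float),
})
demo.loc[::10, "txn_count"] = np.nan  # inject 10% missing values
demo["default_flag"] = (rng.random(500) < 0.1).astype(int)

summary = eda_summary(demo, "default_flag")
print(summary)
```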

Missing Value Imputation

Compared MICE (Multiple Imputation by Chained Equations) and MCMC (Markov chain Monte Carlo) imputation.
MCMC was chosen for better multivariate consistency.
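
For orientation, here is a minimal sketch of the MICE side of that comparison, assuming scikit-learn's `IterativeImputer` as the chained-equations implementation. The MCMC imputer the project selected is not shown, since the source does not say which library was used, and the data below is synthetic.

```python
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (enables IterativeImputer)
from sklearn.impute import IterativeImputer

rng = np.random.default_rng(0)

# Synthetic stand-in: two strongly correlated numeric features.
x1 = rng.normal(size=300)
X_full = np.column_stack([x1, 2.0 * x1 + rng.normal(scale=0.1, size=300)])

# Hide roughly 20% of the second column.
X_missing = X_full.copy()
mask = rng.random(300) < 0.2
X_missing[mask, 1] = np.nan

# Chained-equations imputation: each feature is regressed on the others in turn.
imputer = IterativeImputer(max_iter=10, random_state=0)
X_imputed = imputer.fit_transform(X_missing)

# Error on the hidden cells; small here because the columns are nearly collinear.
rmse = np.sqrt(np.mean((X_imputed[mask, 1] - X_full[mask, 1]) ** 2))
print(f"RMSE on hidden cells: {rmse:.3f}")
```

Comparing such an error on artificially hidden cells is one plausible way to rank imputers, though the criterion the project actually used is described only as "multivariate consistency".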

Feature Engineering

AutoFE-based transformations, PCA, and interaction features.
Feature ranking via ANOVA F-test + RFE.
Final: 282 optimized features.
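
The ranking step could look something like the sketch below, which chains an ANOVA F-test filter with RFE in scikit-learn. The feature counts (30 → 15 → 8), the logistic-regression estimator inside RFE, and the synthetic data are illustrative assumptions; the project's own settings, which produced 282 features, are not given here.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE, SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the engineered feature matrix.
X, y = make_classification(n_samples=500, n_features=30, n_informative=5, random_state=0)

selector = Pipeline([
    ("scale", StandardScaler()),
    # Stage 1: keep the 15 features with the highest ANOVA F-statistic.
    ("anova", SelectKBest(score_func=f_classif, k=15)),
    # Stage 2: recursively drop the weakest features until 8 remain.
    ("rfe", RFE(LogisticRegression(max_iter=1000), n_features_to_select=8)),
])
X_selected = selector.fit_transform(X, y)
print(X_selected.shape)
```

Running the cheap univariate filter first keeps the more expensive recursive elimination tractable when the candidate set is large.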

Class Imbalance Handling

Tested SMOTE, ADASYN, Tomek Links, NearMiss, and no sampling.
Best approach: No sampling + Logit Shift, threshold tuned using Youden’s J.
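
A rough sketch of that final approach follows. It assumes "Logit Shift" means adding a constant offset to the model's log-odds, which is one common reading; the source does not define it. The helper names `logit_shift` and `youden_threshold` and the toy scores are hypothetical.

```python
import numpy as np
from sklearn.metrics import roc_curve

def logit_shift(proba, shift):
    """Add a constant `shift` to the log-odds of each predicted probability."""
    proba = np.clip(np.asarray(proba, dtype=float), 1e-12, 1 - 1e-12)
    logits = np.log(proba / (1 - proba)) + shift
    return 1.0 / (1.0 + np.exp(-logits))

def youden_threshold(y_true, scores):
    """Return the score threshold that maximises Youden's J = TPR - FPR."""
    fpr, tpr, thresholds = roc_curve(y_true, scores)
    return thresholds[np.argmax(tpr - fpr)]

# Toy labels and predicted default probabilities (not project data).
y_true = np.array([0, 0, 0, 0, 1, 0, 1, 1])
scores = np.array([0.05, 0.10, 0.20, 0.30, 0.35, 0.40, 0.60, 0.80])

# Shift the log-odds upward (assumed offset), then tune the cut-off on the shifted scores.
shifted = logit_shift(scores, shift=np.log(3))
threshold = youden_threshold(y_true, shifted)
y_pred = (shifted >= threshold).astype(int)
print(threshold, y_pred)
```

Because the shift is monotonic, it changes the probabilities but not their ranking, so ROC-AUC is unaffected; only the operating point chosen by the threshold moves.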

Modeling

Models trained: XGBoost, CatBoost, LightGBM, Logistic Regression, Random Forest, SVM, Decision Trees.
Best model: Voting Classifier
Performance: ROC-AUC 0.7832, Recall 0.23
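
As an illustration of the ensemble step, the sketch below builds a soft-voting classifier from three scikit-learn models and reports the same two metrics. It is not the project's configuration: the base models are limited to ones bundled with scikit-learn (the boosted models listed above are left out), the data is synthetic with an assumed 10% positive rate, and the 0.5 cut-off is a placeholder.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Imbalanced synthetic data standing in for the default-prediction set.
X, y = make_classification(n_samples=2000, n_features=20, weights=[0.9, 0.1], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Soft voting averages the predicted probabilities of the base models.
ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
        ("dt", DecisionTreeClassifier(max_depth=5, random_state=0)),
    ],
    voting="soft",
)
ensemble.fit(X_train, y_train)

proba = ensemble.predict_proba(X_test)[:, 1]
auc = roc_auc_score(y_test, proba)
recall = recall_score(y_test, (proba >= 0.5).astype(int))
print(f"ROC-AUC: {auc:.4f}  Recall: {recall:.2f}")
```

Soft voting requires every base model to expose `predict_proba`, which is why an SVM would need `probability=True` to take part in such an ensemble.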

Repository files (27 commits):

Convolve-Report.pdf
README.md
notebook.ipynb

NewsSentiment12/bruh
