Skip to content

vanshdhiman090/bank_churn_analysis_with_python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Project Overview

This project focuses on predicting customer churn in a retail bank using machine learning techniques. Churn — when customers leave the bank — directly impacts profitability. By analyzing customer demographics, account information, and behavioral patterns, this project aims to identify the most influential features contributing to churn and build predictive models to assist in retention strategies.

Dataset – The analysis is based on a dataset of 10,000 customers with features such as:

Age
Gender
Balance
Credit Score
Tenure
Number of Products
Geography
Estimated Salary
Churn Label (Exited/Stayed)

Tools & Libraries Used

Python (Jupyter Notebook)
Pandas, NumPy – Data manipulation & preprocessing
Matplotlib, Seaborn – Visualization
Scikit-learn – Machine learning models & evaluation
Imbalanced-learn – Handling class imbalance

Workflow

Data Exploration & Cleaning
Checked missing values, duplicates, and outliers
Handled categorical encoding and scaling
Exploratory Data Analysis (EDA)
Visualized churn distribution and key customer features
Identified relationships between churn and variables (e.g., age, balance, geography)
Feature Engineering
Encoded categorical features
Normalized/standardized numerical features

Modeling

Trained multiple ML models (Logistic Regression, Random Forest, XGBoost, etc.)
Evaluated with accuracy, precision, recall, F1-score, and ROC-AUC
Insights & Recommendations
Highlighted most important features driving churn
Suggested strategies for customer retention

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors