Scoring Banking Leads

Miguel Chacón

Content

Project Description
Hypotheses / Questions
Dataset
Analysis
Cleaning
Model Training and Evaluation
Deliverables
Links

Project Description

In this project, I use Machine Learning with Python to create a lead scoring model, leading a 50% increase in conversion rate. It is meant to show my understanding of:

Machine Learning concepts, pipelines and model building & tuning.
Digital Marketing and conversion rate optimization.
Python, Pandas, and Scikit-learn.
Modeling and forecasting of business processes.

Business Question

The client, a commercial bank, wants to improve its sales without hiring more sales representatives. I use Business Analytics frameworks to approach the question and deliver a solution.

The lead scoring model I delivered reduces the proportion of leads that are contacted while simultaneously increasing the conversion rate. All else equal, this will allow further lead-generation campaigns to increase their lead volume, leading to an increase in revenue and, thus, profitability.

Dataset

The dataset consists of 45,000 instances of phone calls made by sales representatives during a sales campaign.

Analysis

Overview the general steps you went through to analyze your data in order to test your hypothesis.
Document each step of your data exploration and analysis.
Include charts to demonstrate the effect of your work.
If you used Machine Learning in your final project, describe your feature selection process.

Cleaning

Dimensionality reduction achieved by merging categories.
Columns with multicollinearity were dropped.
Categorical columns were encoded.
The target variable was changed to a numerical type.

Model Training and Evaluation

Tested 46 combinations of 8 algorithms and 6 scalers with default settings to find the combination with the highest recall score.
Selected a Random Forest Classifier and tested 100 parameter combinations with a 3-fold cross-validation for each, optimizing for Recall. The resulting model had 65% higher score than the model with default settings.
To evaluate the model, I took a random sample of 15 leads and made educated guesses on whether they would convert, comparing my results against the model. The model's predictions led to a 50% higher conversion rate.

Deliverables

A Machine Learning model that allows new leads to be assigned a score between 1 and 10 based on their likelihood to convert to sales, allowing the sales team to prioritize these for calls.
A PPT presentation detailing the business question, the model as a solution, and the results.

Links

Repository
Presentation

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Data		Data
Notebook.ipynb		Notebook.ipynb
Presentation.pptx		Presentation.pptx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scoring Banking Leads

Content

Project Description

Business Question

Dataset

Analysis

Cleaning

Model Training and Evaluation

Deliverables

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Scoring Banking Leads

Content

Project Description

Business Question

Dataset

Analysis

Cleaning

Model Training and Evaluation

Deliverables

Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages