Product Data Science | Retention Analytics | Machine Learning | Experimentation Strategy | Revenue Optimization
Customer churn directly impacts revenue, growth, and long-term product sustainability. This project approaches churn not just as a classification problem โ but as a product retention and revenue optimization challenge.
The goal is to:
- Identify high-risk churn segments
- Understand behavioral and contractual drivers
- Quantify business impact
- Recommend product-level interventions
- Enable experimentation-driven retention strategies
This mirrors how churn is handled in SaaS, FinTech, HealthTech, and Telecom product organizations.
Subscription-based products experience revenue leakage when customers cancel early.
- Key Product Questions:
- Which customers are most likely to churn?
- What signals predict churn behavior?
- When is the highest-risk churn window?
- What retention experiment should be prioritized?
- Retention Rate / Active Subscription Rate
Supporting Metrics:
- Monthly Recurring Revenue (MRR)
- Customer Churn Rate
- Customer Lifetime Value (CLV)
- Average Revenue Per User (ARPU)
- Tenure Distribution
- Contract Conversion Rate
Assume:
- 10,000 active customers
- 26% churn rate
- $70 average monthly revenue
- A 5% reduction in churn leads to significant annual revenue preservation.
This project provides predictive modeling to enable proactive intervention before churn occurs.
Key findings from EDA:
- Month-to-month contracts have the highest churn probability
- Short-tenure customers churn within the early lifecycle stage
- Higher monthly charges correlate with increased churn
- Electronic check payment method shows elevated churn behavior
These insights inform targeted retention strategies.
Models Evaluated:
- Logistic Regression
- Random Forest
- Decision Tree
- Gradient Boosting (if applicable)
Evaluation Metrics:
- Accuracy
- Precision
- Recall
- F1 Score
- ROC-AUC
The selected model achieved strong ROC-AUC performance and improved churn risk identification.
Top predictors of churn:
- Contract Type
- Tenure
- Monthly Charges
- Internet Service
- Payment Method
Interpretation:
Churn behavior is influenced primarily by lifecycle stage and contract structure rather than demographics โ indicating product and pricing optimization opportunities.
- Instead of stopping at prediction, this project proposes actionable experiments:
1๏ธโฃ Contract Conversion Incentive
- Offer discounted annual plans to month-to-month customers.
Hypothesis:
- Customers transitioning to long-term contracts will reduce churn probability.
2๏ธโฃ Early Lifecycle Engagement Experiment
- Target customers within the first 90 days with onboarding nudges.
Hypothesis:
- Improved onboarding engagement increases long-term retention.
3๏ธโฃ High-Value Customer Retention Offer
- Provide personalized loyalty offers to high ARPU customers flagged as high churn risk.
Hypothesis:
- Targeted retention incentives preserve revenue efficiently.
All experiments can be validated using A/B testing frameworks.
- Data Cleaning & Preprocessing
- Exploratory Data Analysis (EDA) with business interpretation
- Feature Engineering (contract type, tenure segmentation, billing patterns)
- Encoding & Scaling
- Train-Test Split
- Model Training (Logistic Regression, Random Forest, etc.)
- Model Evaluation (ROC-AUC, Precision, Recall, F1)
- Feature Importance Analysis
- Product-Level Interpretation & Strategy Recommendation
- Python
- Pandas
- NumPy
- Scikit-learn
- Matplotlib
- Seaborn
- Jupyter Notebook
- Streamlit (optional deployment)
Customer-Churn/
โ
โโโ data/
โโโ notebooks/
โโโ models/
โโโ app.py
โโโ requirements.txt
โโโ README.md
- If implemented in production, this system enables:
- Proactive churn detection
- Targeted retention campaigns
- Revenue preservation
- Experiment-driven product decisions
- Lifecycle-based segmentation
Even a 3โ5% reduction in churn materially increases annual recurring revenue and customer lifetime value.
- SHAP-based model explainability
- Survival analysis for churn timing
- Uplift modeling for retention targeting
- Real-time churn scoring API
- Automated A/B testing simulator integration
- Deployment with FastAPI
- Product Data Science
- Customer Retention Analytics
- Churn Modeling
- Predictive Modeling
- Feature Engineering
- Machine Learning
- Experiment Design
- Revenue Optimization
- Business Analytics
- Python
Clone the repository:
- git clone https://github.com/Denis0242/Customer-Churn.git
- cd Customer-Churn
Install dependencies:
- pip install -r requirements.txt
Run the notebook:
- jupyter notebook
If using Streamlit (optional):
- streamlit run app.py
Denis Agyapong
Product Data Scientist | Data Analyst
Oakland, CA