Skip to content

Denis0242/Telecom_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Telecom_Analysis

🖼️ Dashboard Preview

Telecom Analytics Dashboard

📊 Telecom Customer Analytics Dashboard

Python Streamlit Plotly Scikit-Learn Pandas

A Product Data Science–oriented telecom analytics project that transforms telecom usage data into actionable insights across customer overview, engagement, experience, and satisfaction workflows.


🚀 Project Summary

This project analyzes telecom customer behavior using Python, exploratory data analysis, feature engineering, clustering, and an interactive Streamlit dashboard.

It is designed to communicate the kind of workflow a Product Data Scientist would use to:

  • understand user behavior - profile customer engagement

  • explore experience-related metrics - segment users into meaningful groups

  • present insights through a clean interactive dashboard

  • The repository includes:

  • a Streamlit app for dashboard delivery

  • data files for cleaned and intermediate analysis outputs

  • notebooks for analysis work

  • Python scripts for cleaning and helper utilities


🎯 Product Data Science Angle

This project is positioned as a product analytics + behavioral segmentation case study in the telecom domain. Key product questions this project helps answer:

  • Which users are most active and valuable?
  • How does usage differ across apps like YouTube, Google, Netflix, Email, Gaming, and Social Media?
  • Which users show high engagement versus low engagement?
  • How can clustering help identify meaningful behavioral segments?
  • How can telecom experience and satisfaction analysis be extended into retention and churn use cases?

🧩 Core Analysis Modules

1. User Overview Analysis

  • dataset quality checks
  • missing value analysis
  • handset manufacturer and handset type exploration
  • user/session distribution
  • total usage patterns across application categories

2. User Engagement Analysis

  • sessions per user
  • total duration per user
  • total traffic per user
  • top users by usage
  • outlier detection
  • KMeans clustering for engagement segmentation
  • application-level usage comparison

3. Experience Analysis

  • service quality indicators
  • latency and reliability metrics
  • friction analysis
  • user experience scoring

4. Satisfaction Analysis

  • NPS-style metrics
  • churn signals
  • retention storytelling
  • satisfaction driver analysis

🏗️ Project Structure

Telecom_Analysis/
├── Data/
├── Notebooks/
├── scripts/
├── app.py
├── preview.png
├── requirements.txt
└── README.md

🛠️ Tech Stack

Core:

  • Python
  • Pandas, NumPy
  • Scikit-learn

Visualization:

  • Plotly
  • Matplotlib
  • Streamlit

Workflow:

  • Jupyter Notebooks
  • GitHub
  • Virtual Environments (uv / pip)

⚙️ Quick Start

Clone the repository

Create and activate a virtual environment

  • uv venv --python 3.11

Windows

  • .venv\Scripts\activate

Mac/Linux

  • source .venv/bin/activate

Install dependencies

  • pip install -r requirements.txt

Run the dashboard

  • streamlit run app.py

📊 Dashboard Highlights

  • loading the default dataset from Data/cleaned_data.csv
  • optional upload of CSV or Excel telecom datasets
  • dataset overview metrics
  • missing value inspection
  • handset analysis
  • user behavior analysis
  • application usage analysis
  • engagement distributions
  • outlier detection
  • engagement clustering using KMeans

📊 Multi-page Dashboard Flow

  • User Overview Analysis
  • User Engagement Analysis
  • Experience Analysis
  • Satisfaction Analysis

📁 Available Data Assets

  • cleaned_data.csv — main cleaned dataset used by the dashboard
  • data.csv — original telecom dataset
  • user_engagement.csv — engagement-focused data
  • user_experience_metrics.csv — experience-related features
  • Week1_challenge_data_source.xlsx — source dataset

This shows a full pipeline: raw data → cleaned data → analysis → dashboard.


🧠 Modeling and Analytics Approach

  • Exploratory Data Analysis
  • profiling user behavior
  • identifying missing data
  • comparing usage across app categories
  • Feature Aggregation
  • session counts
  • total duration
  • total traffic
  • app-level usage
  • Behavioral Segmentation
  • scaling and normalization
  • KMeans clustering
  • elbow method
  • Dashboard Communication
  • business-readable metrics
  • visual storytelling
  • interactive exploration

💼 Why This Project Matters

  • translates raw telecom data into business insights
  • applies product thinking to analytics
  • builds interactive dashboards
  • combines EDA + ML + storytelling
  • demonstrates end-to-end workflow

Relevant Roles:

  • Product Data Scientist
  • Product Analyst
  • Customer / Growth Analyst
  • Behavioral Analytics

🔭 Suggested Future Enhancements

  • churn prediction
  • retention cohorts
  • feature importance (SHAP)
  • KPI layer (AARRR, North Star)
  • A/B testing
  • experience analytics
  • Streamlit Cloud deployment

🤝 Author

Denis Agyapong

Product Data Science / Data Analyst

About

Telecom customer analytics project exploring churn risk, usage behavior, and revenue patterns using Python, EDA, and machine learning to generate actionable business insights.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors