Skip to content

fragan7dsouza/customer-segmentation-ml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧠 Customer Segmentation using Machine Learning (K-Means Clustering)

This project focuses on segmenting customers based on their purchasing behavior using the K-Means clustering algorithm. The dataset used is a real-world online retail dataset, and the project walks through data cleaning, feature engineering, clustering, and visualization.

πŸš€ What This Project Does

  • Cleans and preprocesses transactional data
  • Aggregates customer-level features: Frequency, Quantity, Total Spend
  • Applies standard scaling to normalize data
  • Uses the Elbow Method to find the optimal number of clusters
  • Performs K-Means clustering to group customers
  • Visualizes the customer segments using PCA (Principal Component Analysis)

πŸ“Š Tech Stack

  • Python
  • Pandas
  • Scikit-learn
  • Matplotlib
  • Seaborn
  • Google Colab / Jupyter Notebook

πŸ“‚ Files

  • Customer_Segmentation.ipynb: The main notebook containing the complete pipeline
  • Online Retail.xlsx: The dataset used is included as Online Retail.xlsx
  • README.md: You're reading it 😊

πŸ“Έ Sample Outputs

  • Elbow Curve to choose k
  • PCA scatterplot showing customer clusters

🧠 What I Learned

  • Data cleaning and preprocessing
  • Unsupervised machine learning with K-Means
  • Dimensionality reduction using PCA
  • How to interpret and present clustering results

πŸ“¬ Contact

Built during my AIML internship at Dlithe.
Feel free to reach out or connect on LinkedIn (www.linkedin.com/in/fragan-d-souza-64626a29b)

About

Segmented customers using K-Means clustering on retail transaction data. Engineered features like frequency, quantity, and spend per customer. Visualized segments using PCA for business insights. Built during my AIML internship using Python, Pandas, and Scikit-learn.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors