GitHub - Akhil-krishnan-r/eCommerce-Transactions-Dataset-Assignment: This project focuses on analyzing customer and transaction data for better segmentation and targeted marketing. Key Features: EDA: Analysis of customer behavior and product trends. Lookalike Model: Recommends top 3 similar customers based on their profiles and transaction history, with similarity scores. Clustering: Segments customers using K-mean

Files Description:

Customers.csv ○ CustomerID: Unique identifier for each customer. ○ CustomerName: Name of the customer. ○ Region: Continent where the customer resides. ○ SignupDate: Date when the customer signed up.
Products.csv

○ ProductID: Unique identifier for each product. ○ ProductName: Name of the product. ○ Category: Product category. ○ Price: Product price in USD.

Transactions.csv

○ TransactionID: Unique identifier for each transaction. ○ CustomerID: ID of the customer who made the transaction. ○ ProductID: ID of the product sold. ○ TransactionDate: Date of the transaction. ○ Quantity: Quantity of the product purchased. ○ TotalValue: Total value of the transaction. ○ Price: Price of the product sold.

Assignment Tasks:

Task 1: Exploratory Data Analysis (EDA) and Business Insights

Perform EDA on the provided dataset.
Derive at least 5 business insights from the EDA. ○ Write these insights in short point-wise sentences (maximum 100 words per insight). Deliverables: ● A Jupyter Notebook/Python script containing your EDA code. ● A PDF report with business insights (maximum 500 words).

Task 2: Lookalike Model

Build a Lookalike Model that takes a user's information as input and recommends 3 similar customers based on their profile and transaction history. The model should: ● Use both customer and product information. ● Assign a similarity score to each recommended customer. Deliverables: ● Give the top 3 lookalikes with there similarity scores for the first 20 customers (CustomerID: C0001 - C0020) in Customers.csv. Form an “Lookalike.csv” which has just one map: Map<cust_id, List<cust_id, score>> ● A Jupyter Notebook/Python script explaining your model development. Evaluation Criteria: ● Model accuracy and logic. ● Quality of recommendations and similarity scores.

Task 3: Customer Segmentation / Clustering

Perform customer segmentation using clustering techniques. Use both profile information (from Customers.csv) and transaction information (from Transactions.csv). ● You have the flexibility to choose any clustering algorithm and any number of clusters in between(2 and 10) ● Calculate clustering metrics, including the DB Index(Evaluation will be done on this). ● Visualise your clusters using relevant plots.

Deliverables: ● A report on your clustering results, including: ○ The number of clusters formed. ○ DB Index value. ○ Other relevant clustering metrics. ● A Jupyter Notebook/Python script containing your clustering code. Evaluation Criteria: ● Clustering logic and metrics. ● Visual representation of clusters.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Akhil_krishnan_Clustering.ipynb		Akhil_krishnan_Clustering.ipynb
Akhil_krishnan_Clustering.pdf.pdf		Akhil_krishnan_Clustering.pdf.pdf
Akhil_krishnan_EDA.ipynb		Akhil_krishnan_EDA.ipynb
Akhil_krishnan_EDA.pdf.pdf		Akhil_krishnan_EDA.pdf.pdf
Akhil_krishnan_Lookalike.csv.csv		Akhil_krishnan_Lookalike.csv.csv
Akhil_krishnan_Lookalike.ipynb		Akhil_krishnan_Lookalike.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages