This project was developed as part of the Snestron Data Analytics Internship. It focuses on analyzing a synthetic sales dataset using Python for data cleaning and exploratory analysis, and Power BI for interactive visualization. The aim is to simulate a real-world business scenario and deliver actionable insights through data.
sales-data-analysis/
βββ data/
β βββ synthetic_sales_data.csv # Raw dataset
β βββ clean_sales_data.csv # Cleaned dataset for Power BI
βββ notebooks/
β βββ sales_analysis.ipynb # Python analysis & visualizations
βββ powerbi/
β βββ sales_dashboard.pbix # Power BI dashboard file
βββ screenshots/
β βββ dashboard_overview.png # Screenshots of Power BI dashboard
βββ README.md # Project documentation
- Python (Pandas, Matplotlib, Seaborn)
- Jupyter Notebook
- Power BI Desktop
- GitHub (for project submission)
From the analysis:
- π Monthly sales trends show clear patterns and seasonality.
- π Electronics category contributes the highest revenue.
- π Region-wise, North and West performed better than South and East.
- π‘ Top products and customers generate a significant share of total sales.
Interactive dashboard built in Power BI with:
- KPI cards (Total Sales, Total Quantity, Avg Order Value)
- Sales trend over time
- Category-wise and region-wise breakdown
- Top 10 products and customers
-
Open
notebooks/sales_analysis.ipynb -
Install dependencies:
pip install pandas matplotlib seaborn
-
Run the notebook to reproduce EDA and plots.
- Open
powerbi/sales_dashboard.pbixin Power BI Desktop. - Interact with filters and slicers for insights.
This repository contains the final deliverable for the internship task. It demonstrates the complete data analytics lifecycle: Data β Cleaning β Analysis β Visualization β Insights β Reporting.
