Skip to content

Debojyoti2904/vendor-performance-analysis-python-sql-powerbi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧾 Vendor Performance Analysis – Retail Inventory & Sales

Analyzing vendor efficiency and profitability to support strategic purchasing and inventory decisions using SQL, Python, and Power BI.


📌 Table of Contents


Overview

This project provides a comprehensive analysis of vendor performance and retail inventory to help drive data-driven purchasing, pricing, and inventory optimization decisions.
It integrates SQL for ETL, Python for advanced analysis, and Power BI for dashboard visualization — building a complete end-to-end analytics solution.


Business Problem

Efficient management of inventory and vendor relationships is essential to maintain profitability in retail operations.
This project aims to:

  • Identify underperforming brands that may need pricing or promotional adjustments
  • Measure vendor-level contribution to total sales and profit
  • Quantify cost benefits from bulk purchasing
  • Detect inefficiencies in inventory turnover
  • Statistically evaluate differences in vendor profitability

Dataset

  • Multiple CSV files stored in /data/ directory (sales, vendors, inventory)
  • A summary table is generated by merging and aggregating data for vendor-level analysis

Tools & Technologies

  • SQL → Data extraction, cleaning, and transformation using CTEs, Joins, and Filters
  • Python → Analytical computation and hypothesis testing (Pandas, Matplotlib, Seaborn, SciPy)
  • Power BI → Interactive dashboards and visual insights
  • GitHub → Version control and documentation

Project Structure

vendor-performance-analysis/
│
├── README.md
├── .gitignore
├── requirements.txt
├── Vendor Performance Report.pdf
│
├── notebooks/                  # Jupyter notebooks
│   ├── exploratory_data_analysis.ipynb
│   ├── vendor_performance_analysis.ipynb
│   ├── vendor.ipynb
│
├── scripts/                    # Python scripts for ingestion and processing
│   ├── ingestion_db.py
│   └── get_vendor_summary.py
│
├── dashboard/                  # Power BI dashboard file
│   └── vendor_performance_dashboard.pbix

Data Cleaning & Preparation

Data preprocessing steps included:

  • Filtering invalid records:
    • Removed rows where Gross Profit ≤ 0
    • Removed rows where Profit Margin ≤ 0
    • Excluded transactions with Sales Quantity = 0
  • Data consistency:
    • Standardized data types (dates, floats, and categories)
    • Handled outliers and missing values
  • Aggregation:
    • Created vendor-level summary tables with metrics such as average profit, total sales, and margin

Exploratory Data Analysis (EDA)

Negative or Zero Values Detected:

  • Gross Profit: Min -52,002.78 (loss-making sales)
  • Profit Margin: Min -∞ (sales at or below cost)
  • Unsold Inventory detected → indicates slow-moving or obsolete stock

Outliers Identified:

  • Freight Costs as high as 257K
  • Unusually large purchase and sales prices

Correlation Insights:

  • Weak correlation between Purchase Price & Profit
  • Strong correlation between Purchase Quantity & Sales Quantity (0.999)
  • Negative correlation between Profit Margin & Sales Price (-0.179)

Research Questions & Key Findings

  1. Brands for Promotions: 198 brands with low sales but high profit margins
  2. Top Vendors: Top 10 vendors account for 65.69% of purchases → over-reliance risk
  3. Bulk Purchasing Impact: Achieved 72% cost savings per unit in bulk orders
  4. Inventory Turnover: $2.71M worth of unsold inventory detected
  5. Vendor Profitability Comparison:
    • High-performing vendors → Mean Margin = 31.17%
    • Low-performing vendors → Mean Margin = 41.55%
  6. Hypothesis Testing Results: Statistically significant difference in profit margins → confirms distinct vendor strategies

Dashboard

Power BI Dashboard Highlights:

  • Vendor-wise Sales & Profitability Overview
  • Inventory Turnover and Stock Insights
  • Bulk Purchase Cost Savings Visualization
  • Performance Heatmaps by Brand and Vendor

Vendor Performance Dashboard


How to Run This Project

  1. Clone the repository:
    git clone https://github.com/yourusername/vendor-performance-analysis.git
    
  2. Load and ingest CSV data into database:
python scripts/ingestion_db.py
  1. Create vendor summary table:
python scripts/get_vendor_summary.py
  1. Open and run notebooks:
    • notebooks/exploratory_data_analysis.ipynb
    • notebooks/vendor_performance_analysis.ipynb
  2. Open Power BI Dashboard:
    • dashboard/vendor_performance_dashboard.pbix

Final Recommendations

  • Diversify vendor base to reduce risk
  • Optimize bulk order strategies
  • Reprice slow-moving, high-margin brands
  • Clear unsold inventory strategically
  • Improve marketing for underperforming vendors

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors