Analyzed a large dataset using Pandas for data cleaning and null value removal. Developed a dimensional model and pipeline for processing, with visualizations created in Matplotlib, Seaborn, and a dashboard in Power BI. Tools: Pandas, NumPy, Matplotlib, Seaborn, Power BI.
Python,matplotlib,numpy,pandas, Jupyter Notebooks,SQL,POWER_BI.
Download raw eCommerce data from Kaggle, ensuring it contains customer details, sales transactions, and product performance information.
Load the raw data into a SQL database to organize and manage it efficiently.
Use Jupyter Notebooks to query the SQL database for data analysis. Tools like SQLAlchemy or pymysql can be used to connect SQL with Python.
Clean and preprocess the data using Pandas,Handle missing values and outliers,Normalize and transform data into analysis-ready formats,Add new features.
Analyze the data in Jupyter Notebooks using,Matplotlib and Seaborn for visualizing sales trends, customer segmentation, and product performance. Statistical summaries to uncover patterns.
Summarize insights and create visual reports using Power BI:
https://www.kaggle.com/datasets/olistbr/brazilian-ecommerce/
1.Importing Datasets
2.Cleaning the Data
3.Data frame manipulation
4.Summarizing the Data
5.Building data pipeline




