This project is the final capstone for the IBM Data Analyst Certificate. It includes exploratory data analysis, data cleaning, and visualization using Python.
dataset/– Raw and cleaned data files.notebooks/– Jupyter Notebooks for analysis.reports/– Final reports summarizing findings.presentation/– PowerPoint presentation.images/– Screenshots of key insights.
- 📈 Data cleaning process removed 15% of missing values.
- 🔥 Correlation found between variables X and Y.
- 🌎 Geographical analysis revealed regional trends.
- Python (Pandas, Matplotlib, Seaborn)
- Jupyter Notebooks
- SQL (if applicable)
- PowerPoint
📥 How to Use
- Clone the repo:
git clone https://github.com/anthonydenis01/ibm-data-analyst-capstone.git - Open the
notebooks/folder in Jupyter Notebook. - Run
exploratory_analysis.ipynbto explore the data.
License: MIT License