The purpose of this repo is to store all documents associated with completion of the Descriptive Analytic Tasks section for the City of Cape Town - Data Science Unit Code Challenge.
As requested the final deliverable artifact is a report (in PDF form) for the Executive Management team of the City: Urban Waste Management Report.pdf.
Supporting documentation includes:
- The main Jupyter Notebook used to investigate the data and perform analyses and visualizations is urban_waste_management_analysis.ipynb
- The analysis was conducted on the file sr_hex_truncated.csv supplied by the city. This is the structure of the dataset for reference:
- A separate Python file permutations_test.py is supplied which is used to keep the permutations testing code separate. It should be placed in the same folder as urban_waste_management_analysis.ipynb as it is imported during the running of the code.
- A requirements.txt file if required for setting up the environment (note that Python 3.12.9 was used and the only libraries required are numpy, pandas, and matplotlib so the code should really run in most environments!).
Note that Page 1 of the mock dashboard shown in the PDF was constructed using GCP's Looker Studio so only a screenshot is shared here, but the underlying data was direct from sr_hex_truncated.csv with some basic adjustments made to handle date formatting requirements.
Acknowledgement: Github Copilot was used for syntactic assistance and troubleshooting, otherwise as they used to say in school "This is all my own work!".
