ETL pipeline that extracts data from various sources and loads car accident data into a
PostgreSQL database using a star schema design.
Packages used include:
- pandas
- psycopg2
- seaborn
- plotly
- matplotlib
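
As a rough illustration of the star schema design mentioned above, the sketch below defines one fact table that references two dimension tables. Every table and column name here (`fact_accident`, `dim_time`, `dim_state`, and their columns) is an assumption made for illustration and is not taken from this repository's actual schema.

```python
# Hypothetical star schema: one fact table surrounded by dimension tables.
# All table and column names are illustrative assumptions, not the project's schema.

CREATE_DIM_TIME = """
CREATE TABLE IF NOT EXISTS dim_time (
    start_time TIMESTAMP PRIMARY KEY,
    hour       INT,
    day        INT,
    month      INT,
    year       INT,
    weekday    INT
);
"""

CREATE_DIM_STATE = """
CREATE TABLE IF NOT EXISTS dim_state (
    state_code   CHAR(2) PRIMARY KEY,
    road_ranking INT
);
"""

CREATE_FACT_ACCIDENT = """
CREATE TABLE IF NOT EXISTS fact_accident (
    accident_id TEXT PRIMARY KEY,
    start_time  TIMESTAMP REFERENCES dim_time (start_time),
    state_code  CHAR(2)   REFERENCES dim_state (state_code),
    severity    INT
);
"""

# Dimensions are created first so the fact table's foreign keys can resolve.
CREATE_TABLE_QUERIES = [CREATE_DIM_TIME, CREATE_DIM_STATE, CREATE_FACT_ACCIDENT]

# Tables are dropped in reverse order so the fact table goes before its dimensions.
TABLE_NAMES = ["dim_time", "dim_state", "fact_accident"]
DROP_TABLE_QUERIES = [
    f"DROP TABLE IF EXISTS {name} CASCADE;" for name in reversed(TABLE_NAMES)
]
```

With psycopg2, statements like these would typically be run one at a time through `cursor.execute(query)` on an open connection, followed by a commit.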
US-Car-Accident-ETL-Analysis
├── README.md
├── uml.png
├── tableau_dashboard.png
├── notebook_viz.png
├── data                         # state ranking and usholidays data only
│   │                            # car accident data found here
│   ├── state_road_rankings.csv
│   └── usholidays.csv
├── db_modules
│   ├── db_connections.py
│   └── sql_queries.py           # contains drop, insert, create statements
├── create_database.py           # creates the accidentAnalysis database tables
├── etl_helper.ipynb             # preliminary code to set up ETL
└── etl_processor.py             # loads accidents, holidays, time, and demographics data
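
To give a sense of what the time and holiday loading step might involve, here is a minimal sketch that turns one accident timestamp into a time-dimension row and flags whether it falls on a holiday. The function name `build_time_row`, the field names, the timestamp format, and the sample holiday mapping are all assumptions; the project itself lists pandas, while this sketch uses only the standard library to stay self-contained.

```python
from datetime import date, datetime

# Illustrative stand-in for rows read from usholidays.csv (column layout assumed).
SAMPLE_HOLIDAYS = {date(2019, 7, 4): "Independence Day"}


def build_time_row(timestamp: str, holidays: dict) -> dict:
    """Split a 'YYYY-MM-DD HH:MM:SS' timestamp into time-dimension fields."""
    ts = datetime.strptime(timestamp, "%Y-%m-%d %H:%M:%S")
    return {
        "start_time": ts,
        "hour": ts.hour,
        "day": ts.day,
        "month": ts.month,
        "year": ts.year,
        "weekday": ts.weekday(),  # Monday is 0, Sunday is 6
        "is_holiday": ts.date() in holidays,
    }


row = build_time_row("2019-07-04 17:30:00", SAMPLE_HOLIDAYS)
print(row["hour"], row["weekday"], row["is_holiday"])  # 17 3 True
```

Precomputing fields such as hour and weekday in a dimension table lets analysis queries group by them directly instead of parsing timestamps at query time.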


