scraped_data_raw\: This folder is where the script to scrape restaurants from addresses outputs to.11-19-2021 730pm ET\: Files with these naming patterns contain scrapes at points in time copied over from thescraped_data_rawfolder.- The csv files with the longer names in them have the timestamp of when the data was written, and the rows (corresponding to
OAK_Berk_geocodio_filtered.csv) that are in that file. df_merged.csvis these files appended to each other, and joined withOAK_Berk_geocodio_filtered.csv. These files are generated by runningprocess_raw_data.ipynb.
- The csv files with the longer names in them have the timestamp of when the data was written, and the rows (corresponding to
alemada_ca_gb_geocodio.csvOAK_Berk_geocodio.csv: The addresses to check, with lattitude, longitude, and Geoid.OAK_Berk_geocodio_filtered.csv: A shortened version ofOAK_Berk_geocodio.csvwith some rows removed.ACS\: This folder contains data from the 2019 Census.
posty2_redo.py: The script that uses selenium to take an address, and return delivery time, name, and delivery fee for a restaurant.distance2.py: Given the link to a restaurant on postmates and the lattitude and longitude of an address, find the distance between those two (will be used to normalize delivery time).distance.py: Obsolete.distance_scrape.pyposty2_redo.pyand join withOAK_Berk_geocodio_filtered.csvinto a single file calleddf_merged.csv.posty2.py: Obsolete.
process_raw_data.ipynb: Used to combine multiple CSVs fromFilter_Geocodio.ipynb: Used to filterOAK_Berk_geocodio.csvinto valid addresses inOAK_Berk_geocodio_filtered.csv.cody_analysis.ipynb: Attempt at linkingjoined_wDistance.csvtoholc_grade.csv
chromedriver: Used for selenium Macchromedriver.exe: Used for selenium Windows