The code and data behind this article: http://thedataface.com/2018/06/culture/comedy-writing-staffs
Data comes from https://www.imdb.com/interfaces/ and from web scraping individual IMDb episode ratings pages
allWritersAndGenders.csv -- contains each unique writer in the analysis and their assigned gender
top100AllDataAttempt3.csv -- data set used for analysis. Contains 12,237 observations of episode data
IMDb-webscraping-merging-munging.ipynb -- merging and subsetting downloaded IMDb datasets into a data frame of information about all episodes released by the top 100 comedies. Finished result includes a column which holds a list of the names of each writer on the observation's writing staff.
writer-gender-comp-calculate.ipynb -- use Gender API to predict gender of each unique writer, then calculate gender composition measures for each writing staff.
IMDb-Comedies+Gender-Analysis.ipynb -- Preliminary analysis and visualization of final data set