-
Notifications
You must be signed in to change notification settings - Fork 14
Exam project
For getting some inspiration we recommend that your read a few recent research articles that have used machine learning for conducting social science. Below is a selection of topics and associated articles.
Blumenstock, J., Cadamuro, G., and On, R. (2015). Predicting poverty and wealth from mobile phone metadata. Science, 350(6264):1073--1076
Stephens-Davidowitz, S. (2013b). Unreported Victims of an Economic Downturn
Glaeser, E. L., Kominers, S. D., Luca, M., and Naik, N. (2015). Big Data and Big Cities: The Promises and Limitations of Improved Measures of Urban Life. Working Paper 21778, National Bureau of Economic Research
Gebru, T., Krause, J., Wang, Y., Chen, D., Deng, J., Aiden, E. L., and Fei-Fei, L. (2017). Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States.
Wu, A. (2017). Gender Stereotyping in Academia: Evidence from Economics Job Market Rumors Forum. SSRN Scholarly Paper ID 3051462, Social Science Research Network, Rochester, NY
Gentzkow, M., Shapiro, J. M., and Taddy, M. (2016). Measuring Polarization in High-Dimensional Data: Method and Application to Congressional Speech. Working Paper 22423, National Bureau of Economic Research
Hansen, S., McMahon, M. and Prat, A., 2017. Transparency and deliberation within the FOMC: a computational linguistics approach. The Quarterly Journal of Economics, 133(2), pp.801-870.
Mønsted, B., Sapieżyński, P., Ferrara, E. and Lehmann, S., 2017. Evidence of complex contagion of information in social media: An experiment using Twitter bots. PloS one, 12(9), p.e0184148.
The grade for this course is exclusively determined by the project handed in. The project will be judged on a number of dimensions, these include:
- how the data was obtained (setting up new data collection);
- how the tools for working with networks, geo-data, text and machine learning are applied (at least one must be used, two is recommended);
- how the methods are applied and which methods are used;
- how results are explained (writing, figures, tables with model output etc.);
- the research question and its originality and how it is answered.
The exam projects have a number of requirements that must be met, these are:
- Research question (hand in April 30, max half page)
- Groups with up to four members, you choose
- Project formalia
- Report (.pdf file)
- The style should be like a light written research article (brief literature review, references to methods etc.)
- Grading will be based on this report but process should be document in Jupyter Notebook.
- The following maximum number of pages (normalsider): 1 pax: 12 pages, 2 pax: 16 pages, 3 pax: 20 pages, 4 pax: 24 pages.
- Documentation in Jupyter Notebook (.ipynb file)
- Report (.pdf file)
- Some advice
- It is more important that you spent time on calibrating and validating the models you work with rather than using as many models as possible.