This dataset provides a sample of public procurement contracts from the European Commission's Tenders Electronic Daily (TED) database for educational purposes. The population of contracts are those awarded between 2018 and 2022, with a single buyer and a single winner and a non-missing contract value. From this population, a random sample of 50,000 is drawn.
GeoDist is publicly accessible at https://www.cepii.fr/cepii/en/bdd_modele/bdd_modele_item.asp?id=6 and the Etalab 2.0 license (https://www.etalab.gouv.fr/wp-content/uploads/2018/11/open-licence.pdf) allows reuse and sharing.
We downloaded https://www.cepii.fr/distance/dist_cepii.dta and shared it under "data/raw/cepii/dist_cepii.dta"
Stata 14 and above. Last run on a Mac, runtime took a few minutes.
Execute the .do files from the top of the "day1" folder in the following order
- code/download_data.do
- code/read_ted.do
- code/summarize.do
- CEPII. 2024. "GeoDist [data set]" Accessed from https://www.cepii.fr/cepii/en/bdd_modele/bdd_modele_item.asp?id=6 on 2025-01-27
- Mayer, T. & Zignago, S. (2011), Notes on CEPII’s distances measures : the GeoDist Database, CEPII Working Paper 2011-25
- European Commission, 2022. "Tenders Electronic Daily (TED) (csv subset) – public procurement notices [data set]." Available at: https://data.europa.eu/data/datasets/ted-csv Sample extract distributed by Coded Thinking OÜ, 2023, available at: https://github.com/codedthinking/tender-home-bias/tree/v2.0