Dockerfile to access EU industrial emission and ETS data via SQL / MariaDB
- Errors and Inconsistencies in European Emission Data (Dec 2025, Industry Decarbonization Newsletter)
- Greenhouse Gas Emission Data: Public, difficult to access, and not always correct (Dec 2025, presentation at 39C3)
[!NOTE] The database and table schemas are work in progress and may change.
You can build the Docker image with make and start it with make run.
It will expose a MariaDB (MySQL compatible) server (username ghg / no password) and a
phpMyAdmin web interface on localhost.
It currently provides a database eu with multiple tables:
iep/ieprawcontain emission data from the EU's Industrial Emissions Portal.ieprawcontains the data in unmodified form (column names and content like in CSV data source), whileiepcontains a simplified version with shorter names (e.g., column nameyearinstead ofreportingYear, pollutant nameCO2instead ofCarbon dioxide (CO2)) and uninteresting data removed.etscontaints emission data for the European Emission Trading System (ETS).linkingcontains data that allows connecting entries from the two tables.
The examples subdirectory contains some usage examples in Python.
The data sources are not part of this repository. They are fetched while building the Dockerfile.
The Industrial Emissions Portal data is provided by the European Environmental Agency. We use the latest CSV version of the air releases.
The ETS data is provided by the EU's Union Registry. The latest "Verified Emissions" are provided as an XLSX (Excel) file.
The linking data has been provided by Jan Abrell, Mirjam Kosch, and Leonard Stimpfle in the Cadmus EUI Research Repository.
All data sources use a CC BY 4.0 license.
This repository was created by Hanno Böck as a research tool for the Industry Decarbonization Newsletter.