openagri-eu/Geodatabase_OpenAgri

Geospatial database_OpenAgri

This document provides metadata for the GeoPackage associated with the study titled "Geospatial Framework for Assessing the Suitability and Demand for Agricultural Digital Solutions in Europe: A Tool for Informed Decision-Making". The GeoPackage contains a single layer of NUTS3 polygons, representing regional boundaries across Europe. Each polygon is attributed with:

Environmental, socio-economic, and connectivity variables: These variables form the basis for the geospatial analysis conducted in the study.

Derived indexes: The indexes assess regional suitability and demand for agricultural digital solutions (ADS), as well as connectivity performance. These include:

  • Natural Characteristics Similarity Index (NCSI)
  • Socio-Economic Characteristics Similarity Index (SCSI)
  • Fertilization Need Index (FNI)
  • Irrigation Need Index (INI)
  • Pest Management Need Index (PMNI)
  • Rural Connectivity Performance Index (RCPI)
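Because a GeoPackage is an SQLite database file, its layer and attribute names can be inspected with Python's standard library alone. The sketch below is a minimal illustration and not part of the published workflow: the layer name `nuts3` and the small in-memory database are hypothetical stand-ins that only mimic the `gpkg_contents` table defined by the GeoPackage standard, and the helper function names are introduced here for illustration.

```python
import sqlite3


def list_feature_layers(conn):
    """Return the names of all feature layers registered in a GeoPackage."""
    rows = conn.execute(
        "SELECT table_name FROM gpkg_contents WHERE data_type = 'features'"
    ).fetchall()
    return [name for (name,) in rows]


def list_columns(conn, table_name):
    """Return the column names of one layer, in table order."""
    # Table names cannot be bound as SQL parameters, so the name is quoted
    # by hand; only pass names that came from gpkg_contents.
    quoted = '"' + table_name.replace('"', '""') + '"'
    rows = conn.execute(f"PRAGMA table_info({quoted})").fetchall()
    return [row[1] for row in rows]  # row[1] holds the column name


def build_demo_geopackage():
    """Build a tiny in-memory stand-in (hypothetical layer name 'nuts3')."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE gpkg_contents (table_name TEXT, data_type TEXT)")
    conn.execute("INSERT INTO gpkg_contents VALUES ('nuts3', 'features')")
    conn.execute(
        "CREATE TABLE nuts3 (fid INTEGER PRIMARY KEY, NUTS_ID TEXT, "
        "Population INTEGER, NUTS3_Area REAL)"
    )
    return conn


demo = build_demo_geopackage()
layers = list_feature_layers(demo)
columns = list_columns(demo, layers[0])
print(layers, columns)
```

To inspect the real dataset, replace `build_demo_geopackage()` with `sqlite3.connect(...)` pointing at the downloaded `.gpkg` file. Geometry values are stored as binary blobs, so a GIS library is still needed to work with the polygons themselves.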

Citation Guidelines

Users of this dataset are required to cite the article "Chalazas, T.; Koukourikos, A.; Bauwens, J.; Berkvens, N.; Van Beek, J.; Kalatzis, N.; Papadopoulos, G.; Ilias, P.; Marianos, N.; Brewster, C. Geospatial Framework for Assessing the Suitability and Demand for Agricultural Digital Solutions in Europe: A Tool for Informed Decision-Making. ISPRS Int. J. Geo-Inf. 2025, 14, 185. https://doi.org/10.3390/ijgi14050185" in any publication or project utilizing this data. Additionally, detailed references to the original databases from which the environmental, socio-economic, and connectivity variables were sourced are provided in the article and this documentation. These sources should also be appropriately cited to acknowledge the original data providers.

This documentation serves as a quick reference to the dataset structure and content. For detailed information on the variables, their calculation methodologies, and the broader context of the study, please consult the associated article.

The variables Soil_Organic_Carbon, Nitrogen, Phosphorous, Soil_Texture_Sand, Soil_Texture_Clay, and Soil_Texture_Silt were derived from the LUCAS 2018 Soil Module and aggregated to the NUTS3 polygons to build the relevant indexes. The aggregated values for these variables are not provided here, but the original datasets can be requested from the European Soil Data Centre (ESDAC):

https://esdac.jrc.ec.europa.eu/content/lucas-2018-topsoil-data#tabs-0-description=0

https://esdac.jrc.ec.europa.eu/content/topsoil-physical-properties-europe-based-lucas-topsoil-data

Related references:

Orgiazzi, A., Ballabio, C., Panagos, P., Jones, A., Fernández-Ugalde, O. (2018). LUCAS Soil, the largest expandable soil dataset for Europe: A review. European Journal of Soil Science, 69(1): 140-153. https://doi.org/10.1111/ejss.12499

Fernandez-Ugalde, O., Scarpa, S., Orgiazzi, A., Panagos, P., Van Liedekerke, M., Marechal A., & Jones, A. (2022). LUCAS 2018 Soil Module. Presentation of dataset and results. EUR 31144 EN, Publications Office of the European Union, Luxembourg. ISBN 978-92-76-54832-4, https://doi.org/10.2760/215013.

Ballabio, C., Panagos, P., Montanarella, L. (2016). Mapping topsoil physical properties at European scale using the LUCAS database. Geoderma, 261: 110-123.

Geodatabase Columns

  1. column_name: fid
  • Data Type: Integer
  • Description: A unique identifier for each feature in the dataset.
  1. column_name: NUTS_ID
  • Data Type: Text
  • Description: Unique identifier for each NUTS 3 region.
  • Notes: Based on the NUTS (Nomenclature of Territorial Units for Statistics) classification.
  1. column_name: LEVL_CODE
  • Data Type: Integer
  • Description: Code representing the NUTS level of the polygon.
  • Notes: For NUTS 3, this value is typically 3.
  1. column_name: CNTR_CODE
  • Data Type: Text
  • Description: ISO country code indicating the country to which the NUTS 3 region belongs.
  1. column_name: NAME_LATN
  • Data Type: Text
  • Description: Official name of the NUTS 3 region in Latin characters.
  1. column_name: NUTS_NAME
  • Data Type: Text
  • Description: Name of the NUTS 3 region in the native language.
  1. column_name: MOUNT_TYPE
  • Data Type: Integer
  • Description: Classification of the region based on mountain typology.
  • Notes: Values typically range from 0 (non-mountainous) to specific codes for mountainous regions.
  1. column_name: URBN_TYPE
  • Data Type: Integer
  • Description: Classification of the region based on urban typology.
  • Notes: Values typically range from 1 (urban) to 3 (rural).
  1. column_name: COAST_TYPE
  • Data Type: Integer
  • Description: Classification of the region based on coastal proximity.
  • Notes: Values typically range from 0 (non-coastal) to 1 (coastal).

NUTS_ID, LEVL_CODE, CNTR_CODE, NAME_LATN, NUTS_NAME, MOUNT_TYPE, URBN_TYPE, and COAST_TYPE are derived from Eurostat's NUTS classification system.

Eurostat (2021): Nomenclature of Territorial Units for Statistics (NUTS). Available at: https://ec.europa.eu/eurostat/web/nuts

  1. column_name: Annual_Precipitation
  • Data Type: decimal (double)
  • Description: Aggregated annual precipitation within the NUTS3 region.
  • Units: millimeters per square kilometer (mm/km²)
  • Notes: Derived/Calculated from geospatial analysis of the Copernicus Climate Change Service ERA5 data.

References:

Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., Thépaut, J-N. (2023): ERA5 hourly data on single levels from 1940 to present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS).

  1. column_name: Surface_Soil_Moisture (average for highest FAPAR) (volume fraction)
  • Data Type: decimal (double)
  • Description: Average surface soil moisture during the month with the highest Fraction of Absorbed Photosynthetically Active Radiation (FAPAR) within the NUTS3 region.
  • Units: Volume fraction (m³/m³)
  • Notes: Derived/Calculated from geospatial analysis of the NASA-USDA Enhanced SMAP Global Soil Moisture data and FAPAR datasets.
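The selection rule described for this column (take the soil moisture of the month in which FAPAR peaks) can be sketched as follows. This is only an illustration under assumptions: the function name, the use of regional monthly means as input, and all numbers are made up, and the study's actual processing chain is described in the associated article.

```python
def soil_moisture_at_peak_fapar(monthly_fapar, monthly_soil_moisture):
    """Return (month, soil moisture) for the month with the highest FAPAR.

    Both arguments map a month number (1-12) to a regional mean value.
    """
    if monthly_fapar.keys() != monthly_soil_moisture.keys():
        raise ValueError("both series must cover the same months")
    # Pick the month whose FAPAR value is largest.
    peak_month = max(monthly_fapar, key=monthly_fapar.get)
    return peak_month, monthly_soil_moisture[peak_month]


# Made-up values for one region (not taken from the dataset).
fapar = {5: 0.41, 6: 0.58, 7: 0.63, 8: 0.52}
soil_moisture = {5: 0.31, 6: 0.27, 7: 0.22, 8: 0.19}  # volume fraction

peak_month, moisture = soil_moisture_at_peak_fapar(fapar, soil_moisture)
print(peak_month, moisture)
```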

References:

Entekhabi, D., Njoku, E. G., O'Neill, P. E., Kellogg, K. H., Crow, W. T., Edelstein, W. N., ... & Van Zyl, J. (2010). The Soil Moisture Active Passive (SMAP) mission. Proceedings of the IEEE, 98(5), 704-716.

  1. column_name: Days_year_over_10°C
  • Data Type: Integer
  • Description: Mean annual number of days with temperatures exceeding 10°C within the NUTS3 region.
  • Units: Days per year
  • Notes: Derived/Calculated from geospatial analysis of climatic datasets (e.g., Copernicus Climate Change Service ERA5).
  1. column_name: Species_Richness
  • Data Type: Integer
  • Description: Aggregated number of distinct species observed within the NUTS3 region, representing biodiversity.
  • Units: Count (number of species)
  • Notes: Derived/Calculated from geospatial analysis of the European Environment Agency's Article 17 spatial data under the Habitats Directive 92/43/EEC.
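For Days_year_over_10°C, the counting step can be illustrated with a short sketch. It assumes one temperature value per day per year and a strict "greater than 10 °C" test; this documentation does not say whether daily mean or maximum temperatures were used, so treat the input as a placeholder. The function name and all values are hypothetical.

```python
from statistics import mean


def mean_days_over_threshold(daily_temps_by_year, threshold_c=10.0):
    """Average, across years, the number of days strictly above the threshold."""
    counts = [
        sum(1 for temp in temps if temp > threshold_c)
        for temps in daily_temps_by_year.values()
    ]
    return mean(counts)


# Made-up, shortened series (five "days" per year) for illustration only.
temps = {
    2020: [8.0, 10.0, 10.5, 12.0, 9.9],   # 2 days above 10 °C
    2021: [11.0, 13.5, 10.1, 7.0, 15.0],  # 4 days above 10 °C
}
result = mean_days_over_threshold(temps)
print(result)
```

Since the column is stored as an integer, a real pipeline would presumably round the mean; the rounding rule is not stated in this documentation.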

References:

European Commission, Council Directive 92/43/EEC on the conservation of natural habitats and of wild fauna and flora, European Community Gazette 206 (1992) 1-50.

  1. column_name: Farms_Predominant_Size
  • Data Type: Text
  • Description: The predominant farm size in hectares (Ha) within the NUTS3 region, representing the most common size category of agricultural holdings.
  • Units: Hectares (Ha)
  • Notes: Derived from Eurostat's Farm Structure Survey (FSS) and aggregated to the NUTS3 level.
  1. column_name: Farms_Predominant_Economic_Size
  • Data Type: Text
  • Description: The predominant economic size of farms in euros (€) within the NUTS3 region, representing the most common economic size category of agricultural holdings.
  • Units: Euros (€)
  • Notes: Derived from Eurostat's Farm Structure Survey (FSS) and aggregated to the NUTS3 level.
  1. column_name: Farms_Predominant_Legal_Form
  • Data Type: Text
  • Description: The predominant legal form of agricultural holdings within the NUTS3 region, indicating whether farms are primarily held by natural persons (individuals) or by legal entities.
  • Units: Categorical (e.g., "Legal", "Natural")
  • Notes: Derived from Eurostat's Farm Structure Survey (FSS) and aggregated to the NUTS3 level.
  1. column_name: Natural_Person_Legal_Form_to_Total
  • Data Type: Text
  • Description: The proportion of farms operating under the legal form of a natural person (e.g., individual or family-owned farms) relative to the total number of farms within the NUTS3 region.
  • Units: Ratio (0-1)
  • Notes: Derived from Eurostat's Farm Structure Survey (FSS) and aggregated to the NUTS3 level.

The variables Farms_Predominant_Size, Farms_Predominant_Economic_Size, Farms_Predominant_Legal_Form, and Natural_Person_Legal_Form_to_Total are derived from Eurostat's Farm Structure Survey (FSS). These data provide critical insights into the structural characteristics of farms across Europe.
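The two kinds of summary described above (a "most common category" and a natural-person share) can be sketched as below. This is a hedged illustration, not the study's code: the function names, the class labels, and the holding counts are invented, and the exact FSS categories are documented by Eurostat.

```python
def predominant_category(holdings_by_category):
    """Return the category with the largest number of holdings."""
    return max(holdings_by_category, key=holdings_by_category.get)


def natural_person_share(holdings_by_legal_form, natural_key="Natural"):
    """Return natural-person holdings divided by all holdings (0-1)."""
    total = sum(holdings_by_legal_form.values())
    if total == 0:
        raise ValueError("region has no holdings")
    return holdings_by_legal_form.get(natural_key, 0) / total


# Made-up holding counts for one region; the class labels are placeholders.
size_classes = {"< 5 ha": 120, "5-20 ha": 340, "> 20 ha": 90}
legal_forms = {"Natural": 450, "Legal": 100}

top_size = predominant_category(size_classes)
top_form = predominant_category(legal_forms)
share = natural_person_share(legal_forms)
print(top_size, top_form, round(share, 3))
```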

For more information, refer to Eurostat's Farm Structure Survey data: https://ec.europa.eu/eurostat/databrowser/view/ef_m_farmang/default/table?lang=en.

  1. column_name: Rural_Fixed_Broadband_Coverage
  • Data Type: Integer
  • Description: The percentage of rural households within the NUTS3 region covered by fixed broadband infrastructure.
  • Units: Percentage (%)
  • Notes: Derived from Eurostat's broadband coverage statistics and aggregated to NUTS3 regions.
  1. column_name: Rural_5G_Cellular_Network_Coverage
  • Data Type: Integer
  • Description: The percentage of rural households within the NUTS3 region covered by 5G cellular network infrastructure.
  • Units: Percentage (%)
  • Notes: Derived from Eurostat's broadband and cellular network statistics and aggregated to NUTS3 regions.

The variables Rural_Fixed_Broadband_Coverage and Rural_5G_Cellular_Network_Coverage are derived from Eurostat's broadband and cellular network coverage statistics. These datasets provide detailed insights into the level of digital infrastructure available in rural areas across Europe.

For more information, refer to Eurostat's broadband coverage statistics: https://ec.europa.eu/eurostat/databrowser/view/isoc_cbt/default/table?lang=en.

  1. column_name: Population
  • Data Type: Integer
  • Description: Total population within the NUTS3 region, representing the number of residents.
  • Units: Count (number of people)
  • Notes: Calculated from the JRC GHS-POP population grid (see the reference below) and aggregated to the NUTS3 level through geospatial analysis.

References:

Schiavina, M., Freire, S., Carioli, A., & MacManus, K. (2023). GHS-POP R2023A–GHS Population Grid Multitemporal (1975-2030). In European Commission. Joint Research Centre (JRC).

  1. column_name: NUTS3_Area
  • Data Type: decimal (double)
  • Description: Total land area of the NUTS3 region, calculated in square kilometers (km²).
  • Units: Square kilometers (km²)
  • Notes: Derived from geospatial analysis of the NUTS3 polygons.
  1. column_name: Population_Density
  • Data Type: decimal (double)
  • Description: Population density within the NUTS3 region, calculated as the total population divided by the total land area.
  • Units: Population per square kilometer (population/km²)
  • Notes: This variable is derived by dividing the Population column by the NUTS3_Area column, both of which were aggregated or calculated through geospatial analysis within the NUTS3 boundaries.
  1. column_name: Median_Elevation
  • Data Type: decimal (double)
  • Description: Median elevation of the NUTS3 region, representing the middle value of elevation when all elevation measurements within the region are sorted.
  • Units: Meters (m)
  • Notes: Derived from the Copernicus Digital Elevation Model (COP-DEM), accessed via the Copernicus Data Space Ecosystem. Elevation values were aggregated within each NUTS3 polygon through geospatial analysis to provide a representative measure of terrain characteristics.
  1. column_name: Natural_Characteristics_Index
  • Data Type: Integer
  • Description: Cluster classification representing groups of NUTS3 regions with similar natural characteristics, such as soil properties, elevation, precipitation, and soil moisture.
  • Units: Cluster ID (categorical, integer values)
  1. column_name: Socio-economic_Similarity_Index
  • Data Type: Integer
  • Description: Cluster classification representing groups of NUTS3 regions with similar socio-economic characteristics, such as predominant farm size, economic size, and legal form.
  • Units: Cluster ID (categorical, integer values)
  1. column_name: Fertilization_Need_Index
  • Data Type: decimal (double)
  • Description: Performance index ranging from 0 to 1 representing the need for fertilization ADSs in NUTS3 regions (low values indicate low need, high values indicate high need).
  • Units: 0 to 1 (performance values)
  1. column_name: Irrigation_Need_Index
  • Data Type: decimal (double)
  • Description: Performance index ranging from 0 to 1 representing the need for irrigation ADSs in NUTS3 regions (low values indicate low need, high values indicate high need).
  • Units: 0 to 1 (performance values)
  1. column_name: Pest_Management_Need_Index
  • Data Type: decimal (double)
  • Description: Performance index ranging from 0 to 1 representing the need for pest management ADSs in NUTS3 regions (low values indicate low need, high values indicate high need).
  • Units: 0 to 1 (performance values)
  1. column_name: Rural_Connectivity_Performance_Index
  • Data Type: decimal (double)
  • Description: Performance index ranging from 0 to 1 representing the rural cloud connectivity capabilities of NUTS3 regions (low values indicate low capability, high values indicate high capability).
  • Units: 0 to 1 (performance values)
  1. column_name: Dominant_Crop_1
  • Data Type: Text
  • Description: The most predominant crop cultivated within the NUTS3 region, based on production data originally reported at the NUTS2 level.
  • Units: N/A
  • Notes: Crop data was sourced from Eurostat’s “Crop production in national humidity by NUTS 2 region” dataset (https://ec.europa.eu/eurostat/databrowser/view/apro_cpnhr/default/table?lang=en). To assign crop dominance at the NUTS3 level, the dominant crop information from the NUTS2 polygon was spatially transferred to its corresponding NUTS3 polygons. This method assumes uniformity in crop distribution within each NUTS2 region.
  1. column_name: Dominant_Crop_2
  • Data Type: Text
  • Description: The second most predominant crop cultivated within the NUTS3 region, based on production data originally reported at the NUTS2 level.
  • Units: N/A
  1. column_name: Dominant_Crop_3
  • Data Type: Text
  • Description: The third most predominant crop cultivated within the NUTS3 region, based on production data originally reported at the NUTS2 level.
  • Units: N/A
  1. column_name: Dominant_Crop_4
  • Data Type: Text
  • Description: The fourth most predominant crop cultivated within the NUTS3 region, based on production data originally reported at the NUTS2 level.
  • Units: N/A
  1. column_name: Dominant_Crop_5
  • Data Type: Text
  • Description: The fifth most predominant crop cultivated within the NUTS3 region, based on production data originally reported at the NUTS2 level.
  • Units: N/A
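The NUTS2-to-NUTS3 transfer behind the Dominant_Crop columns was done spatially in the study. The sketch below shows an attribute-based alternative that gives the same assignment when both layers use the same NUTS version: NUTS codes are hierarchical, so the first four characters of a five-character NUTS3 code identify its parent NUTS2 region. The function names and the crop rankings are hypothetical, and the NUTS codes are used only as examples.

```python
def parent_nuts2(nuts3_id):
    """Return the NUTS2 code that contains a NUTS3 region."""
    if len(nuts3_id) != 5:
        raise ValueError(f"expected a 5-character NUTS3 code, got {nuts3_id!r}")
    return nuts3_id[:4]


def transfer_dominant_crops(nuts3_ids, crops_by_nuts2):
    """Copy each NUTS2 crop ranking to all of its NUTS3 regions.

    Regions whose parent has no crop data are mapped to None.
    """
    return {nid: crops_by_nuts2.get(parent_nuts2(nid)) for nid in nuts3_ids}


# Made-up crop rankings (most to least dominant) for one NUTS2 region.
crops_by_nuts2 = {"EL52": ["Wheat", "Cotton", "Maize"]}
nuts3_ids = ["EL522", "EL524", "BE211"]

assigned = transfer_dominant_crops(nuts3_ids, crops_by_nuts2)
print(assigned)
```

Every NUTS3 region under the same parent receives an identical ranking, which is the uniformity assumption noted for Dominant_Crop_1.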

About

Geodatabase constituting the knowledge base of the OpenAgri Decision Support Tool.
