Open datasets from Paraguay packaged for easy use with Python and pandas.
datopy provides clean, ready-to-use datasets from official Paraguayan sources (INE, BCP, World Bank) as pandas DataFrames. No web scraping, no API keys, no downloads - just import datopy and start analyzing.
pip install datopyOr install from source:
git clone https://github.com/skyvanguard/datopy.git
cd datopy
pip install -e .import datopy
# List all available datasets
datopy.list_datasets()
# ['asuncion_neighborhoods_2022', 'departments_geography', 'education_enrollment', ...]
# Load a dataset as a pandas DataFrame
df = datopy.load_dataset('population_by_department_2022')
print(df.head())
# Get dataset metadata
datopy.describe_dataset('gdp_annual')
# {'name': 'gdp_annual', 'description': 'Gross Domestic Product...', 'source': 'World Bank / IMF', ...}
# Search datasets by keyword
datopy.search_datasets('economy')
# ['gdp_annual', 'inflation_annual', 'poverty_rate']| Dataset | Description | Source | Period | Rows |
|---|---|---|---|---|
population_by_department_2022 |
Population by department (Census 2022) | INE | 2022 | 18 |
population_by_department_2012 |
Population by department (Census 2012) | INE | 2012 | 18 |
asuncion_neighborhoods_2022 |
Population by neighborhood in Asuncion | INE | 2022 | 40 |
homicide_rate |
Homicide rate per 100,000 | UNODC/World Bank | 2010-2023 | 14 |
gdp_annual |
GDP in current USD | World Bank/IMF | 2000-2023 | 24 |
inflation_annual |
Consumer price inflation | BCP/World Bank | 2000-2023 | 24 |
poverty_rate |
Poverty and extreme poverty rate | INE/World Bank | 2001-2023 | 23 |
departments_geography |
Geographic reference (area, capital, density) | INE | 2022 | 18 |
exports_by_product |
Exports by product in USD millions | BCP | 2020-2023 | 20 |
education_enrollment |
Student enrollment by level | MEC/INE | 2018-2023 | 24 |
traffic_accidents |
Road traffic deaths and injuries | ANTSV | 2012-2024 | 13 |
population_projection |
Population projection with demographics | UN WPP 2024 | 2000-2050 | 11 |
Load a dataset by name. Returns a pandas DataFrame.
Return a sorted list of all available dataset names.
Return metadata dictionary for a dataset (description, source, columns, etc.).
Search datasets by keyword in name, description, or category.
- INE (Instituto Nacional de Estadistica) - ine.gov.py
- BCP (Banco Central del Paraguay) - bcp.gov.py
- World Bank - data.worldbank.org
- UNODC - dataunodc.un.org
- MEC (Ministerio de Educacion y Ciencias) - mec.gov.py
- ANTSV (Agencia Nacional de Transito y Seguridad Vial) - antsv.gov.py
- UN WPP (World Population Prospects) - population.un.org
Contributions are welcome! To add a new dataset:
- Add the CSV file to
datopy/data/ - Add metadata to
datopy/datasets.py - Add tests
- Submit a PR
MIT License - see LICENSE for details.