Skip to content

skyvanguard/datopy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

datopy

CI Python 3.9+ License: MIT

Open datasets from Paraguay packaged for easy use with Python and pandas.

datopy provides clean, ready-to-use datasets from official Paraguayan sources (INE, BCP, World Bank) as pandas DataFrames. No web scraping, no API keys, no downloads - just import datopy and start analyzing.

Installation

pip install datopy

Or install from source:

git clone https://github.com/skyvanguard/datopy.git
cd datopy
pip install -e .

Quick Start

import datopy

# List all available datasets
datopy.list_datasets()
# ['asuncion_neighborhoods_2022', 'departments_geography', 'education_enrollment', ...]

# Load a dataset as a pandas DataFrame
df = datopy.load_dataset('population_by_department_2022')
print(df.head())

# Get dataset metadata
datopy.describe_dataset('gdp_annual')
# {'name': 'gdp_annual', 'description': 'Gross Domestic Product...', 'source': 'World Bank / IMF', ...}

# Search datasets by keyword
datopy.search_datasets('economy')
# ['gdp_annual', 'inflation_annual', 'poverty_rate']

Available Datasets

Dataset Description Source Period Rows
population_by_department_2022 Population by department (Census 2022) INE 2022 18
population_by_department_2012 Population by department (Census 2012) INE 2012 18
asuncion_neighborhoods_2022 Population by neighborhood in Asuncion INE 2022 40
homicide_rate Homicide rate per 100,000 UNODC/World Bank 2010-2023 14
gdp_annual GDP in current USD World Bank/IMF 2000-2023 24
inflation_annual Consumer price inflation BCP/World Bank 2000-2023 24
poverty_rate Poverty and extreme poverty rate INE/World Bank 2001-2023 23
departments_geography Geographic reference (area, capital, density) INE 2022 18
exports_by_product Exports by product in USD millions BCP 2020-2023 20
education_enrollment Student enrollment by level MEC/INE 2018-2023 24
traffic_accidents Road traffic deaths and injuries ANTSV 2012-2024 13
population_projection Population projection with demographics UN WPP 2024 2000-2050 11

API Reference

load_dataset(name: str) -> pd.DataFrame

Load a dataset by name. Returns a pandas DataFrame.

list_datasets() -> list[str]

Return a sorted list of all available dataset names.

describe_dataset(name: str) -> dict

Return metadata dictionary for a dataset (description, source, columns, etc.).

search_datasets(query: str) -> list[str]

Search datasets by keyword in name, description, or category.

Data Sources

Contributing

Contributions are welcome! To add a new dataset:

  1. Add the CSV file to datopy/data/
  2. Add metadata to datopy/datasets.py
  3. Add tests
  4. Submit a PR

License

MIT License - see LICENSE for details.

About

Open datasets from Paraguay for data analysis and research

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages