Skip to content

Utilities to scrape, process, and analyze data from Canadian public accounts

Notifications You must be signed in to change notification settings

BuildCanada/public_accounts_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Public Accounts Extractor

This is a tool to make it easy to download and ETL public accounts data.

Methodology

The approach that it takes is to gradually upgrade the datasets that it grabs.

For public accounts:

  • download the HTML public accounts verbatim (the format is the same from) pb scrape <years>
  • normalize the data for each year into the same structure pb extract

For statscan datasets:

Usage

`./bin/pb

About

Utilities to scrape, process, and analyze data from Canadian public accounts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages