Skip to content

DataDevils/NC_WaterData

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NC Water Data Exploration

[TOC]


♦ Executive Summary ♦

I spent time exploring various internet resources for useful water data with a focus on North Carolina. I evaluated resources on the relevance of the data they provided as well as how easily the data could be accessed and downloaded, a summary of which is provided below. I also generated a number of Python snippets, provided as interactive Python notebooks (ipynb) files that demonstrate how some of these datasets are accessed and visualized.

While this represents a number of useful sites, it's certainly not exhaustive.

Summary of datasets evaluated

Source Dataset Ease of Access Data Utility Comments
EWG Tap Water Database ♦♦ ♦♦♦ Not updated, Hard to scrape
EPA Consumer Confidence Reports ♦♦♦ ♦♦ Data gaps, Inconsistent
EPA Enforcement & Compliance History Online(ECHO) ♦♦♦♦ ♦♦♦♦♦ Good, but so much data it's hard to know what's what
EPA Safe Drinking Water Info. System (SWDIS) ♦♦ ♦♦♦ Clumsy interface.
Data possibly redundant w/ECHO.
EPA Permit Compliance Sysem/Integrated Complicance Information System (PCS/ICIS) ♦♦ ♦♦♦♦ Data rich, but hard to scrape. Slow.
USGS/EPA Water Quality Portal ♦♦♦♦♦ ♦♦♦♦♦ Well organized &
comprehensive data
N DEQ/ DWR Water Resources Information, Storage, Analysis, & Retrival System (WRISARS) ♦♦ ♦♦♦ Access point to other datasets
NC DEQ/ DWR NC Withdrawal Transfer Registration ♦♦♦♦ ♦♦♦♦♦ Useful data that can be scraped
NC DWR/ DWR NC Local Water Supply Plans ♦♦♦♦♦ Useful, but difficult to scrape
NC DEQ Water Resources Information Technology Branch -- -- Link to other data portals; many require password access
NC DEQ Drinking Water Watch ♦♦ ♦♦♦♦ Useful data, layered interface, redundant data?
Charlotte Water Water quality data ♦♦♦♦♦ ♦♦♦♦♦ Comprehensive and easily accessed

I. National Data Sources

A. EWG's Tap Water Database

B. EPA Consumer Confidence Reports (CCR)

  • Overview: Not Helpful

    • While scraping is fairly straightforward, the data are inconsistent with many missing values.
    • Data are aggregated and vague
  • Link: https://www.epa.gov/ccr

  • Summary: Community Water Systems (CWS) are required to provide drinking water quality reports to customers. This is a repository of those reports.

  • Data link: https://ofmpub.epa.gov/apex/safewater/f?p=136:102::::::

    • Cryptic REST interface, but state summary data can easily be scraped.
    • Summary data includes few details, relevant data (City, County, Population Served). Some have website links, but not all are working. Links likely go to glossy summaries that don't facilitate scraping.
  • Code examples:

    • EPA/Explore-CCR-Reports.ipynb: a brief exploration of pre-downloaded data in CSV format.

C. EPA Enforcement and Compliance History Online (ECHO)

D. EPA Safe Drinking Water Information System (SWDIS)

E. EPA Permit Compliance System/Integrated Compliance Information System (PCS/ICIS)

  • Overview: Potential, but will work hard for the data, which may be replicated elsewhere (ECHO?)
    • Gateway to much data, but clumsy click interface.
    • Uses REST, but documentation is scant and structure is lacking.
    • Results are formatted.

F. US Water Quality Portal

  • Overview: Most promising!
    • Repository of many datasets from multiple sources (EPA, USGS).
    • Web services and file shares provide ready access to data with excellent documentation
    • Need to compare what data are provided relative to state/local data portals.
  • Link: https://www.waterqualitydata.us/
  • Summary: The Water Quality Portal (WQP) is a cooperative service sponsored by the United States Geological Survey (USGS), the Environmental Protection Agency (EPA), and the National Water Quality Monitoring Council (NWQMC). It serves data collected by over 400 state, federal, tribal, and local agencies: https://www.waterqualitydata.us/. The data include information on sites where data are gathered, physical/chemical monitoring data, and biological sample data. Complete metadata are available here: https://www.waterqualitydata.us/portal_userguide/
  • Data:
  • Code examples:
    • USWQP/USWaterData-Scrape.ipynb uses the WQP web service to pull station data for all sites in Durham Co. (N = 489).
    • USWQP/USWaterData-Explore.ipynb provides and example for ingesting and visualizing the US Water Quality Portal data scraped for Durham County.

II. State Data Sources: NC Dept. of Environmental Quality

A. Water Resources Information, Storage, Analysis, and Retrieval System (WRISARS)

B. Water Withdrawal & Transfer Registration

  • Overview: Useful and obtainable!
  • Link: http://www.ncwater.org/Permits_and_Registration/Water_Withdrawal_and_Transfer_Registration/.
  • Summary: The NC DEQ maintains a database of registered water withdrawal facilities that includes the amounts of withdrawals, discharges, and transfers going back to 1999.
  • Code examples:
    • NCDEQ\1-NCDEQ-Scraper.ipynb extracts a set of data file described below. Data are collected for the years spanning 2010 to 2017.
    • NCDEQWithdrawalMaster.csv - Lists of each withdrawal facility (N=1194), listing its owner, name, whether the report is in draft or completed, and the site code.
    • NCDEQMonthlyVolumeData.csv Average daily withdrawal and maximum day withdrawal by month in million gallons per day (MGD).
    • NCDEQWithdrawalSourceData.csv - Source Information - One row for each water withdrawal source.
    • NCDEQDischargeMethods.csv - Average daily discharge and maximum day discharge by month in million gallons per day (MGD)
    • NCDEQTransferInfo.csv - Details on the amount of water transferred across facilities
    • NCDEQ\2-ExploreNCWithdrawalData.ipynb provides and example of how the above files can be accessed and the data visualized.
    • NCDEQ\3-WaterBudget-Simple.ipynb explores water budgets at the watershed scale with the underlying objectives of revealing what we can show with this data and identify data gaps that would prove helpful.
    • NCDEQ\4-WaterBudget-Interactive.ipynb interactively explores water budgets at the watershed scale with the underlying objectives of revealing what we can show with this data and identify data gaps that would prove helpful

C. NC Water Supply Plans

  • Overview: Quite useful, but difficult to scrape
    • Many important attributes for constructing water budgets
    • Some hierarchical tables, making scraping somewhat difficult
  • Link: https://www.ncwater.org/Water_Supply_Planning/Local_Water_Supply_Plan/
  • Summary: The local water supply plans assess a water system's current and future needs and its ability to meet those needs. These reports span back to 1997 (with some gaps prior to 2007).
  • Data: Data include many useful attributes, including information on a system's distribution system, service area, use by type, sales, sources, sales, treatment plant capacity, wastewater information, ...
  • Code examples:
    • NCDEQ\X-WaterSupplyPlanScraper.ipynb is a work in progress to scrape the annual water supply plan data from the Department of Water Resources portal. More time is needed as code needs to be created to scrape each table individually.

D. NC Water Resources Information Technology Branch

  • Overview: Something to keep an eye on.
    • Mostly a pointer to other data sources. Many, however, require a password to access.
  • Link: https://deq.nc.gov/about/divisions/water-resources/planning/information-technology-branch
  • Summary: The Water Resources Information Technology Branch develops and maintains the Division's computer applications and databases. The branch is comprised of three units: BIMS support, Public Water Supply support, and Water Planning support. Links to many applications storing/hosting water data.
  • Code examples: None (as these are mostly links to data, not data itself)

E. Public Water System Supervision/Drinking Water Watch


III. Local Data Sources

A. Charlotte Water Data


About

Tools to find and extract NC Water Data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published