Skip to content

Open source code to geocode OMOP Location entities via DeGauss

License

Notifications You must be signed in to change notification settings

chorus-ai/UF-Geocoding

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exposome Data Linkage Tool

Background and Introduction

Exposome data refers to the comprehensive collection and integration of information representing an individual's or population's exposure to environmental, social, and lifestyle factors, including Social Determinants of Health (SDoH). These datasets are increasingly important for clinical research worldwide. However, because they are collected and maintained by different organizations, there is no centralized, unified database that allows researchers to quickly access, visualize, and download reliable data.

This project addresses the following needs:

  1. A GIS-enabled database to store geospatial data
  2. Efficient data ingestion with minimal manual effort
  3. A lightweight web application for data cataloging, visualization, and basic analysis
  4. Local geocoding toolkits for converting addresses to coordinates or FIPS codes

Database Design

We use PostgreSQL with the PostGIS extension to support geospatial operations. Inspired by the OHDSI GIS Working Group, our database includes core tables to store both geospatial and variable data.

Key design elements:

  • data_source table: Stores metadata from all source datasets, including SDoH, environmental data, and geometry data.
  • For each SDoH and environmental dataset, two tables are created:
    • variable_index table: Captures attributes and properties of each variable
    • variable_value table: Stores variable values linked to geocoded locations
  • Geospatial data is stored in dedicated tables and linked to variables using geoCodes such as FIPS, ZCTA, COUNTY, and STATE

Database Design


Toolkits

Most SDoH datasets use Census Tract (FIPS) boundaries. To support investigators, we provide toolkits to geocode source addresses into coordinates or FIPS codes.

To protect patient privacy, geocoding should be executed locally. Our toolkit supports both:

  • CSV file input
  • Direct extraction from OMOP CDM databases

If your institution already provides geocoding services, you may skip these toolkits.

Detailed instructions are available here.

Demo Videos


Web Application

We developed a web application that provides investigators with:

  • A data catalog
  • The exposome data linkage tool
  • Data visualization features

A basic data analysis tool will be included in the next release.

The web application is deployed on HiPerGator PubApps.

External Investigator Workflow

Contributing to Exposome Data Linkage Tool

Thank you for your interest in contributing! To ensure smooth collaboration, please follow these guidelines:

How to Contribute

  1. Fork the repository.
  2. Create a feature branch (git checkout -b feature/my-feature).
  3. Commit your changes (git commit -am 'Add new feature').
  4. Push to your branch (git push origin feature/my-feature).
  5. Open a Pull Request describing your changes.

Reporting Issues

  • Use the Issues tab for bug reports, feature requests, or questions.
  • Provide clear steps to reproduce any bugs and include any relevant screenshots or logs.

Coding Standards

  • Python scripts: PEP8 formatting
  • Markdown: Use proper headings and code blocks
  • Docker scripts: Ensure commands are tested and documented

Contact / Questions

For questions, feedback, or collaboration requests regarding this repository, please reach out to:

prismap labic3-center@ufl.edu

Or you can submit an issue directly in this repository, and we will respond promptly.

License

This repository is licensed under the MIT License. See LICENSE for details.

About

Open source code to geocode OMOP Location entities via DeGauss

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 96.8%
  • Dockerfile 3.2%