Exposome data refers to the comprehensive collection and integration of information representing an individual's or population's exposure to environmental, social, and lifestyle factors, including Social Determinants of Health (SDoH). These datasets are increasingly important for clinical research worldwide. However, because they are collected and maintained by different organizations, there is no centralized, unified database that allows researchers to quickly access, visualize, and download reliable data.
This project addresses the following needs:
- A GIS-enabled database to store geospatial data
- Efficient data ingestion with minimal manual effort
- A lightweight web application for data cataloging, visualization, and basic analysis
- Local geocoding toolkits for converting addresses to coordinates or FIPS codes
We use PostgreSQL with the PostGIS extension to support geospatial operations. Inspired by the OHDSI GIS Working Group, our database includes core tables to store both geospatial and variable data.
Key design elements:
data_sourcetable: Stores metadata from all source datasets, including SDoH, environmental data, and geometry data.- For each SDoH and environmental dataset, two tables are created:
variable_indextable: Captures attributes and properties of each variablevariable_valuetable: Stores variable values linked to geocoded locations
- Geospatial data is stored in dedicated tables and linked to variables using geoCodes such as FIPS, ZCTA, COUNTY, and STATE
Most SDoH datasets use Census Tract (FIPS) boundaries. To support investigators, we provide toolkits to geocode source addresses into coordinates or FIPS codes.
To protect patient privacy, geocoding should be executed locally. Our toolkit supports both:
- CSV file input
- Direct extraction from OMOP CDM databases
If your institution already provides geocoding services, you may skip these toolkits.
Detailed instructions are available here.
We developed a web application that provides investigators with:
- A data catalog
- The exposome data linkage tool
- Data visualization features
A basic data analysis tool will be included in the next release.
The web application is deployed on HiPerGator PubApps.
Thank you for your interest in contributing! To ensure smooth collaboration, please follow these guidelines:
- Fork the repository.
- Create a feature branch (
git checkout -b feature/my-feature). - Commit your changes (
git commit -am 'Add new feature'). - Push to your branch (
git push origin feature/my-feature). - Open a Pull Request describing your changes.
- Use the Issues tab for bug reports, feature requests, or questions.
- Provide clear steps to reproduce any bugs and include any relevant screenshots or logs.
- Python scripts: PEP8 formatting
- Markdown: Use proper headings and code blocks
- Docker scripts: Ensure commands are tested and documented
For questions, feedback, or collaboration requests regarding this repository, please reach out to:
prismap lab – ic3-center@ufl.edu
Or you can submit an issue directly in this repository, and we will respond promptly.
This repository is licensed under the MIT License. See LICENSE for details.

