🔗 Try it now: DataGator Lite
DataGator Lite is a lightweight Flask web app for uploading datasets and fuzzy-matching their territory names against the standardized International Futures territory names. It supports Excel and CSV files and lets you review and edit the suggested matches manually before exporting the cleaned data as a downloadable CSV file.
- Upload Excel or CSV files
- Smart detection of header rows
- Select specific rows/columns for name matching
- Fuzzy matching using RapidFuzz with a threshold control
- Manual override and review interface
- Export the cleaned dataset
- Secure session handling (server-side)
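
As a rough illustration of the threshold-controlled matching listed above, here is a minimal sketch. It is not the app's actual code: `suggest_match`, `STANDARD_NAMES`, and the default threshold of 80 are hypothetical names and values chosen for the example. It uses RapidFuzz's `fuzz.ratio` when the package is installed and otherwise falls back to the standard library's `difflib`, so it runs without extra installs.

```python
from difflib import SequenceMatcher

try:
    # RapidFuzz is the library named in the feature list.
    from rapidfuzz import fuzz

    def similarity(a: str, b: str) -> float:
        """Similarity on a 0-100 scale."""
        return fuzz.ratio(a, b)
except ImportError:
    # Fallback (an assumption for this sketch): difflib from the standard library.
    def similarity(a: str, b: str) -> float:
        """Similarity on a 0-100 scale."""
        return 100.0 * SequenceMatcher(None, a, b).ratio()

# Hypothetical sample of standardized names, not the real territory list.
STANDARD_NAMES = ["United States", "United Kingdom", "Germany", "China"]


def suggest_match(raw_name, choices=STANDARD_NAMES, threshold=80.0):
    """Return (best_choice, score) if the best score meets the threshold, else None."""
    query = raw_name.strip().lower()
    if not query or not choices:
        return None
    # Compare case-insensitively and keep the highest-scoring candidate.
    best = max(choices, key=lambda c: similarity(query, c.lower()))
    score = similarity(query, best.lower())
    if score >= threshold:
        return best, score
    return None  # below threshold: leave the name for manual review


print(suggest_match("Untied States"))
print(suggest_match("zzzz"))
```

Raising the threshold yields fewer but safer suggestions; names that fall below it return `None` and would be left for the manual review step.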
git clone https://github.com/yourusername/datagator_flask_mongo.git
cd datagator_flask_mongo
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -r requirements.txt
cp .env.example .env
Fill in the required fields (DB_URI_PART1, etc.)
The territory mapping can be read from either of two sources:
- MongoDB: connect directly to a MongoDB database that holds the territory mapping
- Local JSON: read the territory mapping from the JSON file in the data folder
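
For the local JSON option, loading the mapping might look like the sketch below. The file layout is an assumption: the example treats the file as a plain JSON array of name strings, which may not match the real file in the data folder. `parse_territory_names`, `load_territory_names`, and the path `data/territories.json` are hypothetical.

```python
import json
from pathlib import Path


def parse_territory_names(raw_json: str) -> list[str]:
    """Parse a JSON array of names, stripping blanks and duplicates (order kept)."""
    data = json.loads(raw_json)
    if not isinstance(data, list):
        # Assumed schema: a flat array of strings.
        raise ValueError("expected a JSON array of territory names")
    seen = set()
    names = []
    for item in data:
        name = str(item).strip()
        if name and name not in seen:
            seen.add(name)
            names.append(name)
    return names


def load_territory_names(path) -> list[str]:
    """Read and parse the mapping file at `path`."""
    return parse_territory_names(Path(path).read_text(encoding="utf-8"))


# Hypothetical usage; the real file name is not given in this README:
# names = load_territory_names("data/territories.json")
```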
python app.py
The app will then be available at a localhost link.
MIT License — feel free to use and adapt.
Built by Yutang Xiong
Contact: yutang.xiong@du.edu