ComBase Scraper

Parallel Processing: 10 threads for 10x speed improvement
Search Delay: 2-minute wait after search before scraping starts
Deduplication: Removes duplicate food parts from organism names
Thread-Safe: Real-time progress tracking across all threads

Simple ComBase data scraper with English interface.

Quick Start

pip install -r config/requirements.txt

Single Thread (Simple):

python simple_scraper.py

Parallel (10 Threads - Faster):

python parallel_scraper.py

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
archive		archive
config		config
docs		docs
src		src
.gitignore		.gitignore
README.md		README.md
parallel_scraper.py		parallel_scraper.py
simple_scraper.py		simple_scraper.py