Data Crawler & Automation Engineer building production scraping systems and open-source extraction tools.
From closed APIs β reverse-engineered protocols β pip-installable packages β data at scale.
| Area | What you can expect |
|---|---|
| API Reverse Engineering | Decode private protocols (protobuf, GraphQL, internal REST) β direct HTTP extraction, no browser needed, 50x faster |
| Anti-Bot Evasion | Bypass Cloudflare, Shape Security, Incapsula, DataDome using anti-detect browsers, TLS fingerprinting & ISP proxies |
| Scalable Data Pipelines | Async scraping at 100K+ records/week β’ proxy rotation β’ checkpoint resumption β’ structured output |
| Open-Source Tooling | Production-ready pip packages with full docs, streaming APIs, event systems & CLI interfaces |
| Government Data Collection | 23 US states automated β business registrations & professional licenses from gov portals |
| Project | What it does | Link |
|---|---|---|
| GoogleMapsCollector | Reverse-engineers Google Maps' internal protobuf API β 100K+ records/week, no browser, no API key | Repo Β· pip install gmaps-extractor |
| MetaAdsCollector | Reverse-engineers Meta's private GraphQL API β full Ad Library extraction across all countries | Repo Β· pip install meta-ads-collector |
| google-maps-pb-decoder | Protobuf decoder for Google Maps' binary wire format β research & extraction toolkit | Repo |
| Project | What it does | Link |
|---|---|---|
| generic-scraper-1 | LLM-powered structured extraction from any website β define fields, get data, no selectors needed | Repo Β· pip install scraper |
| linkedin-profile-extractor | LinkedIn profile extraction with anti-detection β experience, education, skills, full profiles | Repo |
| google-maps-scraper | Google Maps scraping via browser automation with stealth mode | Repo |
| Project | What it does | Link |
|---|---|---|
| gov_websites_collector | Collects business registrations & professional licenses from 23 US state government websites β Camoufox anti-detect + ISP proxies | Repo |
| Category | Tools |
|---|---|
| Languages & Core | |
| Scraping & Automation | |
| Reverse Engineering | |
| Backend & APIs | |
| Databases | |
| Infrastructure |
- π API Reverse Engineering: Decode closed/private APIs (protobuf, GraphQL, internal REST) for direct, fast data extraction
- π‘οΈ Anti-Bot Evasion: Defeat Cloudflare, Imperva, Shape Security, DataDome β TLS fingerprinting, anti-detect browsers, ISP proxies
- β‘ Scalable Scraping Systems: Async pipelines, proxy rotation, checkpoint resumption β 100K+ records/week on a single machine
- π¦ Open-Source Tooling: Production-ready pip packages with streaming APIs, event systems, and full documentation
Best contact: LinkedIn
If you find the work useful, a β helps more people discover it.

