Releases: breezy89757/SmartScraper
Releases · breezy89757/SmartScraper
v1.1.0: Portable SmartScraper
Added
- Portable Package Export: Download scrapers as ZIP files (
/downloadendpoint). - Auto-Dependency Management: Generated scrapers now include PEP 723 metadata for
uv runsupport and a fallbackvenvsetup. - Task Scheduler Integration:
setup_task.batand PowerShell script to register daily scraper tasks on Windows. - PTT Gossiping Support: Updated default example to handle landing pages like PTT.
- Filename Parsing: Frontend now correctly extracts dynamic filenames from headers.
Changed
- UI Overhaul: Migrated to a professional dark theme.
- README: Rewritten to focus on engineering workflow and architecture.
- Sandbox: Enhanced import security whitelist (e.g.,
urllib.parsesubmodules).
Fixed
- Batch Script Syntax: Fixed
run.batcrashing due to parenthesis parsing inechocommands. - Download Filename: Fixed issue where downloads had UUID filenames without extensions.
SmartScraper v1.0.0 🚀
🎉 SmartScraper v1.0.0 - AI-Powered Web Scraper Generator
SmartScraper 是一款結合 Playwright 與 Azure OpenAI 的智能爬蟲生成工具,能夠自動分析網頁結構並生成可執行的 Python 爬蟲程式碼。
✨ New Features
- AI Auto-Fix (Self-Healing): 當爬取失敗或回傳空值時,AI 能自動診斷並修復程式碼。
- Sandbox Execution: 內建安全沙箱,支援
equests, �s4 與白名單模組執行。 - Visual Analysis: 使用 GPT-5.2-chat 進行視覺化網頁結構分析。
- Interactive UI: 步驟指示器、程式碼編輯器、即時執行結果顯示。
🛡️ Safety & Disclaimer
- 本工具僅供技術研究與學術交流使用。
- 使用者需自行遵守目標網站之
obots.txt 與服務條款。 - 內建 LICENSE (MIT)。
Full Changelog: https://github.com/breezy89757/SmartScraper/commits/v1.0.0