Releases: breezy89757/SmartScraper

v1.1.0: Portable SmartScraper

04 Jan 07:46

Added

  • Portable Package Export: Download scrapers as ZIP files (/download endpoint).
  • Auto-Dependency Management: Generated scrapers now include PEP 723 metadata for uv run support and a fallback venv setup.
  • Task Scheduler Integration: setup_task.bat and PowerShell script to register daily scraper tasks on Windows.
  • PTT Gossiping Support: Updated default example to handle landing pages like PTT.
  • Filename Parsing: Frontend now correctly extracts dynamic filenames from headers.
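
To illustrate the PEP 723 metadata mentioned above, here is a minimal sketch of how such a header could be written and read back. The helper names (build_header, extract_metadata), the dependency list, and the requires-python value are assumptions for illustration, not SmartScraper's actual generator code. The regular expression follows the reference pattern published in PEP 723.

```python
import re

# Reference pattern from PEP 723 for locating an inline metadata block.
PEP723_BLOCK = re.compile(
    r"(?m)^# /// (?P<type>[a-zA-Z0-9-]+)$\s(?P<content>(^#(| .*)$\s)+)^# ///$"
)


def build_header(dependencies, requires_python=">=3.11"):
    """Return a PEP 723 'script' block as comment lines (hypothetical helper)."""
    lines = [
        "# /// script",
        f'# requires-python = "{requires_python}"',
        "# dependencies = [",
    ]
    lines += [f'#   "{dep}",' for dep in dependencies]
    lines += ["# ]", "# ///"]
    return "\n".join(lines) + "\n"


def extract_metadata(script):
    """Return the TOML text of the 'script' block, or None if there is none."""
    for match in PEP723_BLOCK.finditer(script):
        if match.group("type") == "script":
            # Strip the leading "# " (or bare "#") from every content line.
            return "".join(
                line[2:] if line.startswith("# ") else line[1:]
                for line in match.group("content").splitlines(keepends=True)
            )
    return None


# Assumed dependencies; a generated scraper may declare different ones.
header = build_header(["requests", "beautifulsoup4"])
scraper_source = header + "import requests\n"
print(extract_metadata(scraper_source))
```

A script that starts with such a block can be launched with uv run, which resolves the declared dependencies before executing it.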

Changed

  • UI Overhaul: Migrated to a professional dark theme.
  • README: Rewritten to focus on engineering workflow and architecture.
  • Sandbox: Enhanced import security whitelist (e.g., urllib.parse submodules).
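
As a rough illustration of an import whitelist that distinguishes submodules, the sketch below checks a script's imports with the standard ast module. The allowlist contents and the function names are assumptions, not SmartScraper's actual sandbox, and an import check alone does not make a complete sandbox.

```python
import ast

# Hypothetical allowlist: an entry permits that module and everything below it.
ALLOWED_MODULES = {"requests", "bs4", "json", "re", "urllib.parse"}


def is_allowed(module):
    """True if the module or any dotted parent prefix is on the allowlist."""
    parts = module.split(".")
    return any(
        ".".join(parts[:i]) in ALLOWED_MODULES for i in range(1, len(parts) + 1)
    )


def find_disallowed_imports(source):
    """Return the imported module names in source that are not allowlisted."""
    blocked = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            candidates = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom):
            if node.level > 0:
                # Relative imports are rejected outright in this sketch.
                blocked.append("." * node.level + (node.module or ""))
                continue
            if is_allowed(node.module):
                continue
            # "from urllib import parse" is judged as "urllib.parse".
            candidates = [f"{node.module}.{alias.name}" for alias in node.names]
        else:
            continue
        blocked.extend(name for name in candidates if not is_allowed(name))
    return blocked


sample = (
    "import requests\n"
    "from urllib.parse import urljoin\n"
    "from urllib import parse\n"
    "import os\n"
    "import urllib.request\n"
)
print(find_disallowed_imports(sample))
```

Listing urllib.parse rather than urllib keeps URL helpers available while leaving urllib.request blocked.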

Fixed

  • Batch Script Syntax: Fixed run.bat crashing due to parenthesis parsing in echo commands.
  • Download Filename: Fixed issue where downloads had UUID filenames without extensions.

SmartScraper v1.0.0 🚀

03 Jan 11:38

🎉 SmartScraper v1.0.0 - AI-Powered Web Scraper Generator

SmartScraper is an intelligent scraper generator that combines Playwright with Azure OpenAI to analyze a web page's structure automatically and generate runnable Python scraper code.

✨ New Features

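  • AI Auto-Fix (Self-Healing): When a scrape fails or returns an empty result, the AI diagnoses the problem and repairs the code automatically.
  • Sandbox Execution: Built-in secure sandbox that runs code using requests, bs4, and other whitelisted modules.
  • Visual Analysis: Uses GPT-5.2-chat for visual analysis of web page structure.
  • Interactive UI: Step indicator, code editor, and live display of execution results.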
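
The self-healing behavior described above can be pictured as a bounded retry loop. This is only a sketch under stated assumptions: run and repair are hypothetical callables standing in for the sandbox executor and the AI repair step, and the attempt limit is invented for illustration.

```python
def run_with_auto_fix(code, run, repair, max_attempts=3):
    """Run code; on an exception or empty result, ask repair for a new version.

    run(code) returns the scraped data, and repair(code, reason) returns
    revised code. Both are hypothetical stand-ins supplied by the caller.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            result = run(code)
            error = None
        except Exception as exc:  # any failure becomes feedback for the repair step
            result, error = None, repr(exc)
        if error is None and result:
            return code, result
        if attempt < max_attempts:
            code = repair(code, error or "empty result")
    raise RuntimeError(f"no usable result after {max_attempts} attempts")


# Stub executor: "v1" crashes, "v2" returns nothing, "v3" succeeds.
def fake_run(code):
    if code == "v1":
        raise ValueError("selector not found")
    return [] if code == "v2" else ["row"]


reasons = []


def fake_repair(code, reason):
    reasons.append(reason)
    return "v" + str(int(code[1:]) + 1)


final_code, data = run_with_auto_fix("v1", fake_run, fake_repair)
print(final_code, data, reasons)
```

Treating an empty result the same way as an exception matters here, because a scraper with stale selectors often runs without error and simply returns nothing.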

🛡️ Safety & Disclaimer

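  • This tool is intended for technical research and academic exchange only.
  • Users are responsible for complying with the target site's robots.txt and terms of service.
  • Ships with a LICENSE file (MIT).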

Full Changelog: https://github.com/breezy89757/SmartScraper/commits/v1.0.0