ASPHALT

Asphalt is a web scraper that aggregates articles from a blog or news website so they can be used as context for various AI tools.

Articles Scraper For AI Training

Created by: Niccolò Agnoletti
License: GPL v2.0 or greater
Language: Python 3
Version: 0.1a
System: MacOS

GUIDE

To target a specific website, edit the file selectors.json so that it matches the site's HTML structure. The Blog section describes the blog home page, where the scraper gathers article links. The Article section describes a single article page, where the scraper gathers the content.
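
As a rough illustration of the two sections described above, the sketch below loads a selector file and checks that both Blog and Article are present. The key names inside each section (`link`, `title`, `content`), the CSS selector values, and the helper `load_selectors` are assumptions made for this example; the actual layout of selectors.json in this project may differ.

```python
import json

# Hypothetical contents of selectors.json. Only the "Blog" and "Article"
# section names come from the guide; the inner keys and selector values
# are placeholders to adapt to the target website.
EXAMPLE_SELECTORS = """
{
  "Blog": {"link": "article h2 a"},
  "Article": {"title": "h1", "content": "div.post-content"}
}
"""

REQUIRED_SECTIONS = ("Blog", "Article")


def load_selectors(raw: str) -> dict:
    """Parse selector JSON and check that both sections are present."""
    selectors = json.loads(raw)
    missing = [name for name in REQUIRED_SECTIONS if name not in selectors]
    if missing:
        raise KeyError(f"missing section(s): {', '.join(missing)}")
    return selectors


selectors = load_selectors(EXAMPLE_SELECTORS)
print(selectors["Blog"]["link"])        # selector used on the blog home page
print(selectors["Article"]["content"])  # selector used on a single article
```

Checking for both sections up front means a misnamed or missing section is reported before any page is fetched.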

Start Project

python3 -m venv venv
source venv/bin/activate

Close Project

deactivate