Skip to content

blazardsky/asphalt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ASPHALT

Asphalt is a web scraper for aggregating articles from a blog or news website and use it for context in various AI.

Articles Scraper For AI Training

Created by: Niccolò Agnoletti License: GPL v2.0 or greater Language: Python 3 Version: 0.1a System: MacOS

GUIDE

For a specific website, edit the file selectors.json with the corect html structure. Blog is for the blog home page where the scraper will gather links, Article is for the single article structure where the scraper will gather the content.

Start Project

python3 -m venv venv
source venv/bin/activate

Close project

deactivate

About

web scraper to extract data from articles that can be feed to AIs

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages