Skip to content

esoterydactyl/WordyLib

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WordyLib

Description WordyLib is a dead simple class library that leverages the HTML Agility Pack to scrape web pages and filter items on a blacklist. A large sample blacklist is provided, containing a large variety of filler, names, non-typographic characters, etc. The goal of WordyLib is to provide a simple way to retrieve only "keywords" from a website.

WordyLib should remain simple and flexible, and will likely become the basis for larger crawling and analytics projects. Because of this, all classes and methods in WordyLib are public. This has the side effect of making them useful in Powershell when imported with "Add-Type".

About

Web scraping and keyword finding.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published