GitHub Scraper

This is an old project that explores two methods of scraping a GitHub repository to collect its versions:
Standard scrape: scrapes each version page independently (not recommended).
GitHub API: a much more efficient way to collect the data, and the preferred method.
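
To illustrate the GitHub API approach, here is a minimal sketch rather than the project's actual APICaller code. It assumes that "versions" correspond to Git tags and uses GitHub's public REST endpoint for listing a repository's tags. The class and method names (`TagsApiSketch`, `tagsEndpoint`, `fetchTagsJson`) are hypothetical, and `java.net.http.HttpClient` requires Java 11 or newer.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Hypothetical helper, not part of the original project.
class TagsApiSketch {

    // Builds the REST endpoint that lists the tags of one repository.
    static String tagsEndpoint(String owner, String repo) {
        return "https://api.github.com/repos/" + owner + "/" + repo + "/tags";
    }

    // Fetches the raw JSON body describing the repository's tags.
    // Unauthenticated requests are rate limited by GitHub.
    static String fetchTagsJson(String owner, String repo)
            throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create(tagsEndpoint(owner, repo)))
                .header("Accept", "application/vnd.github+json")
                .GET()
                .build();
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            throw new IOException("GitHub API returned HTTP " + response.statusCode());
        }
        return response.body();
    }
}
```

Calling `TagsApiSketch.fetchTagsJson("DSappington", "GitHub-Scraper")` would return the JSON for this repository's tags in a single request, which is why the API route is cheaper than loading each version page separately.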

PLEASE NOTE

This is a side project built for EDUCATIONAL purposes. It was developed as an idea and is in no way, shape, or form an ongoing project.

GitHub-Scraper.jar

GitHub-Scraper.jar is a fun little Windows taskbar tool that puts a friendly GUI in front of the user without overwhelming them with a full application.

Simply put, when executed, the Windows taskbar tool appears at the bottom right of the screen.

When the taskbar icon (at the bottom right) is clicked, it asks for a GitHub URL (for example, "https://github.com/DSappington/GitHub-Scraper").
Once the URL is entered, a file saver will appear.
Save the file wherever you would like, and we are done.

Check the file: all of the versions will have been written to that designated location.
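
The URL entered in the steps above has to be turned into an owner and a repository name before anything can be looked up. The sketch below shows one way that could be done. It is an assumption about how such input might be handled, not the tool's actual code, and `RepoUrlSketch` and `parseOwnerAndRepo` are hypothetical names.

```java
import java.net.URI;

// Hypothetical helper, not part of the original project.
class RepoUrlSketch {

    // Returns {owner, repo} for a URL such as https://github.com/owner/repo.
    // Throws IllegalArgumentException if the URL does not look like a repo URL.
    static String[] parseOwnerAndRepo(String url) {
        URI uri = URI.create(url.trim());
        String host = uri.getHost();
        if (host == null || !host.equalsIgnoreCase("github.com")) {
            throw new IllegalArgumentException("Not a github.com URL: " + url);
        }
        // The path starts with "/", so the first element of the split is empty.
        String[] parts = uri.getPath().split("/");
        if (parts.length < 3 || parts[1].isEmpty() || parts[2].isEmpty()) {
            throw new IllegalArgumentException("Expected /owner/repo in: " + url);
        }
        String repo = parts[2];
        // Accept clone-style URLs that end in ".git".
        if (repo.endsWith(".git")) {
            repo = repo.substring(0, repo.length() - ".git".length());
        }
        return new String[] { parts[1], repo };
    }
}
```

Rejecting malformed input before the file saver opens would keep the user from saving an empty file for a URL that could never be resolved.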

Below are the classes and what each one does:
APICaller: uses the GitHub API to make calls to a GitHub repo.
App: the Windows taskbar executable.
Crawler: (OLD) crawls the GitHub repo webpage.
Format: formatting class that cleans up System calls (makes debugging easier).
VersionSort: sorts the collected versions before they are written to file.
VersionWriter: writes the collected versions to a file.
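
The sort-then-write step handled by the last two classes can be sketched as follows. This is an illustration under stated assumptions, not the project's VersionSort or VersionWriter code: it assumes versions look like `v1.2.0`, compares them numerically part by part (so `1.10` sorts after `1.9`), and ignores suffixes such as `-beta`. All names are hypothetical.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the original project.
class VersionSortSketch {

    // Compares two dotted versions numerically, treating missing parts as 0.
    static int compareVersions(String a, String b) {
        String[] x = stripPrefix(a).split("\\.");
        String[] y = stripPrefix(b).split("\\.");
        int n = Math.max(x.length, y.length);
        for (int i = 0; i < n; i++) {
            int xi = i < x.length ? leadingNumber(x[i]) : 0;
            int yi = i < y.length ? leadingNumber(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    // Drops a leading "v", as in "v1.2.0".
    static String stripPrefix(String version) {
        return version.startsWith("v") ? version.substring(1) : version;
    }

    // Parses the leading digits of one part; a part with no leading digits counts as 0.
    // Assumes each numeric part fits in an int.
    static int leadingNumber(String part) {
        int end = 0;
        while (end < part.length() && Character.isDigit(part.charAt(end))) {
            end++;
        }
        return end == 0 ? 0 : Integer.parseInt(part.substring(0, end));
    }

    // Returns a new list sorted from oldest to newest; the input is not modified.
    static List<String> sortVersions(List<String> versions) {
        List<String> sorted = new ArrayList<>(versions);
        sorted.sort(VersionSortSketch::compareVersions);
        return sorted;
    }

    // Writes one version per line to the file chosen by the user.
    static void writeVersions(Path target, List<String> versions) throws IOException {
        Files.write(target, sortVersions(versions), StandardCharsets.UTF_8);
    }
}
```

A plain string sort would place `v1.10.0` before `v1.2.0`, so comparing each numeric part separately is what keeps the output file in release order.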

