Web Scraper for Coffee Dataset

Web scraper implemented in Python for collecting data on coffee.

Files:

coffeescraper.py and coffeescraper2.py - files with Python code to scrape data from chosen retailers;
data_cleaning.py - file with Python code to clean the scraped data, including missing values, outliers, data formatting etc.;
data - folder with datasets in json format;
data/c1 - folder with pictures scraped in process but not used in further processing;

Project description

Present web scraper collects data from online coffee retailers such as price, weight, grind, roast etc. in order to gather a data set for statistical modelling of data related to coffee brewing methods and pricing strategy. Project developed as part of the AiCore fellowship.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
data		data
.gitignore		.gitignore
README.md		README.md
brewed-scraper.py		brewed-scraper.py
coffee-desk-scraper.py		coffee-desk-scraper.py
data-cleaning-all.ipynb		data-cleaning-all.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Web Scraper for Coffee Dataset

Files:

Project description

About

Uh oh!

Releases

Packages

Languages

dominikacecylia/Web-Scraping-Project

Folders and files

Latest commit

History

Repository files navigation

Web Scraper for Coffee Dataset

Files:

Project description

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages