Skip to content

Scraped 1000 books from "Books to Scrape", cleaned the data, stored it in MySQL, analyzed with SQL, and visualized insights using Python.

Notifications You must be signed in to change notification settings

Ddiyaaa/Web_Scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“š Books to Scrape: Data Analysis Project

πŸ“Œ Description

This project involves web scraping data from Books to Scrape β€” an online mock bookstore. The data was cleaned, stored in a MySQL database, analyzed using SQL, and visualized using Python libraries to uncover insights related to book pricing, ratings, availability, and categories.


🧰 Tech Stack

Python BeautifulSoup MySQL Matplotlib Seaborn Jupyter Notebook SQL


✨ Features

Scrapes 1000 books from 50 pages using BeautifulSoup

Extracts title, price, rating, availability

Cleans and stores data in MySQL using mysql.connector

Performs SQL queries for business insights

Visualizes:

Price distribution

Rating frequency

Book availability

Category-based trends

πŸ“ˆ Key Findings

Majority of books are priced under Β£30

Most books hold a 3-star rating

Fiction and Nonfiction dominate the dataset

Availability is generally balanced

High-priced books are rare and often in limited stock

About

Scraped 1000 books from "Books to Scrape", cleaned the data, stored it in MySQL, analyzed with SQL, and visualized insights using Python.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published