Webscraping-SQL-EDA | Masai School Build-Week Project

Role: Team Member (1 of 3)
Focus Areas: Web Scraping · SQL Insights · Exploratory Data Analysis (EDA)

Project Overview

This build-week project (Masai School) focused on scraping book data from the web, analyzing it using SQL, and deriving insights through Exploratory Data Analysis (EDA).

Tech Stack

Python (web scraping, data cleaning, EDA via Jupyter Notebook)
SQL (data storage and querying: .sql scripts)
CSV (intermediate storage)
Jupyter Notebook (.ipynb)
Optional: PowerPoint (.pptx) for presentations

Project Workflow

Web Scraping: Used Python to extract book details (title, price, ratings, etc.) into book_data.csv.
Data Cleaning: Handled missing values, duplicates, and formatting errors.
SQL Analysis: Ran the queries in BooksData_Insights.sql to derive insights such as availability and the most expensive books.
EDA & Visualization: Carried out in the notebook (EDA.ipynb), with histograms, box plots, and summary tables.
Presentation: Summarized findings in a PowerPoint (Web-Scraping-SQL-Insights-and-EDA.pptx).
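
The scraping and cleaning steps above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the project's actual scraper: the HTML markup, the BookParser class, and the clean and to_csv helpers are all hypothetical, since this README does not show the target site's structure or the real code.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical markup; the real site's HTML structure is not given in this README.
SAMPLE_HTML = """
<div class="book" data-title="Book A" data-rating="3"><span class="price">£12.50</span></div>
<div class="book" data-title="Book B" data-rating="5"><span class="price">£45.00</span></div>
<div class="book" data-title="Book A" data-rating="3"><span class="price">£12.50</span></div>
<div class="book" data-title="Book C" data-rating="2"><span class="price"></span></div>
"""


class BookParser(HTMLParser):
    """Collects one raw row (title, rating, price text) per book element."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_price = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div" and attrs.get("class") == "book":
            self.rows.append({
                "title": attrs.get("data-title"),
                "rating": attrs.get("data-rating"),
                "price": "",
            })
        elif tag == "span" and attrs.get("class") == "price":
            self._in_price = True

    def handle_data(self, data):
        # Only text inside the price span is kept.
        if self._in_price and self.rows:
            self.rows[-1]["price"] += data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._in_price = False


def clean(rows):
    """Drop rows with missing values, strip the currency symbol, remove duplicates."""
    seen, cleaned = set(), []
    for row in rows:
        price_text = row["price"].replace("£", "").strip()
        if not row["title"] or not row["rating"] or not price_text:
            continue  # missing value
        record = (row["title"], float(price_text), int(row["rating"]))
        if record in seen:
            continue  # duplicate
        seen.add(record)
        cleaned.append({"title": record[0], "price": record[1], "rating": record[2]})
    return cleaned


def to_csv(rows):
    """Serialize cleaned rows as CSV text (written to a string here, not to disk)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["title", "price", "rating"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


parser = BookParser()
parser.feed(SAMPLE_HTML)
books = clean(parser.rows)
print(to_csv(books))
```

In a real run the HTML would come from an HTTP request rather than an inline string, and the CSV text would be written to book_data.csv.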

Key Insights

Price Distribution: The majority of books are priced under £30.
Ratings: Most books have 1–3 star ratings.
Availability: Most books are fully available.
Price vs Rating: Higher-rated books (4–5 stars) generally have higher prices, suggesting a positive association between rating and price.
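
As an illustration of how figures like these could be computed, here is a small sketch. The rows below are made-up sample data, not the project's dataset, and the helper names share_under and mean_price_by_band are hypothetical.

```python
from statistics import mean

# Hypothetical rows for illustration only; not the project's actual data.
books = [
    {"title": "Book A", "price": 12.50, "rating": 3},
    {"title": "Book B", "price": 45.00, "rating": 5},
    {"title": "Book C", "price": 22.00, "rating": 1},
    {"title": "Book D", "price": 28.00, "rating": 4},
]


def share_under(rows, threshold):
    """Fraction of books priced strictly below the threshold."""
    return sum(1 for r in rows if r["price"] < threshold) / len(rows)


def mean_price_by_band(rows):
    """Average price for low-rated (1-3 stars) and high-rated (4-5 stars) books."""
    bands = {"1-3 stars": [], "4-5 stars": []}
    for r in rows:
        key = "4-5 stars" if r["rating"] >= 4 else "1-3 stars"
        bands[key].append(r["price"])
    return {band: mean(prices) for band, prices in bands.items() if prices}


print(share_under(books, 30))
print(mean_price_by_band(books))
```

Comparing the two band averages gives a quick check of the price-versus-rating pattern before plotting it in the notebook.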

Team Members

Name	Role
Tanmay Manna	Team Lead — Web Scraping
Diya Shah	SQL Insights
Prince Raj Gupta	Exploratory Data Analysis (EDA)

How to Run Locally

# Clone repo
git clone https://github.com/princerg/webscraping-sql-eda.git
cd webscraping-sql-eda

# Install dependencies
pip install -r requirements.txt

# Run Jupyter Notebook
jupyter notebook notebooks/EDA.ipynb

# Execute SQL insights
# Open BooksData_Insights.sql in MySQL Workbench and run queries
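
If MySQL Workbench is not at hand, the same kind of query can be tried from Python. The sketch below uses the standard-library sqlite3 module as a stand-in for MySQL; the table name, the column names, and the sample rows are assumptions for illustration, and the queries are not copied from BooksData_Insights.sql.

```python
import sqlite3

# Hypothetical rows and schema; the real columns in book_data.csv may differ.
rows = [
    ("Book A", 12.50, 3, "In stock"),
    ("Book B", 45.00, 5, "In stock"),
    ("Book C", 22.00, 1, "Out of stock"),
]

conn = sqlite3.connect(":memory:")  # in-memory database, nothing written to disk
conn.execute(
    "CREATE TABLE books (title TEXT, price REAL, rating INTEGER, availability TEXT)"
)
conn.executemany("INSERT INTO books VALUES (?, ?, ?, ?)", rows)

# The two most expensive books.
expensive = conn.execute(
    "SELECT title, price FROM books ORDER BY price DESC LIMIT 2"
).fetchall()

# Number of books per availability status.
availability = conn.execute(
    "SELECT availability, COUNT(*) FROM books "
    "GROUP BY availability ORDER BY availability"
).fetchall()

print(expensive)
print(availability)
conn.close()
```

SQLite and MySQL differ in some syntax and functions, so queries written for one may need small adjustments to run on the other.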
