🕸️ Web Scraping Scripts

Welcome to the Web Scraping Scripts repository under the HashSlap Summer of Code (HSSoC) initiative!

This repository houses modular web scraping scripts that extract useful data from the web. Whether you're automating data collection, learning web scraping, or building utilities, this is a good playground for it.


📦 What's in This Repo?

Here you'll find Python scraping scripts for:

  • 📰 News websites
  • 📦 E-commerce price trackers
  • 🧑‍🎓 GitHub profile scrapers
  • 📊 Stock market data grabbers
  • 🎥 YouTube metadata collectors

The scripts are built with popular tools such as requests, BeautifulSoup, and Selenium.


🧠 Goals of This Project

  • Provide real-world scraping examples
  • Help beginners learn how to parse HTML and interact with websites
  • Build a collaborative collection of reliable scrapers
  • Support modular, reusable script architecture
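One way to read the "modular, reusable" goal is to keep parsing separate from fetching, so the parsing step can be exercised on a saved HTML string. The sketch below is an illustration under that assumption, not code from this repository: `HeadlineParser`, `extract_headlines`, and `SAMPLE_HTML` are hypothetical names, and it uses the standard library's `html.parser` in place of BeautifulSoup so that it runs without extra dependencies.

```python
from html.parser import HTMLParser


class HeadlineParser(HTMLParser):
    """Collects the text inside every <h2> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._in_h2 = False   # True while we are between <h2> and </h2>
        self._buffer = []     # text fragments of the current heading
        self.headlines = []   # finished headings, in document order

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self._in_h2 = True
            self._buffer = []

    def handle_data(self, data):
        # Nested tags such as <em> still deliver their text here.
        if self._in_h2:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "h2" and self._in_h2:
            text = "".join(self._buffer).strip()
            if text:
                self.headlines.append(text)
            self._in_h2 = False


def extract_headlines(html):
    """Parse an HTML string and return the list of <h2> texts."""
    parser = HeadlineParser()
    parser.feed(html)
    parser.close()
    return parser.headlines


# Made-up markup used only to demonstrate the parser offline.
SAMPLE_HTML = """
<html><body>
  <h2>First headline</h2>
  <p>Some text</p>
  <h2>Second <em>headline</em></h2>
</body></html>
"""

if __name__ == "__main__":
    print(extract_headlines(SAMPLE_HTML))
```

Because `extract_headlines` takes a string, the same function could be fed the body of a response from `requests` or a page source from Selenium; which tags to collect depends entirely on the target site.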

🗂 Folder Structure

.
├── ecommerce/
│   └── amazon_price_tracker.py
├── news/
│   └── bbc_headlines.py
├── social/
│   └── github_stats_scraper.py
├── video/
│   └── youtube_video_data.py
└── README.md

🤝 Contributing Guidelines

  • Use Python 3.x
  • Organize scripts by category
  • Keep your code clean and documented
  • Output must be in JSON, CSV, or pretty-printed format
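The output rule above can be met with the standard library alone. The following is a minimal sketch, assuming each scraped item is a flat dictionary; `RECORDS`, `to_json`, `to_csv`, and `to_pretty` are hypothetical names, and the sample values are made up for illustration.

```python
import csv
import io
import json
from pprint import pformat

# Made-up sample records; a real scraper would build these from parsed pages.
RECORDS = [
    {"title": "Example item A", "price": 19.99},
    {"title": "Example item B", "price": 5.0},
]


def to_json(records):
    """Serialize records as indented JSON, keeping non-ASCII text readable."""
    return json.dumps(records, indent=2, ensure_ascii=False)


def to_csv(records):
    """Serialize records as CSV text; assumes at least one record
    and that every record has the same keys as the first."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(records[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_pretty(records):
    """Return a pretty-printed representation for console output."""
    return pformat(records)


if __name__ == "__main__":
    print(to_json(RECORDS))
    print(to_csv(RECORDS))
    print(to_pretty(RECORDS))
```

Returning strings rather than writing files directly keeps the formatting functions easy to test; a script can then write the result to whichever path it chooses.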

📌 Scrape only public content, and respect each site's robots.txt.
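A script can check robots.txt before requesting a page by using the standard library's `urllib.robotparser`. The sketch below is one possible approach, not a helper that ships with this repository: `is_allowed`, `SAMPLE_ROBOTS_TXT`, the `my-scraper` user agent, and the `example.com` URLs are all assumptions, and the rules are inlined so the example runs offline.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so the sketch needs no network.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "https://example.com/public/page",
        "https://example.com/private/data",
    ):
        print(url, is_allowed(SAMPLE_ROBOTS_TXT, "my-scraper", url))
```

Against a live site, `RobotFileParser.set_url()` followed by `read()` downloads the site's own robots.txt instead of parsing an inlined string.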


🚀 Getting Started

git clone https://github.com/your-username/scraping-scripts.git
cd scraping-scripts
python3 your-script.py

📜 License

This project is licensed under the MIT License.