Skip to content

RuhamLeal/tech-news-scraper

Repository files navigation

Tech News Scraper 💻

Summary
  1. About the Project
  2. Technologies
  3. Features
  4. How to Execute the Project
  5. Contact

About the Project

Tech News is a Python-based data scraping project that uses Beautiful Soup to perform web scraping on Trybe's blog and fetch technology news and articles, storing them in a non-sequential MongoDB database.


Technologies

✨ Show Tecnologies
  • PYTHON 3
  • MONGODB
  • PYMONGO
  • PYTEST
  • BEAUTIFUL SOUP
  • FLAKE

Features

  • Collect news and articles from Trybe's blog using web scraping and store them in a MongoDB database
  • Search news by title
  • Search news by date
  • Browse news by category
  • List the top 5 categories with the most news

How to Execute the Project

To run the project locally, follow the steps below.

  1. Check that your machine has the minimum configuration for the project to run;
  • Python 3;
  • Docker;
  • Docker-compose version equal to or greater than 1.29.2.
  1. Clone the repository;
git clone https://github.com/RuhamLeal/tech_news_scraper.git
  1. Navigate to the root of the project;
cd ./tech_news_scraper
  1. Create and activate the virtual environment.
python3 -m venv .venv

source .venv/bin/activate
  1. Install dependencies in the virtual environment.
python3 -m pip install -r dev-requirements.txt
  1. If you don't have MongoDB installed locally, launch it via Docker.
docker-compose up -d mongodb

7.Run the following command to access the menu.

tech-news-analyzer

Contact

Ruham Leal
Email: ruhamxlpro@hotmail.com
Linkedin

Voltar ao topo

About

A Python project to scrape data from trybe's blog that catches technology news and articles

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors