Skip to content

Latest commit

 

History

History
38 lines (24 loc) · 1.12 KB

File metadata and controls

38 lines (24 loc) · 1.12 KB

Dataset Retriever Program

This repository contains Ironhack project corresponding to module 1 - Building a Pipeline.

The main function of the program is to retrieve a clean and wrangled dataset fulfilling the requirements of Challenge 1. Also retrieves two different tables with results of Bonus 1 and Bonus 2 challenges.

Main topics

  • Connection to SQL database
  • SQL Queries
  • Python Programming
  • Handling Pandas library
  • Working with APIs and API Wrappers
  • Web Scraping
  • Building a Pipeline

Installation

Use the package manager conda to install libraries in the environment you are executing the program. Required libraries are shown below:

conda install pandas
conda install -c anaconda sqlalchemy
conda install -c conda-forge tqdm

Usage

Execute the file main.py from terminal or and IDE (i.e. Pycharm).

References