
Amazon Price Web Scraping

Collect Historical Price Data From Amazon

Table of Contents

Project Background
Install & Setup
Usage
Authors
License

Project Background

Pipeline: (pipeline diagram)

DAG: (Airflow DAG graph)

Collect historical price data for selected products to use for analytical purposes. The price data is saved to S3 and to the local file system.

Objective: show a client the price variation of one or more products.
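At its core, the collection step boils down to fetching a product page and parsing the price out of the HTML. A minimal illustration of the parsing half, using only the standard library (this is not the repository's actual scraper; the element id checked here is an assumption, and Amazon's markup changes frequently):

```python
from html.parser import HTMLParser

class PriceParser(HTMLParser):
    """Extracts the text of the first element with a price-like id.

    The id "priceblock_ourprice" is an assumption for illustration only.
    """
    def __init__(self):
        super().__init__()
        self._in_price = False
        self.price = None

    def handle_starttag(self, tag, attrs):
        if dict(attrs).get("id") == "priceblock_ourprice":
            self._in_price = True

    def handle_data(self, data):
        if self._in_price and self.price is None:
            self.price = data.strip()
            self._in_price = False

# Parse a static snippet; a real run would first fetch the product page.
html = '<span id="priceblock_ourprice">$19.99</span>'
parser = PriceParser()
parser.feed(html)
print(parser.price)  # $19.99
```

The DAG would run this kind of extraction for each product link, then append the result (with a timestamp) to the local file system and to S3.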

Data analysis based on the collected data is coming soon: the data will be read straight from S3 and analyzed with Pandas and Matplotlib.
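A sketch of what that analysis could look like, using in-memory sample data in place of S3 (the column names and values are assumptions, not the pipeline's actual schema):

```python
import pandas as pd

# Sample price history; in the real pipeline this would be loaded from S3,
# e.g. via pd.read_csv on the bucket's object. Column names are assumptions.
df = pd.DataFrame({
    "date": pd.to_datetime(["2023-01-01", "2023-01-02", "2023-01-03"]),
    "product": ["example-product"] * 3,
    "price": [49.99, 44.99, 52.99],
})

# Price variation per product over the collected period.
variation = df.groupby("product")["price"].agg(["min", "max"])
variation["spread"] = variation["max"] - variation["min"]
print(variation)
```

A Matplotlib line chart of price over time per product would follow the same pattern, plotting the `date` column against `price`.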

Install & Setup

git clone https://github.com/ccallazans/price-web-scraping.git
cd price-web-scraping

Edit the docker-compose file at /airflow/docker-compose.yaml with your AWS credentials:

AWS_ACCESS_KEY_ID: ''
AWS_SECRET_ACCESS_KEY: ''
AWS_DEFAULT_REGION: ''
AWS_S3_BUCKET_NAME: ''
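In a typical Airflow docker-compose setup these variables live under an environment block of the Airflow services; a sketch of what the edited section might look like (the service name and surrounding structure are assumptions about this repository's compose file, and the values are placeholders):

```yaml
services:
  airflow-webserver:
    environment:
      AWS_ACCESS_KEY_ID: 'your-access-key'      # placeholder
      AWS_SECRET_ACCESS_KEY: 'your-secret-key'  # placeholder
      AWS_DEFAULT_REGION: 'us-east-1'           # example region
      AWS_S3_BUCKET_NAME: 'your-bucket-name'    # placeholder
```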

From the ~/price-web-scraping/ folder, run start.sh to create and start the containers:

bash start.sh

Wait for the containers to come up, then access the Airflow webserver at:

localhost:8080

Enter the following credentials:

username: airflow
password: airflow

On the DAGs page, trigger the following DAG: "amazon_web_scrap"

Usage

The scraper collects data for a list of product links. Edit the "src/products.json" file to manage these links, adding a new entry for each product you want to track.
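The exact format of src/products.json is defined in the repository; a hypothetical example, assuming it is a plain list of product URLs (the URLs below are placeholders, not real products):

```json
[
  "https://www.amazon.com/dp/EXAMPLE-ASIN-1",
  "https://www.amazon.com/dp/EXAMPLE-ASIN-2"
]
```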

Authors

License

MIT License
