Skip to content

Latest commit

 

History

History
30 lines (12 loc) · 489 Bytes

File metadata and controls

30 lines (12 loc) · 489 Bytes

imdb_scraper

Scrapy project for scraping data from IMDB

Clone the repo and navigate into imdb_scraper folder.

$ git clone https://github.com/samacyc/imdb_scraper

$ cd imdb_scraper/

$ pip install scrapy

$ cd imdb/

You can Change the category that you want scrap and number of pages in imdb/spiders/imdb_spyder.py

Start the crawler.

$ scrapy crawl drama -o filename.json or filename.csv You have both options

Data will be stored in the same directory .

Happy Sraping.