Skip to content

zaidongy/Spotify-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spotify Logo Spotify Web Scraper

Spotify Web Scraper crawls through the Spotify web interface and extracts artist logo information.

Technology Stack

Dependencies

Install Dependencies with pip

pip install beautifulsoup4
pip install pymongo
pip install selenium

Alternatively you can install the requirement.txt

pip install -r requirement.txt

Configuration Setup

The scraper will look for config.py file at the root with the following parameters to connect to your database

# config.py

DATABASE_CONFIG = {
    'host': 'mongodb://{0}:{1}@MONGODB_HOST/DB_COLLECTION_NAME',
    'dbuser': '', # Database user
    'dbuserpassword': '', # Database password
}

Selenium Chrome Webdriver

Since Spotify webpages are generated dynamically, we need a headless browser to generate the webpages dynamically from the source. I have chosen the Chrome WebDriver which works great in this instance.

Download and save the Chrome WebDriver at the root level.

Author

Chris Yang

About

Tool to scrape the Spotify web interface to retrieve artist logos

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages