Skip to content

stzheng716/newbee_scraper

Repository files navigation

newbee_scraper

Taking the Sting Out of the Job Hunt


Running the project locally

  1. clone the repo
  2. navigate to the project root
  3. create your venv
python -m venv venv
  1. activate and enter your venv
source venv/bin/activate
  1. install dependencies
pip install -r requirements.txt
  1. create your .env file
SECRET_KEY=[YOUR SECRET KEY HERE] or ask someone on the team for the secret key
DATABASE_URL=postgresql:///job_crawler
TEST_DATABASE_URL=postgresql:///job_crawler
DATABASE_NAME=job_crawler
OPEN_AI_API_KEY=[ROBOT API KEY]
  1. Create your local job_crawler database: If you already have a job_crawler DB and want to use the backup data:
    1. drop table, create table, import backup_database.sql:

      1. dropdb job_crawler
      2. createdb job_crawler
      3. psql -d job_crawler -f data/backup_database.sql

      or start from scratch

    2. create the tables and run the scraper (this will take a while):

      1. createdb job_crawler
      2. psql -d job_crawler -f data/migrate.sql
      3. python full_scrape.py

Project Overview:

Who's it for? - Bootcamp grads & junior devs What are we looking for? - Job descriptions that fit our needs (bootcamp grads, no degree) Where are we looking for it? - (for now) https://stillhiring.today/

There will be three scrapers:

  1. Scrape for the URLs

    • Get company "career" URLs
  2. Scrape those job URLs (company websites) for jobs in our field

    • top 5 job boards
      1. jobs.lever.co: 191
      2. boards.greenhouse.io: 113
      3. jobs.ashbyhq.com: 37
      4. jobs.jobvite.com: 8
      5. careers.smartrecruiters.com: 7
  3. Scrape and save the job descriptions

  4. Run the job descriptions through GPT


REMINDER:

Please make sure to run the following commands before you start working on the codebase to ensure you have the most up-to-date packages and code:

git pull
pip install -r requirements.txt

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages