OLX issue #5

@mati4458

I am unable to run olx1.py, even though bs4, lxml and html5lib are installed. Otomoto scraping works well, but the OLX scraper fails with FeatureNotFound before it scrapes anything.

~/otomoto_olx-scraper/olx1 $ pip3 install lxml
Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
Requirement already satisfied: lxml in /home/pi/.local/lib/python3.9/site-packages (4.9.1)

~/otomoto_olx-scraper/olx1 $ python3 olx1.py
Starting...
Previous run: 221101-155614
This run: 221101-155812
Folder created: 221101-155812
Page URL: https://is.gd/OhXxPQ
First run - no file exists.
How many pages are there to crawl? 1
Page number: 1 / 1
Waiting for 2 seconds before opening URL...
<●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●> 2/2 [100%] in 2.3s (0.88/s)
Opening page...
Scraping page...
Traceback (most recent call last):
  File "/home/pi/otomoto_olx-scraper/olx1/olx1.py", line 165, in <module>
    pullData(full_page_url) # throw URL to function
  File "/home/pi/otomoto_olx-scraper/olx1/olx1.py", line 118, in pullData
    soup = BeautifulSoup(page, features="lxml") # get URL into BS # *NOTE: v: olx
  File "/home/pi/.local/lib/python3.9/site-packages/bs4/__init__.py", line 248, in __init__
    raise FeatureNotFound(
bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?
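
A possible way to narrow this down, offered as a sketch rather than a confirmed fix: BeautifulSoup only offers the "lxml" feature if `lxml.etree` actually imports in the running interpreter, so a package that pip reports as installed can still be unusable (for example if its compiled extension fails to load). The log above does not show whether that is what happens here. The helper name `usable_parser` and its `candidates` parameter are hypothetical and not part of olx1.py or bs4; the script tries each import and falls back to Python's built-in "html.parser".

```python
import importlib
import sys


def usable_parser(candidates=(("lxml", "lxml.etree"), ("html5lib", "html5lib"))):
    """Return the first parser feature name whose module really imports.

    Each candidate is (feature_name, module_to_import). Falls back to the
    standard-library "html.parser" when none of the candidates import.
    """
    for feature, module_name in candidates:
        try:
            importlib.import_module(module_name)
        except ImportError as exc:
            # Covers both "not installed" and "installed but fails to load".
            print(f"{feature}: not usable ({exc})")
        else:
            return feature
    return "html.parser"


if __name__ == "__main__":
    # Shows which interpreter is running, to compare with the one pip3 used.
    print("Interpreter:", sys.executable)
    print("Parser to pass to BeautifulSoup:", usable_parser())
```

If this prints an import error for lxml, that message is likely more informative than the FeatureNotFound text. As a workaround under the same assumption, the call on line 118 of olx1.py could pass the returned name, e.g. `BeautifulSoup(page, features=usable_parser())`, although the parsers can produce slightly different trees for malformed HTML.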
