otto next page extraction does not work as expected for some pages

I just noticed in the otto log files, that the next page extraction is not working as expected for some pages and the following error shows up: 

```
2023-01-21 04:38:57 [scrapy.core.scraper] ERROR: Spider error processing <GET https://www.otto.de/schuhe/halbschuhe/?nachhaltigkeit=alle-nachhaltigen-artikel&zielgruppe=herren&l=gq&o=120 via http://splash:8050/execute> (referer: None)
[...]
  File "/tmp/scraping-1674237617-grym39ye.egg/scraping/spiders/otto_de.py", line 65, in parse_SERP
    if int(pagination_info["o"]) > response.meta.get("o", 0):
ValueError: invalid literal for int() with base 10: ''
```

This Error shows up 18 times in the log file. In #115 we updated the next page extraction in order to scrape products without filtering for sustainable products only. It could be that this change is the cause, but needs some inspection.





Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

otto next page extraction does not work as expected for some pages #120

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

otto next page extraction does not work as expected for some pages #120

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions