@sebastian-nagel (Contributor) commented Dec 12, 2025

  • Upgrade crawler-commons dependency to 1.6
  • Robots.txt parser: use URL objects in newly introduced methods to avoid the unnecessary parsing of URLs.
  • Update URLUtil test to adapt to a change in the public suffix list

@sebastian-nagel force-pushed the upgrade-crawler-commons-1.6 branch from 3815d13 to 080c2c1 on December 12, 2025 14:14

@lewismc (Member) left a comment

+1 good addition of tests. Very specific.

@sebastian-nagel merged commit c7cf569 into apache:master on Dec 18, 2025
6 checks passed
@sebastian-nagel deleted the upgrade-crawler-commons-1.6 branch on December 18, 2025 08:45