You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(fetchers): enhance ArXivFetcher with PDF binary indication (#89)
## What
Enhance ArXivFetcher with binary content indication for PDF URLs.
## Why
Closes#57 — When agents request /pdf/ URLs, the fetcher should indicate
that the original content is binary (PDF) and only metadata is returned,
consistent with the core binary handling behavior.
## How
- Added `is_pdf_url()` helper to detect /pdf/ vs /abs/ URLs
- Added binary content note in metadata section for PDF URLs
- Added tests for PDF detection, DOI/journal ref extraction
## Risk
- Low
- Only adds informational note to output for PDF URLs
### Checklist
- [x] Unit tests are passed
- [x] Smoke tests are passed
- [x] Specs are up to date and not in conflict
0 commit comments