Skip to content

Releases: BlockSecCA/docfilter

DocFilter v1.7.1 - Critical PDF Processing Fix

22 Jun 01:35

Choose a tag to compare

This release resolves critical PDF processing issues that were causing crashes and incorrect behavior in browser integration.

🚨 Critical Fixes

  • Fixed browser integration crashes - Eliminated stack overflow and performance degradation when processing PDFs via bookmarklet
  • Fixed arXiv PDF processing - URLs like https://arxiv.org/pdf/1234567 now work correctly (previously failed due to missing .pdf extension)
  • Fixed false "token limit exceeded" errors - Small PDFs that work fine via file upload now also work via browser integration
  • Fixed raw binary content saving - PDFs now properly extract text instead of saving garbled binary data

✅ All PDF Scenarios Now Working

  • Small PDF upload: ✅ Processes fully
  • Small PDF web/arXiv: ✅ Processes fully
  • Large PDF upload: ✅ Truncated with content preserved
  • Large PDF web: ✅ Truncated with content preserved

🛠️ Technical Improvements

  • Enhanced PDF URL detection for arXiv, query parameters, and fragments
  • Added comprehensive debug logging visible in DevTools console
  • Improved buffer content detection regardless of filename extension
  • Better error handling with helpful troubleshooting guidance

📋 Upgrade Notes

  • No breaking changes - existing functionality preserved
  • Database schema unchanged - no migration required
  • All existing configurations and data remain intact

Recommended for all users - especially those using browser integration with PDF content.