DocFilter

⚠️ DISCLAIMER: This software is provided "AS IS" without warranty of any kind. This is a personal project shared for educational purposes. Issues, pull requests, and support requests will not be processed or responded to. Use at your own risk.

A desktop application for filtering and classifying documents, URLs, and multimedia using local or remote LLMs.

Architecture

DocFilter implements a three-tier architecture within the Electron framework:

┌─────────────────┐    IPC     ┌─────────────────┐    SQL     ┌─────────────────┐
│ Presentation    │ ◄────────► │ Application     │ ◄────────► │ Data Tier       │
│ Tier            │            │ Tier            │            │                 │
│ - React UI      │            │ - Business      │            │ - SQLite DB     │
│ - User Input    │            │   Logic         │            │ - Config        │
│ - Display       │            │ - AI APIs       │            │ - Artifacts     │
│                 │            │ - File Extract  │            │                 │
└─────────────────┘            └─────────────────┘            └─────────────────┘
    Renderer Process              Main Process                   Persistent Storage

Detailed Component Flow

graph LR
    subgraph "Frontend (React)"
        UI[User Interface]
        DZ[DropZone Component]
        IB[Inbox Component]
        DP[DetailPane Component]
        CM[ConfigModal Component]
    end

    subgraph "Electron Main Process"
        MP[Main Process]
        IPC[IPC Handlers]
        DB[SQLite Database]
    end

    subgraph "Content Processing"
        EXT[Content Extractors]
        PDF[PDF Extractor]
        DOCX[DOCX Extractor]
        WEB[Web Scraper]
        YT[YouTube Extractor]
    end

    subgraph "AI Analysis"
        PROC[Processor Service]
        OAI[OpenAI Provider]
        ANT[Anthropic Provider]
        LOCAL[Local LLM Provider]
    end

    subgraph "External"
        FILES[PDF/DOCX Files]
        URLS[Web URLs]
        OPENAI[OpenAI API]
        CLAUDE[Anthropic API]
        OLLAMA[Local Ollama]
    end

    %% User interactions
    FILES --> DZ
    URLS --> DZ
    
    %% Frontend to Main
    DZ --> IPC
    IB --> IPC
    DP --> IPC
    CM --> IPC
    
    %% Main process coordination
    IPC --> PROC
    IPC --> DB
    DB --> IB
    
    %% Content processing flow
    PROC --> EXT
    EXT --> PDF
    EXT --> DOCX
    EXT --> WEB
    EXT --> YT
    
    %% AI analysis flow
    PROC --> OAI
    PROC --> ANT
    PROC --> LOCAL
    
    %% External API calls
    OAI --> OPENAI
    ANT --> CLAUDE
    LOCAL --> OLLAMA
    WEB --> URLS
    
    %% Data flow back
    PROC --> DB
    DB --> DP

    style UI fill:#e1f5fe
    style DB fill:#f3e5f5
    style PROC fill:#e8f5e8
    style EXT fill:#fff3e0
    style OAI fill:#ffebee
    style ANT fill:#ffebee
    style LOCAL fill:#ffebee

Features

Multi-format Support: PDF, DOCX, TXT, URLs, YouTube videos
Browser Integration: Send URLs directly from any browser with bookmarklet
Smart Token Management: Configurable limits with intelligent content truncation
AI Analysis: OpenAI, Anthropic, and local LLM support with visual status indicators
Large Document Handling: Process massive PDFs while preserving full content
Local Storage: All data stored locally (SQLite)
Drag & Drop: Easy file ingestion
Configurable: Customizable system prompts, providers, and token limits

Getting Started

Quick Start (Windows - Recommended)

Install Dependencies:
```
npm install
```
Build the Application:
```
npm run build
```
Run the App:
```
npx electron dist/main/src/main/main.js
```

Development Setup (WSL2/Linux)

Note: Requires GUI libraries for Electron display.

Install Dependencies:

npm install
sudo apt install -y libnss3 libnspr4 libatk-bridge2.0-0 libdrm2 libxcomposite1 libxdamage1 libxrandr2 libgbm1 libxss1 libasound2

Build and Run:
```
npm run build
npm start
```

Configuration

Before using the app, configure your LLM providers:

Click the "Config" button in the top-right
Set your system prompt (instructions for the AI)
Configure Token Limit: Set max tokens based on your model:
- GPT-3.5: ~16,000 tokens
- GPT-4: ~128,000 tokens
- Claude: ~200,000 tokens
- Local LLMs: Varies by model
Configure at least one provider:
- OpenAI: Add your API key (use gpt-4o for large documents)
- Anthropic: Add your API key (use claude-3-haiku-20240307 or newer)
- Local LLM: Set endpoint (e.g., http://localhost:11434/api/generate for Ollama)

Browser Integration

Send URLs directly from your browser to DocFilter:

Setup Bookmarklet

Copy this JavaScript code:

javascript:(function(){window.open('docfilter://process?url=' + encodeURIComponent(window.location.href));})();

Add to your browser:
- Create a new bookmark
- Paste the code as the bookmark URL/location
- Name it "Send to DocFilter"

Usage

Navigate to any webpage (great for arXiv papers!)
Click your "Send to DocFilter" bookmark
Browser prompts to open DocFilter (allow and remember choice)
DocFilter automatically processes the page

What It Does

PDF URLs: Downloads and analyzes PDF files directly
Web Pages: Extracts main content from regular websites
URL Cleaning: Removes tracking parameters automatically
Single Window: Uses existing DocFilter window if already open

Token Management

DocFilter intelligently handles large content:

How It Works

Token Estimation: Estimates content size (~4 characters per token)
Smart Truncation: If content exceeds your token limit, it's truncated for AI analysis
Full Preservation: Complete extracted content is always saved regardless of truncation
Visual Indicators: Clear badges show when content was truncated

Status Indicators

No badge: AI analyzed the full content
✂️ Truncated badge: AI analyzed partial content (first ~80% of token limit)
❌ Error badge: Processing failed (content still preserved)

Upgrading Models

Increase token limit in config for better models
Reprocess existing items to analyze more content
Truncation badges update based on new limits

Usage

Add Content:
- Drag files into the drop zone
- Click the drop zone to browse and select files
- Enter URLs in the URL input field
- Use browser bookmarklet for one-click URL sending
AI Analysis: The app extracts content and gets AI recommendations ("Read" or "Discard") with summary and reasoning
Review Results:
- Browse the inbox with creation timestamps
- Click items to view full details in the right pane
- See AI-generated summary, detailed reasoning, extracted content, and provider used
Manage Items:
- Filter by All/Read/Discard in the inbox
- Delete items with the × button
- Reprocess items with different settings using the 🔄 button

Supported Formats

Documents: PDF, DOCX, TXT, Markdown
Web: Any URL (extracts main content)
YouTube: Video URLs (extracts title/description)

Local LLM Setup

For Ollama:

# Install Ollama
curl -fsSL https://ollama.ai/install.sh | sh

# Pull a model
ollama pull llama2

# Start server (usually runs on localhost:11434)
ollama serve

Then configure the endpoint as http://localhost:11434/api/generate in the app.

Building for Production

Development Build

npm run build
npx electron dist/main/src/main/main.js

Package as Executable

Prerequisites

npm install --save-dev electron-builder

Create Windows Installer (.exe)

npm run dist:win

Creates an NSIS installer in the release/ folder.

Package for All Platforms

npm run dist

Package Without Installer (Portable)

npm run pack

Creates a portable folder with the executable.

Note: Add application icons to assets/icon.ico (Windows), assets/icon.icns (Mac), and assets/icon.png (Linux) for proper branding.

Data Storage

The app stores all data locally in your OS user data directory:

Windows: %APPDATA%/reading_agent/
macOS: ~/Library/Application Support/reading_agent/
Linux: ~/.local/share/reading_agent/

Database includes:

Processed artifacts with extracted content
AI recommendations and reasoning
Configuration settings and provider credentials
Processing timestamps (local time)

Troubleshooting

Large Document Issues

Token Limit Too Low: Increase max tokens in config for your model
Still Getting Truncated: Large documents may exceed even high token limits - this is normal
Context Length Errors: Your max token setting is higher than your model supports
Reprocessing: Use reprocess button after increasing token limits to analyze more content

Browser Integration Issues

"No application found": Make sure DocFilter has been run at least once to register protocol
Browser not prompting: Check popup blockers or manually allow popups for the site
Wrong URL sent: Some sites use complex URLs - the bookmarklet sends the current page URL

Token Management

Content Truncated: Look for ✂️ badge - full content is always preserved below
Want Full Analysis: Increase token limit and reprocess, or use a more powerful model
Error but Content Saved: Check reasoning section for specific error details

File Processing Issues

Supported: PDF, DOCX, TXT, MD files
Drag-drop and file picker both supported
Check console for extraction errors

WSL2 Display Issues

# Install required libraries
sudo apt install -y libnss3 libnspr4 libatk-bridge2.0-0 libdrm2 libxcomposite1 libxdamage1 libxrandr2 libgbm1 libxss1 libasound2

# Or run on Windows instead (recommended)

Roadmap

See ROADMAP.md for planned features and enhancements including dark theme support, additional file formats, browser extensions, and advanced AI capabilities.

Author

Carlos - BlockSecCA

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
.githooks		.githooks
assets		assets
src		src
tests		tests
.gitignore		.gitignore
BROWSER_INTEGRATION.md		BROWSER_INTEGRATION.md
CHANGELOG.md		CHANGELOG.md
CLAUDE_CODE.md		CLAUDE_CODE.md
HELP.md		HELP.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
SPECIFICATION.md		SPECIFICATION.md
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
run-windows.md		run-windows.md
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
tsconfig.node.tsbuildinfo		tsconfig.node.tsbuildinfo
vite.config.d.ts		vite.config.d.ts
vite.config.js		vite.config.js
vite.config.ts		vite.config.ts

Folders and files

Latest commit

History

Repository files navigation

DocFilter

Architecture

Detailed Component Flow

Features

Getting Started

Quick Start (Windows - Recommended)

Development Setup (WSL2/Linux)

Configuration

Browser Integration

Setup Bookmarklet

Usage

What It Does

Token Management

How It Works

Status Indicators

Upgrading Models

Usage

Supported Formats

Local LLM Setup

Building for Production

Development Build

Package as Executable

Prerequisites

Create Windows Installer (.exe)

Package for All Platforms

Package Without Installer (Portable)

Data Storage

Troubleshooting

Large Document Issues

Browser Integration Issues

Token Management

File Processing Issues

WSL2 Display Issues

Roadmap

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages