Bartleby, the Scrivener

An AI-powered tool for processing document corpora and researching them with an agentic assistant.

Background

I have found it useful to let an AI agent run wild in a SQLite database containing the extracted text from a bunch of documents. I've explored giving that agent various tools to explore the database more effectively, including full-text and semantic searching. This provides a toolkit and agent to research and generate reports based on caches of PDF documents.

bartleby read handles the parsing side: OCR-ing and parsing PDFs (and converting HTML files) into a SQLite database, then paginating, summarizing, chunking, and embedding. This is valuable on its own regardless of your desire to sift through documents with an AI agent, as it enables all sorts of deeper explorations of large corpora.

bartleby write is the research agent: an interactive Q&A loop where you ask questions about your corpus and the agent searches, reads, and synthesizes answers with citations. It works well with paid models like gpt-5-nano and gpt-5-mini, and also with open-weights models like gpt-oss:20b, qwen3:8b, and qwen3:30b via Ollama.

A couple things to be aware of:

Token costs can add up, especially during document summarization in read and during research sessions in write. You have knobs for this (e.g., how many pages to summarize per PDF, which can be zero). The costs the tool shows are estimates.
I'm using the excellent (but pre-v0) sqlite-vec plugin for SQLite. There might be some instability there.

Installation

Prerequisites

Install system dependencies:

brew install tesseract
brew install uv

Install Bartleby

From the project directory:

uv tool install .

This installs bartleby as a command-line tool in an isolated environment.

For development:

uv tool install --editable .

Install Playwright browsers (optional, for HTML support)

If you want to process HTML files, install the Chromium browser for Playwright:

uv run playwright install chromium

This only needs to be done once. Skip this if you only process PDFs.

Quick start

1. Configure

Run the setup wizard to choose your LLM provider, model, and other settings:

bartleby ready

This walks you through configuring worker threads, LLM provider/model, API keys, summarization depth, and temperature. Settings are saved to ~/.bartleby/config.yaml.

2. Create a project

bartleby project create my-research

This creates a project directory and sets it as your active project. All subsequent commands use the active project by default.

3. Process documents

bartleby read --files /path/to/your/pdfs

Point this at a directory of PDFs (or HTML files) and Bartleby will extract text, generate embeddings, and optionally create LLM-powered summaries. Everything goes into a SQLite database in your project.

4. Ask questions

bartleby write

This starts an interactive research session. Ask questions about your corpus and the agent will search, read, and synthesize answers with source citations:

>: What does this corpus have to say about PM2.5 and equity?
  ✓ Listed documents (3 documents) ................... 0.2s
  ✓ Read summary (WANG-ET-AL_2024.pdf) ............... 0.3s
  ✓ Searched text (2 results) ........................ 3.1s
  ✓ Read passage (7 chunks) .......................... 6.0s
⠇ Thinking...

[Markdown-formatted answer with citations]

↑23.6k/↓5.4k/+29.0k (~$0.00)

Type /save to save the last answer as a timestamped report. Press Ctrl+C to exit.

Command reference

`bartleby ready`

Interactive configuration wizard. Asks for:

Setting	Default	Description
Worker threads	4	Parallel processing threads for `read`
LLM provider	anthropic	`anthropic`, `openai`, or `ollama`
Model	varies by provider	Model name (e.g., `claude-3-5-sonnet-20241022`)
API key	—	Required for Anthropic/OpenAI; can also use env vars
Pages to summarize	10	Per-PDF page limit for summarization (0 = skip)
Temperature	0	0 = deterministic, 1 = creative

API keys can be provided in the config or via environment variables: ANTHROPIC_API_KEY, OPENAI_API_KEY. For Ollama, configure the server URL (default http://localhost:11434) or set OLLAMA_API_BASE.

Config is saved to ~/.bartleby/config.yaml.

`bartleby project`

Manage project workspaces. Each project gets its own database, document archive, and output directory.

bartleby project create <name>    # Create and activate a new project
bartleby project list             # List all projects
bartleby project use <name>       # Switch active project
bartleby project info [name]      # Show project details (defaults to active)
bartleby project delete <name>    # Delete a project and its data (-y to skip prompt)

Project directory structure:

~/.bartleby/projects/<name>/
├── bartleby.db       # SQLite database (text, embeddings, summaries)
├── archive/          # Original PDF files (deduplicated by content hash)
└── book/             # Output artifacts
    ├── findings/     # Auto-saved Q&A results and research notes
    ├── report-*.md   # Saved reports (via /save)
    └── log.json      # Session log with tool calls and token usage

`bartleby read`

Process PDF and HTML documents into the project database.

bartleby read --files <path> [options]

Option	Description
`--files <path>`	Path to a file or directory of PDFs/HTML (required)
`--project <name>`	Target project (defaults to active)
`--max-workers <n>`	Worker threads (default: from config)
`--model <name>`	Override LLM model for summarization
`--provider <name>`	Override LLM provider (`anthropic` or `openai`)
`--docling`	Use Docling for layout-aware processing (see below)
`--verbose`	Show debug output

Processing pipeline (default):

Converts HTML to PDF (if applicable) via Playwright/Chromium
Extracts text from PDFs using PyMuPDF
Falls back to OCR (Tesseract) for image-based pages
Chunks text into segments (~400 characters with overlap)
Generates vector embeddings (BAAI/bge-base-en-v1.5)
Creates LLM-powered summaries for the first N pages (if configured)
Stores everything in SQLite with full-text search (FTS5) and vector search (sqlite-vec)

Processing pipeline (--docling):

The --docling flag swaps in IBM's Docling library for layout-aware document understanding. Instead of treating all text equally, Docling detects headings, tables, code blocks, formulas, and reading order using ML models, then chunks along structural boundaries.

Converts documents with Docling's DocumentConverter (ML-based layout analysis)
Chunks using Docling's HybridChunker (respects document structure)
Preserves heading hierarchy (section_heading) and content type (content_type: text/table/code/formula/list/picture) on each chunk
Embeds chunks with heading context prepended for better semantic search
Generates summaries from Docling's structured markdown export

Documents are processed sequentially (Docling loads heavy ML models that shouldn't be duplicated across processes). Search results include section_heading and content_type fields when available.

Supported file types: .pdf, .html, .htm

`bartleby write`

Interactive research agent for investigating your document corpus.

bartleby write [options]

Option	Description
`--project <name>`	Target project (defaults to active)
`--verbose`	Show debug output and full tracebacks

In-session commands:

Command	Description
`/save`	Save the last answer as `book/report-YYYYMMDDHHmm.md`
`Ctrl+C`	Exit the session

The agent has access to search tools (keyword and semantic), document reading tools, summarization, and note-taking. Each question-answer pair is auto-saved to book/findings/ for continuity across the session. Token usage and estimated costs are displayed after each answer.

`bartleby book`

View research activity and findings from your project. Each session gets a memorable name (e.g., "mighty-grove", "sharp-oak") derived from its ID.

bartleby book [subcommand] [options]

Subcommands:

Subcommand	Description
(none)	Overview: session count, notes, reports, total tokens
`sessions`	List all research sessions with stats
`notes [session]`	Show research notes (filter by session name)
`logs [--session <name>]`	Show tool usage and token breakdown

Option	Description
`--project <name>`	Target project (defaults to active)
`--full`	(notes only) Show full note content instead of titles

Example output:

$ bartleby book sessions
                      Sessions
┏━━━━━━━━━━━━━━┳━━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━┓
┃ Session      ┃ Time    ┃ Tools ┃ Notes ┃  Tokens ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━┩
│ mighty-grove │ 18m ago │    10 │     3 │ 2196.4k │
│ sharp-oak    │ 3h ago  │    23 │     5 │ 6877.0k │
│ tawny-oak    │ 3h ago  │    10 │     2 │  336.2k │
└──────────────┴─────────┴───────┴───────┴─────────┘

Use bartleby book logs --session <name> to see a detailed timeline of tool calls for debugging or understanding agent behavior.

Supported LLM providers

Provider	Default model	Vision support	Notes
Anthropic	`claude-3-5-sonnet-20241022`	Claude 3+ models	Requires API key
OpenAI	`gpt-4-turbo`	GPT-4 vision models	Requires API key
Ollama	`llama3.2`	No	Requires local server

Vision-capable models can use page images during summarization for better results. Non-vision models fall back to text-only.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
bartleby		bartleby
.gitignore		.gitignore
.python-version		.python-version
LICENSE.txt		LICENSE.txt
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bartleby, the Scrivener

Background

Installation

Prerequisites

Install Bartleby

Install Playwright browsers (optional, for HTML support)

Quick start

1. Configure

2. Create a project

3. Process documents

4. Ask questions

Command reference

`bartleby ready`

`bartleby project`

`bartleby read`

`bartleby write`

`bartleby book`

Supported LLM providers

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Bartleby, the Scrivener

Background

Installation

Prerequisites

Install Bartleby

Install Playwright browsers (optional, for HTML support)

Quick start

1. Configure

2. Create a project

3. Process documents

4. Ask questions

Command reference

bartleby ready

bartleby project

bartleby read

bartleby write

bartleby book

Supported LLM providers

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`bartleby ready`

`bartleby project`

`bartleby read`

`bartleby write`

`bartleby book`

Packages