Archivist

Organize PDFs with AI.

Personally, I scan documents I'd like to keep to a network drive, where the archivist processes and organizes the PDFs for me.

Installation

Ollama is needed for the archivist's capabilities. Install Ollama somewhere accessible to the archivist.

The archivist is intended to run via Docker:

$ docker run \
  --net=host \
  -v ~/Documents:/archive \
  -v ~/Downloads/inbox:/inbox \
  -e ARCHIVIST_ARCHIVE_DIR="/archive" \
  -e ARCHIVIST_INBOX_DIR="/inbox" \
  ghcr.io/jdav-dev/archivist

ARCHIVIST_INBOX_DIR and ARCHIVIST_ARCHIVE_DIR are required environment variables, specifying the respective source and destination directories for PDFs. Optional environment variables include ARCHIVIST_OLLAMA_BASE_URL to override the URL for Ollama's API (defaults to "http://localhost:11434/api") and ARCHIVIST_OLLAMA_TIMEOUT_SECONDS to override how long to wait on responses from Ollama (defaults to "60").

Development

The archivist uses development containers for local development. After cloning the repository and before opening the project in its development container, a file must be created at .devcontainer/docker-compose.extend.yml. This file customizes the Docker Compose development stack for your local machine. For example:

services:
  archivist:
    volumes:
      - ~/Documents:/archive
      - ~/Downloads:/inbox
  ollama:
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [ gpu ]

This will mount ~/Documents to /archive and ~/Downloads to ~/inbox in your development environment. It will also give the Ollama container access to a Nvidia GPU. Change the directories as needed for your machine, and change or remove the Ollama section if you do not have an Nvidia GPU available.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
config		config
lib		lib
test		test
.formatter.exs		.formatter.exs
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
mix.exs		mix.exs
mix.lock		mix.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Archivist

Installation

Development

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Languages

License

jdav-dev/archivist

Folders and files

Latest commit

History

Repository files navigation

Archivist

Installation

Development

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Languages

Packages