Multilingual AI Document Assistant

Privacy-first document assistant with zero-retention architecture. Documents are processed but never stored on servers. All persistent data lives in the user's browser (EntityDB). Backend is stateless.

Key points:

No Redis — server stores nothing
No raw IndexedDB — we use EntityDB instead
EntityDB — IndexedDB under the hood + Transformers.js for embeddings and semantic search

Local Setup / Onboarding

Use this guide to get set up locally and ready to contribute.

Quick start (copy & paste):

git clone https://github.com/Resilient-Labs/multilingual-ai-document-assistant.git
cd multilingual-ai-document-assistant
npm install
npm run dev

Then open http://localhost:3000. No Redis or server storage required.

Prerequisites

Node.js 18.x or 20.x (nodejs.org)
npm 9+ (comes with Node.js)
Git (for cloning)

1. Clone the repository

git clone https://github.com/Resilient-Labs/multilingual-ai-document-assistant.git
cd multilingual-ai-document-assistant

2. Install dependencies

npm install

What this installs:

Package	What it does	Install notes
`next`, `react`, `react-dom`	Next.js app framework	Standard install
`@babycommando/entity-db`	In-browser vector DB (IndexedDB + Transformers.js under the hood)	May take 1–2 min; pulls WASM deps
`uuid`	Document ID generation	Standard install

Step-by-step:

Open a terminal in the project folder.
Run npm install.
Wait for it to finish (entity-db can take longer on first install).
Confirm: you should see added X packages and no errors.
If it fails, try npm ci for a clean install.

Installing a single package later:

npm install <package-name>

If npm install fails:

Run npm cache clean --force, then npm install again.
Ensure Node.js 18+ is installed: node -v.
On Windows, you may need to run the terminal as Administrator for native modules.

3. Environment variables

Optional. Copy .env.local.example to .env.local when you add OCR, LLM, or other API keys:

cp .env.local.example .env.local

No Redis or server storage is required. Add keys only when integrating external services.

4. Run the development server

npm run dev

Open http://localhost:3000 in your browser.

5. Verify setup

The app should load without errors.
API routes are stateless — they process and return; no server storage.

Onboarding checklist

Before you start contributing, confirm:

Node.js 18+ installed (node -v)
Repo cloned and npm install completed
npm run dev runs and localhost:3000 loads
You know your team's area (see Team ownership below)

Available scripts

Command	Description
`npm run dev`	Start development server (hot reload)
`npm run build`	Build for production
`npm run start`	Start production server
`npm run lint`	Run ESLint
`npm run format`	Check formatting with Prettier
`npm run typecheck`	Run TypeScript type checking

Linting & Formatting

This project uses ESLint, Prettier, and TypeScript. These checks run in CI.

npm run lint
npm run format
npx prettier --write .   # Fix formatting
npm run typecheck

Troubleshooting

Issue	Solution
Port 3000 in use	Run `npm run dev -- -p 3001` to use a different port
Build fails	Run `npm ci` for a clean install, then `npm run build`
EntityDB / Transformers.js errors	Check `next.config.js` has webpack aliases for `onnxruntime-node` and `sharp`

Key dependencies

npm install uuid
npm install github:babycommando/entity-db

Package	Purpose	Install source
`@babycommando/entity-db`	In-browser vector DB for chunks, embeddings, semantic search	GitHub
`uuid`	Document ID generation (`doc_${uuidv4()}`)	npm

EntityDB stores all data in the browser. Use lib/entitydb.ts:

import { insertChunk, queryChunks } from "@/lib/entitydb";

await insertChunk("Document text here", { docId: "doc_123", chunkId: "c1" });
const results = await queryChunks("search query", { limit: 5 });

Architecture: Zero-retention

User Browser
│
├── EntityDB (IndexedDB + Transformers.js)
│   Entities: Document, OCRBlock, Chunk, Embedding, Summary, ChatSession, ChatMessage, RiskFlag, Language
│
└── API requests
     │
     ▼
Stateless Backend (OCR, LLM, embeddings, translation, risk classification)

Server never stores documents. Everything persistent lives in EntityDB in the browser.

Entity model

Document (root)
├── OCRBlock → FieldCandidate
├── Chunk → Embedding
├── Summary
├── RiskFlag
├── Language
└── ChatSession → ChatMessage

See types/index.ts for full definitions.

Team ownership / areas of work

Team	Area	Files / endpoints	What to build
Team 1	Upload & OCR	`app/api/documents/upload`, `app/api/documents/extract`	File upload, OCR pipeline. Return JSON. Client stores in EntityDB.
Team 2	Summarization	`app/api/summarize`	Receive `fullText`, return summary via LLM. Stateless.
Team 3	RAG & embeddings	`app/api/ask`, `lib/entitydb.ts`	Chunking, embeddings in EntityDB, RAG. Client sends context; backend returns answer.
Team 4	Multilingual	(to be added)	Speech-to-text, translation, multilingual responses.
Team 5	Safety detection	`app/api/safety`	Receive text/blocks, return risk flags. Stateless.

Shared resources:

types/ — Entity definitions (Document, OCRBlock, Chunk, etc.)
lib/entitydb.ts — EntityDB client for chunks and semantic search
lib/constants.ts — File limits, allowed MIME types
lib/documentId.ts — Document ID generation

Project structure

app/
  api/
    documents/upload   # Stateless: OCR, return JSON
    documents/extract  # Stateless: OCR, return JSON
    ask               # Stateless: RAG (client sends context)
    summarize         # Stateless: summary (client sends fullText)
    safety            # Stateless: risk flags (client sends text)
components/           # Shared React components
lib/                  # entitydb, constants, documentId
types/                # Entity definitions

API endpoints

All endpoints are stateless. Client sends data; backend processes and returns. No server storage.

Endpoint	Method	Body	Description
`/api/documents/upload`	POST	`FormData` (file)	OCR, return docId + OCR JSON
`/api/documents/extract`	POST	`FormData` (file)	OCR, return OCR JSON
`/api/ask`	POST	`{ question, context? }` or `{ question, chunks? }`	RAG answer
`/api/summarize`	POST	`{ fullText }`	Summary
`/api/safety`	POST	`{ fullText?, blocks? }`	Risk flags

Storage limits

PDF / Images: ≤ 4.5 MB (client upload limit)

Privacy

Documents are processed but never stored on servers. All data stays in the user's browser.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github		.github
PRDs		PRDs
app		app
components		components
hooks		hooks
lib		lib
public		public
tests		tests
types		types
.env.local.example		.env.local.example
.eslintignore		.eslintignore
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
README.md		README.md
components.json		components.json
next-env.d.ts		next-env.d.ts
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multilingual AI Document Assistant

Local Setup / Onboarding

Prerequisites

1. Clone the repository

2. Install dependencies

3. Environment variables

4. Run the development server

5. Verify setup

Onboarding checklist

Available scripts

Linting & Formatting

Troubleshooting

Key dependencies

Architecture: Zero-retention

Entity model

Team ownership / areas of work

Project structure

API endpoints

Storage limits

Privacy

Architecture docs

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 6

Languages

Folders and files

Latest commit

History

Repository files navigation

Multilingual AI Document Assistant

Local Setup / Onboarding

Prerequisites

1. Clone the repository

2. Install dependencies

3. Environment variables

4. Run the development server

5. Verify setup

Onboarding checklist

Available scripts

Linting & Formatting

Troubleshooting

Key dependencies

Architecture: Zero-retention

Entity model

Team ownership / areas of work

Project structure

API endpoints

Storage limits

Privacy

Architecture docs

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 6

Languages

Packages