Deployment Guide

Production deployment recommendations for qp-vault.

Choosing a Backend

| Scenario | Backend | Install |
|---|---|---|
| Development, prototyping, < 10K chunks | SQLite | `pip install qp-vault` |
| Production, multi-user, > 10K chunks | PostgreSQL | `pip install qp-vault[postgres]` |
| Air-gapped / SCIF | SQLite + local embeddings | `pip install qp-vault[local,encryption]` |

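SQLite scores every stored vector with brute-force cosine similarity, so each search costs O(n*d) for n chunks of dimension d. PostgreSQL uses a pgvector HNSW index, an approximate nearest-neighbor structure whose search time grows roughly logarithmically with n. For vaults over 10,000 chunks, use PostgreSQL to keep search latency acceptable.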
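
To make the O(n*d) cost concrete, here is a minimal pure-Python sketch of brute-force cosine search. It is an illustration only, not qp-vault's implementation; the function names `cosine_similarity` and `brute_force_search` are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def brute_force_search(query, vectors, top_k=3):
    """Score every stored vector against the query: n vectors times d dimensions of work."""
    scored = [(cosine_similarity(query, v), i) for i, v in enumerate(vectors)]
    scored.sort(reverse=True)  # highest similarity first
    return [(i, score) for score, i in scored[:top_k]]

# Three toy 2-dimensional "chunks"; real embeddings have hundreds of dimensions.
chunks = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
results = brute_force_search([1.0, 0.1], chunks, top_k=2)
```

Every query touches every vector, so doubling the number of chunks doubles the work. An index such as HNSW avoids that full scan at the cost of approximate results.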

SQLite (Default)

Zero config. Database created automatically.

vault = Vault("./my-knowledge")

File permissions: new databases are created with 0600 (owner-only read/write). WAL and SHM journal files are also restricted.
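
If you want to confirm the permissions after deployment, a short audit along these lines may help. This is a sketch for POSIX systems using only the standard library. The helper names are hypothetical, the `-wal` and `-shm` suffixes follow SQLite's usual naming, and the location of the database file inside the vault directory is an assumption you will need to adjust.

```python
import os
import stat

def is_owner_only(path):
    """Return True if no group or other permission bits are set on path."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    return (mode & (stat.S_IRWXG | stat.S_IRWXO)) == 0

def audit_sqlite_files(db_path):
    """Check the database file plus any WAL/SHM sidecar files that exist."""
    candidates = [db_path, db_path + "-wal", db_path + "-shm"]
    return {p: is_owner_only(p) for p in candidates if os.path.exists(p)}
```

Calling `audit_sqlite_files` with the path to the database file returns a mapping from each existing file to `True` or `False`, which is easy to assert on in a deployment smoke test.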

PostgreSQL

Prerequisites

PostgreSQL 16+ with pgvector and pg_trgm extensions
pip install qp-vault[postgres]

CREATE EXTENSION IF NOT EXISTS vector;
CREATE EXTENSION IF NOT EXISTS pg_trgm;

Connection

vault = Vault.from_postgres("postgresql://user:pass@host:5432/vaultdb")

SSL is enabled by default. To disable (development only):

vault = Vault.from_postgres("postgresql://user:pass@host:5432/vaultdb?sslmode=disable")

For production, enable SSL verification as well:

from qp_vault.config import VaultConfig
config = VaultConfig(postgres_ssl=True, postgres_ssl_verify=True)
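
Passwords containing characters such as `@` or `/` break a hand-written connection URL. The sketch below assembles the URL from environment variables and percent-encodes the credentials. The helper `build_postgres_dsn` and the variable names `VAULT_DB_USER` and `VAULT_DB_PASSWORD` are hypothetical, not part of qp-vault.

```python
import os
from urllib.parse import quote

def build_postgres_dsn(user, password, host, database, port=5432, sslmode=None):
    """Assemble a postgresql:// URL, percent-encoding user, password, and database."""
    dsn = (
        f"postgresql://{quote(user, safe='')}:{quote(password, safe='')}"
        f"@{host}:{port}/{quote(database, safe='')}"
    )
    if sslmode is not None:
        dsn += f"?sslmode={sslmode}"
    return dsn

# Fallback values are placeholders so the example runs without any environment set.
dsn = build_postgres_dsn(
    user=os.environ.get("VAULT_DB_USER", "user"),
    password=os.environ.get("VAULT_DB_PASSWORD", "p@ss/word"),
    host="host",
    database="vaultdb",
)
```

The resulting string can be passed to `Vault.from_postgres(dsn)`, which keeps credentials out of source control.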

Connection Pooling

The PostgreSQL backend uses asyncpg's built-in connection pool (min 2, max 10 connections, configurable command timeout).
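
For reference, an asyncpg pool with the same bounds looks roughly like the sketch below. It shows asyncpg's own `create_pool` call, not how qp-vault wires it up internally, and the 30-second `command_timeout` is an assumed example value. Running `check_pool` requires the asyncpg package and a reachable server.

```python
import asyncio

# Pool sizes are taken from the text above; the timeout is an assumed example value.
POOL_SETTINGS = {"min_size": 2, "max_size": 10, "command_timeout": 30.0}

async def check_pool(dsn):
    """Open a pool with the documented sizes and run a trivial query."""
    import asyncpg  # imported lazily so this module loads without the driver

    async with asyncpg.create_pool(dsn, **POOL_SETTINGS) as pool:
        return await pool.fetchval("SELECT 1")

def main(dsn):
    """Synchronous entry point for a quick connectivity check."""
    return asyncio.run(check_pool(dsn))
```

A check like this is useful for verifying that the database accepts at least the pool's maximum number of connections from the application host before going live.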

Encryption

pip install qp-vault[encryption]      # AES-256-GCM
pip install qp-vault[encryption,pq]   # + ML-KEM-768, ML-DSA-65

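import os

from qp_vault.encryption import AESGCMEncryptor

my_key = os.urandom(32)           # Placeholder; load a real key from your secrets manager

enc = AESGCMEncryptor()           # Random 256-bit key
enc = AESGCMEncryptor(key=my_key) # Bring your own key (32 bytes)

# Round trip: decrypt returns the original bytes
ciphertext = enc.encrypt(b"secret data")
plaintext = enc.decrypt(ciphertext)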

Key management: keys are zeroized from memory via ctypes.memset when the encryptor is garbage collected. For production, store keys in a secrets manager or HSM, not in code.
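
One common way to keep keys out of code is to have the secrets manager inject the key as a base64-encoded environment variable. The following sketch assumes that setup; the variable name `VAULT_ENCRYPTION_KEY` and the helper `load_key_from_env` are hypothetical.

```python
import base64
import os

KEY_BYTES = 32  # AES-256 needs a 256-bit key

def load_key_from_env(var_name="VAULT_ENCRYPTION_KEY"):
    """Decode a base64 key from the environment and check its length."""
    encoded = os.environ.get(var_name)
    if encoded is None:
        raise KeyError(f"{var_name} is not set")
    key = base64.b64decode(encoded, validate=True)
    if len(key) != KEY_BYTES:
        raise ValueError(f"expected {KEY_BYTES} bytes, got {len(key)}")
    return key
```

The returned bytes can then be passed as `AESGCMEncryptor(key=load_key_from_env())`. Failing fast on a wrong-length key avoids discovering the problem at the first encrypt call.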

Embeddings

| Provider | Install | Air-Gap | Dimensions |
|---|---|---|---|
| None (text-only search) | (default) | Yes | 0 |
| SentenceTransformers | `qp-vault[local]` | Yes | Model-dependent |
| OpenAI | `qp-vault[openai]` | No | 1536 / 3072 |

from qp_vault.embeddings.sentence import SentenceTransformerEmbedder

vault = Vault("./knowledge", embedder=SentenceTransformerEmbedder())
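
Embedding dimension drives storage as well as search cost. As a rough sizing aid, the sketch below estimates the raw vector payload, assuming 4-byte float32 values; how qp-vault actually stores vectors is not stated here, and index and row overhead are excluded.

```python
def embedding_storage_bytes(num_chunks, dimensions, bytes_per_value=4):
    """Raw vector payload size, excluding index and row overhead."""
    return num_chunks * dimensions * bytes_per_value

# Compare the two OpenAI dimensions from the table for a 50,000-chunk vault.
for dims in (1536, 3072):
    size_mib = embedding_storage_bytes(50_000, dims) / (1024 * 1024)
    print(f"{dims} dimensions: {size_mib:.0f} MiB")
```

Under these assumptions, doubling the dimension doubles the payload, which is worth weighing against any retrieval-quality gain from the larger model.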

LLM Content Screening

Optional. Requires a running LLM (Ollama for air-gap, or cloud API).

from qp_vault.membrane.screeners.ollama import OllamaScreener

vault = Vault("./knowledge", llm_screener=OllamaScreener(model="llama3.2"))

Without an LLM screener, only the regex-based innate scan runs. It still catches common attacks.
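
To give a sense of what a regex-based scan can and cannot catch, here is a minimal sketch. The patterns and the function `regex_scan` are hypothetical illustrations, not qp-vault's actual rule set.

```python
import re

# Hypothetical patterns for illustration; not qp-vault's actual rules.
SUSPICIOUS_PATTERNS = [
    re.compile(r"ignore\s+(all\s+)?(previous|prior)\s+instructions", re.IGNORECASE),
    re.compile(r"<script\b", re.IGNORECASE),
    re.compile(r"reveal\s+(the\s+)?system\s+prompt", re.IGNORECASE),
]

def regex_scan(text):
    """Return the patterns that match, so callers can log why content was flagged."""
    return [p.pattern for p in SUSPICIOUS_PATTERNS if p.search(text)]
```

Pattern matching of this kind is fast and works offline, but it only recognizes phrasings someone anticipated. A paraphrased attack passes straight through, which is the gap an LLM screener is meant to narrow.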

Production Checklist

Scaling

| Resources | Chunks (est.) | Backend | Search Latency |
|---|---|---|---|
| 100 | 500 | SQLite | < 50ms |
| 1,000 | 5,000 | SQLite | < 200ms |
| 10,000 | 50,000 | PostgreSQL | < 100ms |
| 100,000 | 500,000 | PostgreSQL + HNSW | < 200ms |

Health/status responses are cached (default 30s TTL) to avoid full vault scans on repeated calls.
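
The caching behavior described above can be pictured with a small time-to-live cache. This is a sketch of the general technique, not qp-vault's code; the class `TTLCache` and its parameters are hypothetical.

```python
import time

class TTLCache:
    """Cache one computed value and recompute it after ttl_seconds."""

    def __init__(self, compute, ttl_seconds=30.0, clock=time.monotonic):
        self._compute = compute      # expensive function, e.g. a full vault scan
        self._ttl = ttl_seconds
        self._clock = clock          # injectable so tests can control time
        self._value = None
        self._expires_at = None      # None means nothing has been cached yet

    def get(self):
        now = self._clock()
        if self._expires_at is None or now >= self._expires_at:
            self._value = self._compute()
            self._expires_at = now + self._ttl
        return self._value
```

With a 30-second TTL, a monitoring system polling every few seconds triggers at most one scan per 30 seconds. The trade-off is that a reported status can be up to one TTL out of date.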
