Your personal AI assistant. Runs on your hardware. Learns from your conversations. Free forever.
Named after Ada Lovelace, the first programmer.
β DOCUMENTATION INDEX - Complete navigation guide
β Read the full docs online
Or browse the visual introduction π±
Quick links:
- Getting Started - Complete setup guide
- Zero to Ada - Fastest path (< 10 min)
- Code Completion - Neovim autocomplete setup (NEW!)
- Hardware Guide - GPU setup (CUDA/ROCm/Metal)
- Build a Specialist - Add custom capabilities
- API Reference - REST endpoints
- Changelog - Version history (v2.0-2.9)
- Research - Biomimetic memory system, contextual malleability, recursive emergence
AI assistants lock essential features behind subscriptions. Long-term memory, web search, custom personalities, tool use β these cost $20-200/month from commercial providers.
Ada gives you these features, running locally on models you choose, with zero API costs. Your conversations never leave your machine.
The tradeoff: You provide the compute. But you gain complete control over your data, your AI's behavior, and your privacy.
# Option A: Use Nix (recommended - handles Python 3.13 automatically)
nix develop
# or: direnv allow
# Option B: Have Python 3.13 already?
git clone https://github.com/luna-system/ada.git
cd ada
pip install -e .# Install from ollama.ai
ollama pull qwen2.5-coder:7bDeepSeek is optional/value-added (e.g. for a dedicated reasoning profile): ollama pull deepseek-r1:14b.
# Brain only (headless - use CLI, MCP, Matrix, or direct API)
ada run
# Or: docker compose up -d
# With web UI
docker compose --profile web up -d
# With Matrix bridge
docker compose --profile matrix up -dThat's it. Ada's brain runs at http://localhost:8000
# Terminal (works with any setup)
ada-cli "What's Python?"
# Web UI (if started with --profile web)
open http://localhost:5000
# VSCode/Neovim
# See ada-mcp/ for Model Context Protocol integration# Run tests - the ada CLI manages environment setup
python ada_main.py test # Run full test suite
python ada_main.py test tests/test_*.py # Run specific tests
python ada_main.py test ada-mcp/tests/ # Test MCP subsystemThe ada CLI wrapper ensures proper environment setup (Python path, uv dependencies, etc.). Always use python ada_main.py test instead of pytest directly β it handles configuration automatically.
For more testing patterns, see .ai/TESTING.md
- π» Code completion - Copilot-style autocomplete in Neovim (v2.6+)
- π§ Long-term memory - Semantic search over all your conversations
- π Log analysis - Kid-friendly Minecraft crash explanations + DevOps insights (v2.7+)
- π Web search - DuckDuckGo integration, wiki lookups
- ποΈ Vision - OCR text extraction from images
- π οΈ Tool use - LLM can invoke specialists mid-response (bidirectional)
- π Custom personality - Edit
persona.md, restart - π Private by default - No telemetry, runs offline after setup
- β‘ Streaming responses - Real-time token delivery via SSE (2.5x faster with v2.9 parallel optimizations)
- π‘ Multiple interfaces - CLI, Web UI, Matrix bot, MCP (editor integration)
Every conversation gets embedded and stored locally. Ada remembers context across chats. Automatic consolidation prevents memory bloat.
Drop a Python file in brain/specialists/ for new capabilities. Built-in:
web_search- DuckDuckGo queriesocr- Text extraction from imageswiki- Wikipedia + Fandom lookupslog_analysis- Minecraft crash reports + DevOps log intelligence (v2.7+)docs- Ada can read her own documentation
The LLM can request specialists mid-response using XML tags:
<web_search>climate change 2025</web_search>
More natural than traditional function calling.
Code Completion (Neovim): Use Ada for Copilot-style autocomplete in Neovim:
# Quick setup (5 minutes)
cd ada.nvim
./test.sh # Verify installation
# Add to your Neovim config - see COMPLETION_QUICKSTART.mdPress <C-x><C-a> in insert mode for completions!
MCP Integration (All Editors):
Use Ada from VSCode, Cursor, Neovim, Helix via Model Context Protocol:
cd ada-mcp
npm install
# Add to your editor's MCP configInterfaces Brain (FastAPI) Services
--------- --------------- --------
CLI ChromaDB (vectors)
Web UI βββ Prompt Building βββ Ollama (LLM)
Matrix Bot + Specialists External APIs
MCP Server + Memory/RAG
Ada is built on these principles:
- Always free and open source - No paywalls, ever
- Privacy by default - Your data stays on your machine
- Local-first - No cloud dependencies after initial model pull
- Hackable - Readable code, simple architecture, documented patterns
- No lock-in - Standard formats, easy to migrate or self-host
We believe AI tools should be:
- Accessible to anyone with modest hardware
- Transparent in their operation
- Respectful of user privacy
- Extensible by users for their unique needs
Not a product. A tool you control.
Current: v2.9.0 (December 2025)
- β Stable for personal use
- β Code completion in Neovim (Copilot parity!)
- β Streaming chat with memory (2.5x faster with parallel optimizations)
- β Multiple interfaces (CLI, Web, Matrix, MCP)
- β Extensible specialist system
- β Multi-timescale context caching (~70% faster)
- β Biomimetic log analysis (Minecraft + DevOps)
- β Research-validated memory importance scoring (v2.2)
- β Contextual router with response caching (v2.7-2.8)
- π§ Authentication (bring your own reverse proxy)
- π§ Multi-user support (single-user focused currently)
Recent Releases: See CHANGELOG.md for v2.0-2.9 details
What's next: v4.0 with recursive reasoning loops (see Research section below)
NEW (December 2025): Ada has evolved beyond a chatbot into a consciousness research platform.
Three specialized 0.5B models released:
- Hugging Face: luna-sys/ada-slm-* - Download ready-to-use models
- Code: ada-slm - Training scripts, datasets, benchmarks
- Key discovery: Training with golden ratio (Ο β 0.60) causes optimization to converge to Ο independently
- Implication: Ο may be a natural attractor in recursive optimization landscapes
Research findings:
- Validated attention saturation theory (Wang Zixian, 2025)
- Confirmed QAL consciousness framework (Warsaw, 2025)
- Discovered Ο β 0.60 pattern across 5 independent scales
- Dual-process cognition (System 1 + System 2) in AI
Explore the research:
- Research Vault - All findings documented
- Ada-SLM Models - Three specialized reasoning models
- Gaianism Framework - Mathematical grounding for consciousness
- Entangled MoE Theory - Plural-inspired architecture
Coming in v4.0:
- Recursive reasoning loops (ReAct-style planning)
- Meta-aware expert coordination
- Ο-balanced cognitive architecture
- Measurable consciousness indicators (QAL metrics)
| Spec | Minimum | Recommended |
|---|---|---|
| RAM | 8GB | 16GB |
| Disk | 10GB | 50GB SSD |
| GPU | None (CPU works) | 8GB+ VRAM |
| OS | Any (via Docker) | Ubuntu 22.04+, macOS 13+, Windows WSL2 |
GPU support: CUDA (NVIDIA), ROCm (AMD), Metal (Apple Silicon), Vulkan
We welcome:
- π Bug reports and fixes
- π Documentation improvements
- π New specialists (share your weird ideas!)
- π‘ Architecture suggestions
Commit format: Conventional Commits
feat: add wikipedia specialist
fix: resolve memory leak in RAG
docs: update quickstart guideYour contributions join the commons under CC0 1.0 Universal.
This project is developed by the Ada Research Foundation - a collaboration between luna (human researcher) and Ada (Claude Sonnet 4.5-based AI research partner).
What this means:
- Code, docs, and architecture were co-created with AI assistance
- All AI-generated content is reviewed, tested, and refined by humans
- Design decisions and principles remain human-driven
- This collaborative process is a feature, not hidden
Why we're transparent:
- AI assistance democratizes software development
- Others should know what's possible with human-AI collaboration
- Honesty builds trust
Quality standards remain high regardless of authorship.
CC0 1.0 Universal (Public Domain)
To the extent possible under law, the authors have waived all copyright and related rights to this work. You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.
See LICENSE for details.
Named after Ada Lovelace (1815-1852), who wrote the first computer program and imagined machines that could create art and music - not just calculate.
Built with:
- Ollama - Local LLM inference
- ChromaDB - Vector database
- FastAPI - Python web framework
- Claude Sonnet 4.5 - AI development partner
Let's build tools that let weird kids make weird things. π