Quick setup: Run /ai-setup in Dex — I'll guide you through everything.
Dex can use different AI models depending on your needs:
| Option | Cost | Internet Required | Best For |
|---|---|---|---|
| Claude (default) | ~$3-15/M tokens | Yes | Best quality, complex tasks |
| Budget Cloud | ~$0.14-3/M tokens | Yes | Daily tasks, save 80%+ |
| Offline | Free | No | Travel, privacy |
Other companies make AI models that cost much less than Claude:
- Kimi K2.5 (Moonshot AI) — 80% cheaper, similar quality
- DeepSeek V3 — 95% cheaper, great for coding
- Gemini Flash (Google) — 97% cheaper, handles long documents
- Save money on everyday tasks
- Claude is overkill for simple stuff
- Still get excellent results for most things
| Task | Claude | Kimi | DeepSeek |
|---|---|---|---|
| Complex planning | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐ |
| Meeting summaries | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ |
| Daily planning | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ |
| Code help | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
| Creative writing | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐ |
- Run
/ai-setupin Dex - Choose "Budget Cloud Mode"
- Follow the prompts to create an OpenRouter account
- Add $5-10 in credits (lasts weeks or months)
- Done! Use
/modelto switch between models
| Usage Level | Claude/Month | With Budget Models |
|---|---|---|
| Light (10 queries/day) | ~$15 | ~$3 |
| Medium (30 queries/day) | ~$45 | ~$9 |
| Heavy (100 queries/day) | ~$150 | ~$30 |
Download an AI model directly to your computer. It runs without internet.
✈️ Works on airplanes- 🚂 Works on trains with bad WiFi
- 🔒 Your data never leaves your computer
- 💰 Completely free forever
| Your RAM | Recommended Model | Quality |
|---|---|---|
| 8 GB | Qwen 2.5 7B | Good for basic tasks |
| 16 GB | Qwen 2.5 14B | Good for most tasks |
| 32 GB+ | Qwen 2.5 32B | Great for everything |
Not sure about your RAM?
- Mac: Apple menu → About This Mac → look for "Memory"
- Windows: Right-click Start → System → look for "Installed RAM"
- Run
/ai-setupin Dex - Choose "Offline Mode"
- Answer questions about your computer
- Install Ollama (free app) from ollama.ai
- Download the recommended model
- Done! Works automatically when offline
Works great:
- Daily planning and reviews
- Meeting summaries
- Task management
- Note organization
- Quick questions
Better with Claude:
- Quarterly planning
- Career coaching
- Complex analysis
- Creative writing
Let Dex automatically pick the best model for each task:
- Complex stuff → Claude (best quality)
- Simple stuff → Budget model (saves money)
- No internet → Local model (offline backup)
| Task | Model Used | Reason |
|---|---|---|
/quarter-plan |
Claude | Needs deep thinking |
/daily-plan |
Budget (Kimi) | Routine task |
/meeting-prep |
Budget (Kimi) | Straightforward |
/career-coach |
Claude | Needs nuance |
| "What's on my calendar?" | Budget | Simple question |
| No internet | Local | Only option |
- Run
/ai-setupin Dex - Set up budget cloud first
- Optionally set up offline mode
- Choose "Smart Routing"
- Done! I'll pick automatically (you can always override)
Method 1: Ask me
- "Use the budget model for this"
- "Switch to Kimi"
- "Use Claude for this one"
Method 2: Model picker
- Type
/modelin pi - Select from the list
Run /ai-status to see:
- Which models are configured
- Current model in use
- OpenRouter credit balance
- Ollama status
| Display Name | Technical ID |
|---|---|
| Claude Sonnet | claude-sonnet-4-20250514 |
| Kimi K2.5 | moonshotai/kimi-k2.5 |
| DeepSeek V3 | deepseek/deepseek-chat |
| Gemini Flash | google/gemini-2.0-flash-exp:free |
| Qwen (Offline) | qwen2.5:14b |
- Check OpenRouter has credits (openrouter.ai/credits)
- Verify API key is correct (starts with
sk-or-) - Try regenerating the key
Your model might be too big for your RAM:
- Open Activity Monitor (Mac) or Task Manager (Windows)
- If RAM is maxed, download a smaller model:
ollama pull qwen2.5:7b
Mac:
- Open Applications, right-click Ollama → Open
- If blocked: System Settings → Privacy & Security → Allow
Manual start:
- Open Terminal
- Run:
ollama serve - Keep window open
Q: Will I notice a quality difference with budget models? A: For everyday tasks (planning, summaries, organization), most people don't notice. For complex reasoning or creative work, Claude is noticeably better.
Q: How much will I actually save? A: Depends on usage, but typically 50-80% if you use budget models for routine tasks and Claude for complex ones.
Q: Is my data safe with other models? A: Budget cloud models (Kimi, DeepSeek) are hosted by other companies. If privacy is critical, use offline mode — data never leaves your computer.
Q: Can I go back to Claude-only? A: Yes! Just don't use the other models, or disable them in settings.
Q: What if I'm offline and haven't set up local models? A: Dex will let you know and offer to set it up when you're back online. You can still browse/edit files manually.
- Dex System Guide — Full system documentation
- Dex Technical Guide — Advanced configuration