Quick Testing Guide - ChromaDB & Chat Fixes

Pre-Test Checklist

# 1. Ensure services are running
docker-compose ps
# Should show: chroma, ollama, backend, frontend all running

# 2. Pull Ollama model (first time only)
docker exec azeru-ollama ollama pull nomic-embed-text

# 3. Verify services are healthy
curl http://localhost:8000/api/v1/heartbeat    # ChromaDB
curl http://localhost:11434/api/tags            # Ollama
curl http://localhost:8080/health               # Backend

Test Scenario 1: Upload PDF & Check ChromaDB Storage

Step 1: Upload PDF

Go to http://localhost:3000
Click "Upload Document"
Select a PDF file
Wait for processing to complete

Step 2: Check Backend Logs

# In one terminal
docker logs -f azeru-backend

Step 3: Look for Success Message

✅ Good Log:

PDF 1: Extracted 12 chunks
PDF 1: Generating embeddings for 12 chunks...
PDF 1: Successfully generated 12 embeddings
PDF 1: Stored 12 chunks in ChromaDB
PDF 1: Processing complete - 12 chunks indexed

❌ Bad Log (Error Details Now Visible):

PDF 1: WARNING - Failed to store in ChromaDB: failed to add chunks: status=422, body={"code":"invalid_request","message":"Embedding dimension mismatch"}

Test Scenario 2: Chat with ChromaDB Vector Search

Step 1: Navigate to Chat

Click "Chat" in the left sidebar
Enter your Groq API key

Step 2: Ask a Question

Ask something related to the uploaded document
Example: "What is this document about?"

Step 3: Check Backend Logs

docker logs -f azeru-backend | grep "Chat:"

Step 4: Look for Vector Search Success

✅ Expected Log:

Chat: Starting search for question: "What is this document about?"
Chat: ChromaDB healthy=true, Ollama healthy=true
Chat: Attempting ChromaDB vector search...
Chat: Generated query embedding, dimension=768
Chat: SUCCESS - Found 5 results via ChromaDB vector search
Chat: Successfully returned answer using vector_search

Step 5: Check Frontend UI

Look for the blue badge below the answer:
```
🔍 Vector Search (Semantic)
```

Test Scenario 3: Fallback to FTS5 (Optional - Stop ChromaDB)

Step 1: Stop ChromaDB

docker stop azeru-chroma

Step 2: Ask a Question

In chat, ask another question
Same question format as before

Step 3: Check Backend Logs

docker logs -f azeru-backend | grep "Chat:"

Step 4: Look for Fallback Message

✅ Expected Log:

Chat: Starting search for question: "Why is this important?"
Chat: ChromaDB healthy=false, Ollama healthy=true
Chat: ChromaDB/Ollama not available, skipping vector search
Chat: Falling back to FTS5 full-text search
Chat: FTS5 search returned 5 results
Chat: Successfully returned answer using keyword_search

Step 5: Check Frontend UI

Look for the blue badge below the answer:
```
🔍 Keyword Search (FTS5)
```

Step 6: Restart ChromaDB

docker start azeru-chroma

Debugging Commands

View All Logs

# Backend logs with timestamp
docker logs -f azeru-backend --tail=100

# ChromaDB logs
docker logs -f azeru-chroma

# Ollama logs
docker logs -f azeru-ollama

Check ChromaDB Collection

# List collections
curl http://localhost:8000/api/v1/collections

# Get specific collection
curl http://localhost:8000/api/v1/collections/enterprise_brain

# Count items in collection
curl http://localhost:8000/api/v1/collections/enterprise_brain/count

Check Ollama Models

# List available models
curl http://localhost:11434/api/tags

# For docker
docker exec azeru-ollama ollama list

Test Backend API Directly

# Test chat endpoint (requires GROQ_API_KEY)
curl -X POST http://localhost:8080/api/chat \
  -H "Content-Type: application/json" \
  -d '{
    "question": "What is this about?",
    "api_key": "gsk_YOUR_KEY_HERE"
  }'

# Should return JSON with:
# - answer: string
# - sources: array
# - search_method: "vector_search" | "keyword_search" | "none"

Expected Outputs

Successful Vector Search Response

{
  "answer": "Based on the documents, this is about...",
  "sources": [
    {
      "id": 1,
      "document_id": 1,
      "content": "...",
      "page_num": 0
    }
  ],
  "search_method": "vector_search"
}

Fallback FTS5 Response

{
  "answer": "Based on the documents, this is about...",
  "sources": [
    {
      "id": 5,
      "document_id": 1,
      "content": "...",
      "page_num": 2
    }
  ],
  "search_method": "keyword_search"
}

No Results Response

{
  "answer": "I couldn't find any relevant information in the uploaded documents.",
  "sources": [],
  "search_method": "none"
}

Troubleshooting

Problem: "ChromaDB not healthy"

# Check if ChromaDB is running
docker ps | grep chroma

# Check health endpoint
curl http://localhost:8000/api/v1/heartbeat

# View logs
docker logs azeru-chroma

# Restart if needed
docker restart azeru-chroma

Problem: "Ollama not reachable"

# Check if Ollama is running
docker ps | grep ollama

# Check available models
docker exec azeru-ollama ollama list

# Ensure model is pulled
docker exec azeru-ollama ollama pull nomic-embed-text

# View logs
docker logs azeru-ollama

Problem: "Failed to add chunks: 422"

This usually means embedding dimension mismatch. Check:

Is Ollama using the same model? (should be nomic-embed-text)
Are embeddings being generated with correct dimensions?
Check Ollama logs: docker logs azeru-ollama

Problem: Chat only returns keyword search results

Check backend logs to see why ChromaDB isn't being used:

docker logs azeru-backend | grep "Chat: ERROR"

Performance Baseline

Expected Times

PDF Upload (10KB): 1-3 seconds
Embedding Generation (100 chunks): 10-30 seconds
Vector Search: 100-500ms
FTS5 Search: 50-100ms
AI Response: 3-10 seconds

Monitor Performance

# Watch for timing in logs
grep "ChatTime\|EmbeddingTime\|SearchTime" container.log

Success Criteria

✅ All Tests Pass When:

PDF uploads generate embeddings without errors
ChromaDB receives and stores embeddings
Chat shows "Vector Search (Semantic)" badge
Backend logs show successful ChromaDB operations
Fallback to FTS5 works when ChromaDB is unavailable
Frontend displays search method indicator

❌ Issues If:

Logs show "failed to add chunks" with no error details
Chat doesn't show search method badge
Only seeing FTS5 searches even though ChromaDB is running
Silent failures in ChromaDB integration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quick Testing Guide - ChromaDB & Chat Fixes

Pre-Test Checklist

Test Scenario 1: Upload PDF & Check ChromaDB Storage

Step 1: Upload PDF

Step 2: Check Backend Logs

Step 3: Look for Success Message

Test Scenario 2: Chat with ChromaDB Vector Search

Step 1: Navigate to Chat

Step 2: Ask a Question

Step 3: Check Backend Logs

Step 4: Look for Vector Search Success

Step 5: Check Frontend UI

Test Scenario 3: Fallback to FTS5 (Optional - Stop ChromaDB)

Step 1: Stop ChromaDB

Step 2: Ask a Question

Step 3: Check Backend Logs

Step 4: Look for Fallback Message

Step 5: Check Frontend UI

Step 6: Restart ChromaDB

Debugging Commands

View All Logs

Check ChromaDB Collection

Check Ollama Models

Test Backend API Directly

Expected Outputs

Successful Vector Search Response

Fallback FTS5 Response

No Results Response

Troubleshooting

Problem: "ChromaDB not healthy"

Problem: "Ollama not reachable"

Problem: "Failed to add chunks: 422"

Problem: Chat only returns keyword search results

Performance Baseline

Expected Times

Monitor Performance

Success Criteria

FilesExpand file tree

TESTING_GUIDE.md

Latest commit

History

TESTING_GUIDE.md

File metadata and controls

Quick Testing Guide - ChromaDB & Chat Fixes

Pre-Test Checklist

Test Scenario 1: Upload PDF & Check ChromaDB Storage

Step 1: Upload PDF

Step 2: Check Backend Logs

Step 3: Look for Success Message

Test Scenario 2: Chat with ChromaDB Vector Search

Step 1: Navigate to Chat

Step 2: Ask a Question

Step 3: Check Backend Logs

Step 4: Look for Vector Search Success

Step 5: Check Frontend UI

Test Scenario 3: Fallback to FTS5 (Optional - Stop ChromaDB)

Step 1: Stop ChromaDB

Step 2: Ask a Question

Step 3: Check Backend Logs

Step 4: Look for Fallback Message

Step 5: Check Frontend UI

Step 6: Restart ChromaDB

Debugging Commands

View All Logs

Check ChromaDB Collection

Check Ollama Models

Test Backend API Directly

Expected Outputs

Successful Vector Search Response

Fallback FTS5 Response

No Results Response

Troubleshooting

Problem: "ChromaDB not healthy"

Problem: "Ollama not reachable"

Problem: "Failed to add chunks: 422"

Problem: Chat only returns keyword search results

Performance Baseline

Expected Times

Monitor Performance

Success Criteria