RAG server by yuqiannemo · Pull Request #9 · obro79/Tower

yuqiannemo · 2025-10-19T11:05:47Z

API Comparison: Unified Server vs Old Servers

Overview

The new unified_server.py combines the best of both main.py and rag_server.py, removing duplicates and adding Colinear Query Expansion.

🔄 Endpoint Mapping

OLD: main.py → NEW: unified_server.py

Old Endpoint	New Endpoint	Status	Notes
`GET /files/search`	`GET /search/keyword`	✅ Kept	Wildcard filename search
`GET /files/{file_id}`	`GET /files/{file_id}`	✅ Kept	Same functionality
`POST /files/register`	`POST /files/register`	✅ Enhanced	Now returns `action` field
`DELETE /files/{file_id}`	`DELETE /files/{file_id}`	✅ Enhanced	Also deletes embeddings

OLD: rag_server.py → NEW: unified_server.py

Old Endpoint	New Endpoint	Status	Notes
`POST /upload`	`POST /files/upload`	✅ Enhanced	Better naming, update support
`POST /search`	`POST /search/semantic`	✅ UPGRADED	Now with Query Expansion!
`GET /files`	`GET /files`	✅ Kept	Same functionality
`GET /files/{id}`	`GET /files/{file_id}`	✅ Merged	Combined with main.py version
`GET /stats`	`GET /stats`	✅ Enhanced	More detailed stats
`GET /`	`GET /`	✅ Enhanced	Better health check

🆕 What's New: Colinear Query Expansion

How It Works

When you search with POST /search/semantic:

{
  "query": "machine learning tutorial",
  "top_k": 5,
  "use_query_expansion": true,
  "expansion_count": 3
}

Behind the scenes:

Original Query: "machine learning tutorial"
Expanded Query 1: "document about machine learning tutorial"
Expanded Query 2: "file containing machine learning tutorial"
Expanded Query 3: "information regarding machine learning tutorial"

Each variant is:

Converted to an embedding
Searched against the vector database
Results are combined and deduplicated
Best match for each file is kept
Final results are ranked by similarity

Response Format

[
  {
    "file_id": 42,
    "filename": "ml_guide.pdf",
    "path": "/home/user/Documents/ml_guide.pdf",
    "device": "laptop",
    "device_ip": "192.168.1.10",
    "device_user": "john",
    "size": 1048576,
    "file_type": ".pdf",
    "similarity_score": 0.89,  // 0-1, higher = better match
    "last_modified_time": "2024-10-15T10:30:00",
    "matched_via": "expanded_query_2"  // Shows which query variant matched
  }
]

📊 Key Improvements

1. Unified API

Single server instead of two separate ones
Consistent naming conventions
No duplicate endpoints

2. Query Expansion

# OLD (rag_server.py):
POST /search → basic semantic search

# NEW (unified_server.py):
POST /search/semantic → semantic search + query expansion
  - Generates multiple query variants
  - Searches with each variant
  - Combines and ranks results
  - Shows which variant matched

3. Better Search Options

Keyword Search (GET /search/keyword): Fast filename matching
Semantic Search (POST /search/semantic): Content-aware with query expansion

4. Enhanced File Operations

Register (metadata only): POST /files/register
Upload (with embedding): POST /files/upload
Both support create and update operations
Both return action: "created" or "updated"

5. Improved Responses

All search results include similarity_score (0-1)
Semantic search shows matched_via field
Stats endpoint shows embedding model info

🚀 Migration Guide

For Existing Clients

Keyword Search (no changes needed):

# OLD: GET /files/search?query=*.txt
# NEW: GET /search/keyword?query=*.txt

Semantic Search (with new features):

# OLD:
curl -X POST http://localhost:8000/search \
  -H 'Content-Type: application/json' \
  -d '{"query": "python code", "top_k": 5}'

# NEW (with query expansion):
curl -X POST http://localhost:8000/search/semantic \
  -H 'Content-Type: application/json' \
  -d '{
    "query": "python code",
    "top_k": 5,
    "use_query_expansion": true,
    "expansion_count": 3
  }'

File Upload (new path):

# OLD: POST /upload
# NEW: POST /files/upload

🎯 Performance Comparison

Feature	main.py	rag_server.py	unified_server.py
Keyword Search	✅	❌	✅
Semantic Search	❌	✅	✅ + Query Expansion
File Registration (metadata only)	✅	❌	✅
File Upload (with embedding)	❌	✅	✅
Update Support	✅	❌	✅
Embedding Deletion	❌	❌	✅
Query Expansion	❌	❌	✅ NEW!
Local Model (no API key)	N/A	❌ (was using APIs)	✅

💡 Usage Examples

Example 1: Register File Metadata (No Upload)

curl -X POST http://localhost:8000/files/register \
  -H 'Content-Type: application/json' \
  -d '{
    "file_name": "report.pdf",
    "absolute_path": "/home/user/report.pdf",
    "device": "laptop",
    "device_ip": "192.168.1.10",
    "device_user": "john",
    "last_modified_time": "2024-10-19T12:00:00",
    "size": 2048000,
    "file_type": ".pdf"
  }'

Example 2: Upload File with Embedding

curl -X POST http://localhost:8000/files/upload \
  -F "file=@document.txt" \
  -F "device=laptop" \
  -F "device_ip=192.168.1.10" \
  -F "device_user=john" \
  -F "absolute_path=/home/user/document.txt"

Example 3: Semantic Search with Query Expansion

curl -X POST http://localhost:8000/search/semantic \
  -H 'Content-Type: application/json' \
  -d '{
    "query": "budget analysis",
    "top_k": 5,
    "use_query_expansion": true,
    "expansion_count": 3
  }'

Example 4: Keyword Search (Fast)

curl -X GET 'http://localhost:8000/search/keyword?query=*.xlsx'

🔧 Configuration

All settings in unified_server.py:

EMBEDDING_DIM = 384  # all-MiniLM-L6-v2 (local model)
DB_PATH = "file_records.db"  # SQLite database
VECTOR_DB_PATH = "vectors.db"  # Vector embeddings
FAISS_INDEX_PATH = "faiss.index"  # FAISS index

✅ Next Steps

Stop old servers (if running):
```
# Stop main.py or rag_server.py
```

Start unified server:

cd /data/qyu/projects/dubhacks1/backend
python unified_server.py

Test query expansion:

curl -X POST http://localhost:8000/search/semantic \
  -H 'Content-Type: application/json' \
  -d '{"query": "test", "top_k": 5, "use_query_expansion": true}'

Check stats:
```
curl http://localhost:8000/stats
```

Integrate frontend to backend

…-server

AndyJLi0 · 2025-10-25T02:19:56Z

Closing, changes already in master

yuqiannemo and others added 6 commits October 19, 2025 16:12

Add embedding function

25f50bb

Add example env file

d310fbd

Merge pull request #6 from obro79/integrate-frontend-to-backend

460ecb4

Integrate frontend to backend

Merge remote-tracking branch 'origin/nemo/mapping-func' into nemo/rag…

4922c34

…-server

Add rag server

1acae26

Refactor to unified server

35294a9

aaryan-rampal force-pushed the master branch from 460ecb4 to 7a62952 Compare October 19, 2025 14:08

AndyJLi0 closed this Oct 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RAG server#9

RAG server#9
yuqiannemo wants to merge 6 commits intomasterfrom
nemo/rag-server

yuqiannemo commented Oct 19, 2025 •

edited

Loading

Uh oh!

AndyJLi0 commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yuqiannemo commented Oct 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

API Comparison: Unified Server vs Old Servers

Overview

🔄 Endpoint Mapping

OLD: main.py → NEW: unified_server.py

OLD: rag_server.py → NEW: unified_server.py

🆕 What's New: Colinear Query Expansion

How It Works

Response Format

📊 Key Improvements

1. Unified API

2. Query Expansion

3. Better Search Options

4. Enhanced File Operations

5. Improved Responses

🚀 Migration Guide

For Existing Clients

🎯 Performance Comparison

💡 Usage Examples

Example 1: Register File Metadata (No Upload)

Example 2: Upload File with Embedding

Example 3: Semantic Search with Query Expansion

Example 4: Keyword Search (Fast)

🔧 Configuration

✅ Next Steps

Uh oh!

AndyJLi0 commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yuqiannemo commented Oct 19, 2025 •

edited

Loading