Relevance Search - AI-Powered Search Engine

An intelligent search engine that combines real-time web search with AI-powered answer generation using OpenRouter API.

🌐 Live Demo | Built on the excellent work of Wilson-ZheLin/SearchGPT

✨ Features

🔍 Real-time Web Search via Serper (Google API)
🤖 AI-Powered Answers using OpenRouter (multiple free models available)
🌐 Beautiful Streamlit Interface with Lottie animations
📚 Semantic Search with ChromaDB vector database
🎯 Smart Document Retrieval using text-embedding-3-small or Gitee BGE-M3
🔗 Source Citations with clickable references
🌍 Multi-language Support (auto-detects Chinese/English)
⚡ Multi-threaded Web Scraping for fast content extraction
💾 Export Results as TXT or JSON
🎨 No LangChain Required - lightweight and fast
📊 Complete Pipeline Tracing - see every step with timing, API calls, and similarity scores
🔎 Full Prompt Visibility - inspect exactly what's sent to the LLM
⏱️ Performance Metrics - track time spent on each pipeline step

🚀 Quick Start

Prerequisites

Python 3.11+ recommended
OpenRouter API Key (free tier available)
Serper API Key (2,500 free queries)

Installation

Clone the repository

git clone <your-repo-url>
cd Relevance Search

Install dependencies

pip install -r requirements.txt

Configure API Keys

You can either:

Enter them in the Streamlit UI when running the app, OR
Save them in src/config/config.yaml:

model_name: x-ai/grok-4.1-fast:free
openrouter_api_key: "your-openrouter-key-here"
serper_api_key: "your-serper-key-here"

Running the Application

Streamlit Web Interface (Recommended):

streamlit run app.py

Command Line:

python src/main.py

🎯 Available Models

All models are completely free via OpenRouter:

amazon/nova-2-lite-v1:free - Amazon's Nova 2 Lite model (default)
nvidia/nemotron-nano-9b-v2:free - NVIDIA's efficient 9B model
qwen/qwen3-4b:free - Alibaba's Qwen3 4B compact model
alibaba/tongyi-deepresearch-30b-a3b:free - Alibaba's research-focused 30B model

📁 Project Structure

Relevance Search/
├── app.py                      # Streamlit web interface
├── src/
│   ├── main.py                # CLI entry point
│   ├── fetch_web_content.py   # Web scraping with multi-threading
│   ├── serper_service.py      # Serper API integration
│   ├── retrieval.py           # Vector database & embeddings
│   ├── llm_answer.py          # AI answer generation
│   ├── llm_service.py         # OpenRouter API service
│   ├── text_utils.py          # Text processing utilities
│   └── config/
│       └── config.yaml        # Configuration file
├── requirements.txt           # Python dependencies
└── README.md                  # This file

🔧 Configuration

The src/config/config.yaml file supports:

model_name: AI model to use
openrouter_api_key: Your OpenRouter API key
serper_api_key: Your Serper API key
template: Custom prompt template for AI responses

📸 Screenshots

Main Interface

![Relevance Search Interface](assets/Relevance Search.png)

Pipeline Trace - Search & Scraping

Pipeline Trace - Embeddings & Retrieval

Pipeline Trace - Chunks & Similarity Scores

Pipeline Trace - Full Prompt & Generation

📖 Usage Examples

Via Streamlit UI

Launch the app: streamlit run app.py
Enter your API keys in the sidebar (or use saved keys)
Select your preferred AI model
Choose an AI profile (Researcher, Technical Expert, etc.)
Enter your search query
Click "🚀 Search"

Via Command Line

Edit the query in src/main.py and run:

python src/main.py

🛠️ Key Technologies

OpenRouter API - Access to multiple AI models
Serper API - Fast Google search results
ChromaDB - Vector database for semantic search
Streamlit - Modern web interface
BeautifulSoup4 - Web scraping
text-embedding-3-small - Efficient text embeddings

🔍 How It Works

Search: Query sent to Serper API for real-time web results
Scrape: Multi-threaded extraction of content from top results
Embed: Text chunked and converted to vector embeddings (OpenRouter or Gitee AI)
Retrieve: Semantic search finds most relevant content with similarity scores
Generate: AI model creates comprehensive answer with citations

Pipeline Tracing

Relevance Search provides complete visibility into every step of the RAG pipeline:

Step 1 - Search: See all URLs, titles, and snippets returned by Serper
Step 2 - Scraping: Track success/failure for each page with content previews
Step 3 - Embeddings: View API calls made, timing, and chunks processed
Step 4 - Retrieval: Inspect similarity scores for each retrieved chunk (color-coded by relevance)
Step 5 - Generation: See the exact prompt sent to the LLM and all context chunks

All steps include timing information to help identify bottlenecks and optimize performance.

📝 Logging

Comprehensive logging is built-in. Check your terminal for detailed logs:

Serper API requests and responses
Web scraping progress (per thread)
Embedding generation status
AI model responses
Error diagnostics

🤝 Contributing

Contributions are welcome! Feel free to:

Report bugs
Suggest features
Submit pull requests

📄 License

This project is licensed under the MIT License.

🙏 Acknowledgments

This project is built on the foundation of Wilson-ZheLin/SearchGPT. Significant enhancements include:

Migration from OpenAI to OpenRouter API
Removal of LangChain dependencies
Addition of Streamlit web interface
Modern ChromaDB integration
Comprehensive logging system
Updated embedding models

⭐ Star This Repo

If you find this project useful, please give it a star! ⭐

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
assets		assets
docs		docs
src		src
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
README_STREAMLIT.md		README_STREAMLIT.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Relevance Search - AI-Powered Search Engine

✨ Features

🚀 Quick Start

Prerequisites

Installation

Running the Application

🎯 Available Models

📁 Project Structure

🔧 Configuration

📸 Screenshots

Main Interface

Pipeline Trace - Search & Scraping

Pipeline Trace - Embeddings & Retrieval

Pipeline Trace - Chunks & Similarity Scores

Pipeline Trace - Full Prompt & Generation

📖 Usage Examples

Via Streamlit UI

Via Command Line

🛠️ Key Technologies

🔍 How It Works

Pipeline Tracing

📝 Logging

🤝 Contributing

📄 License

🙏 Acknowledgments

⭐ Star This Repo

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Relevance Search - AI-Powered Search Engine

✨ Features

🚀 Quick Start

Prerequisites

Installation

Running the Application

🎯 Available Models

📁 Project Structure

🔧 Configuration

📸 Screenshots

Main Interface

Pipeline Trace - Search & Scraping

Pipeline Trace - Embeddings & Retrieval

Pipeline Trace - Chunks & Similarity Scores

Pipeline Trace - Full Prompt & Generation

📖 Usage Examples

Via Streamlit UI

Via Command Line

🛠️ Key Technologies

🔍 How It Works

Pipeline Tracing

📝 Logging

🤝 Contributing

📄 License

🙏 Acknowledgments

⭐ Star This Repo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages