🥈 Silver Retriever (V1)

A Modular, Offline RAG System for Low-Resource Environments.

📖 Introduction

Silver Retriever is a specialized, offline-first RAG system. Inspired by AI Singapore's (AISG) robust Golden Retriever, this project represents my initiative to "walk the walk" in AI Engineering.

My goal was to reverse-engineer the core concepts of retrieval systems (like the Golden Retriever) but adapt them for extreme resource constraints. While modern systems rely on heavy GPUs for Neural Embeddings (BERT/LLMs), Silver Retriever proves that effective search pipelines can be built on legacy hardware (e.g., a 2017 MacBook Air) by using a smart hybrid approach:

Statistical Search (TF-IDF): Delivers instant, CPU-friendly retrieval speed.
Rule-Based Plugins (The Brain): Simulates "reasoning" by using domain-specific logic to extract deadlines, definitions, and tasks.

🚀 Key Features

100% Offline: No API keys, no internet required. Privacy-first.
Plugin System: Auto-detects intents (e.g., "When is the deadline?" triggers the Admin plugin).
Smart Chunking: Uses overlapping windows to preserve context across sentences.
Visual Feedback: UI badges indicate why a result was chosen (e.g., "Contains Date").
Multi-Subject: Supports any PDF/TXT file (AIAP, Feng Shui, Novels, etc.).

🏗️ Architecture

The system follows a 3-Layer Modular Architecture:

graph TD
    User([User Query]) --> UI[app.py - Interface]
    UI --> Brain[src/brain.py - The Manager]
    
    subgraph Logic["The Brain Layer"]
    Brain --> Check{Check Triggers}
    Check -->|Match| Plugin[src/plugins/*.py]
    Check -->|No Match| Engine
    end
    
    subgraph Muscle["The Engine Layer"]
    Engine[src/engine.py] -->|TF-IDF Search| Storage[(data/storage.pkl)]
    end
    
    Plugin -->|Boost Score| UI
    Engine -->|Raw Results| UI

📂 Project Structure

Silver_Retriever/
├── app.py                 # The User Interface (Streamlit)
├── src/
│   ├── engine.py          # The Core: Ingestion, Chunking, Vectorization
│   ├── brain.py           # The Manager: Routes queries to plugins
│   └── plugins/           # The Skills Folder
│       ├── admin.py       # Detects dates/tasks
│       ├── marketing.py   # Detects SEO/SEM terms
│       ├── feng_shui.py   # Detects placement advice
│       └── ...
├── data/
│   └── raw/               # (GitIgnored) Where your PDFs live
├── .github/workflows/     # CI/CD Pipelines
└── requirements.txt       # Dependencies

🛠️ Installation & Usage

Clone the repo

git clone https://github.com/Lwg78/Silver-Retriever.git
cd Silver-Retriever

Install Dependencies

pip install -r requirements.txt

Run the App

streamlit run app.py

📊 Insights & Limitations

Why TF-IDF? On a dual-core CPU with 8GB RAM, embedding models (like ChromaDB + SentenceTransformers) introduce 2-3 seconds of latency per query. TF-IDF is instant (<0.1s).
The Trade-off: TF-IDF searches for keywords, not meaning. "What is the cost?" might not match "The price is $50".
The Fix: We implemented Query Expansion (synonyms) and Plugins to bridge this gap without upgrading hardware.

🔜 Future Roadmap

Add BM25 Ranking for better document length normalization.
Add "Fuzzy Matching" for typo tolerance.
Export search results to CSV.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
notebooks		notebooks
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🥈 Silver Retriever (V1)

📖 Introduction

🚀 Key Features

🏗️ Architecture

📂 Project Structure

🛠️ Installation & Usage

📊 Insights & Limitations

🔜 Future Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🥈 Silver Retriever (V1)

📖 Introduction

🚀 Key Features

🏗️ Architecture

📂 Project Structure

🛠️ Installation & Usage

📊 Insights & Limitations

🔜 Future Roadmap

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages