# Mini RAG App

A lightweight Retrieval-Augmented Generation (RAG) application that lets you ask questions about your documents in natural language. Built with FastAPI and designed for local development.
## Features

- **Simple & Lightweight** - Minimal dependencies, easy to set up and run
- **FastAPI Backend** - Built with FastAPI for high performance
- **Document Support** - Works with `.txt`, `.md`, and `.markdown` files
- **REST API** - Fully documented API endpoints
- **Test Coverage** - Comprehensive test suite included
## Prerequisites

- Python 3.9+
- pip (Python package manager)
## Installation

1. Clone the repository:

   ```bash
   git clone https://github.com/yourusername/mini-rag-app.git
   cd mini-rag-app
   ```

2. Create and activate a virtual environment:

   ```bash
   # Windows
   python -m venv venv
   .\venv\Scripts\activate

   # macOS/Linux
   python3 -m venv venv
   source venv/bin/activate
   ```

3. Install dependencies:

   ```bash
   pip install -r requirements.txt
   ```

4. Start the FastAPI server:

   ```bash
   uvicorn app.main:app --reload
   ```

5. Access the application:

   - Web Interface: http://127.0.0.1:8000
   - API Documentation: http://127.0.0.1:8000/docs
   - Alternative Docs: http://127.0.0.1:8000/redoc
## API Endpoints

- `POST /ingest` - Add documents to the knowledge base

  ```json
  {
    "documents": [
      {
        "text": "Your document text here",
        "metadata": {"source": "example.txt"}
      }
    ]
  }
  ```

- `POST /query` - Query the knowledge base

  ```json
  {
    "question": "Your question here",
    "k": 4
  }
  ```

- `GET /health` - Check API status
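The request bodies above can be exercised from Python. A minimal sketch using the `requests` library — it assumes the server from the Installation section is running on 127.0.0.1:8000, and the sample document text and source name are purely illustrative:

```python
import json

# Body for POST /ingest (shape taken from the endpoint description above);
# the sample text and "faiss_notes.txt" source are illustrative.
ingest_payload = {
    "documents": [
        {
            "text": "FAISS builds an index over embeddings for fast similarity search.",
            "metadata": {"source": "faiss_notes.txt"},
        }
    ]
}

# Body for POST /query; "k" is the number of results to retrieve.
query_payload = {"question": "What is FAISS used for?", "k": 4}

if __name__ == "__main__":
    # Requires a running server and `pip install requests`.
    import requests

    print(requests.post("http://127.0.0.1:8000/ingest", json=ingest_payload).json())
    print(requests.post("http://127.0.0.1:8000/query", json=query_payload).json())
```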
## Testing

Run the test suite with:

```bash
pytest tests/ -v
```

Or use the test runner script:

```bash
python run_tests.py
```
## 📚 Usage
### 1. Add Documents
Place your text or markdown files in the `docs/` directory and run the ingestion notebook:
```bash
jupyter notebook notebook/ingest_and_build.ipynb
```

### 2. Query Your Documents

Using curl:

```bash
curl -X POST "http://localhost:8000/query" \
  -H "Content-Type: application/json" \
  -d '{"question": "What is this document about?"}'
```

Using Python:

```python
import requests

response = requests.post(
    "http://localhost:8000/query",
    json={"question": "What is this document about?"}
)
print(response.json())
```
### 3. API Reference

- `POST /query` - Ask a question about your documents

  ```json
  {
    "question": "Your question here",
    "k": 4
  }
  ```

  `k` is optional and sets the number of results to return.

- `POST /ingest` - Add new documents (advanced)
- `GET /health` - Check API status
- `GET /` - API documentation
## Models

- Embeddings: `sentence-transformers/all-MiniLM-L6-v2`
- Language Model: `ggml-gpt4all-j-v1.3-groovy`
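Retrieval with an embedding model works by comparing vectors: the question and each document chunk are embedded (all-MiniLM-L6-v2 produces 384-dimensional vectors), and chunks are ranked by cosine similarity to the question. A toy sketch of the scoring step, using tiny hand-made vectors instead of real embeddings:

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Tiny stand-ins for real 384-dimensional embeddings
question = [1.0, 0.0, 1.0]
chunks = {
    "chunk_a": [1.0, 0.1, 0.9],   # points the same way -> high score
    "chunk_b": [-1.0, 0.5, 0.0],  # points away -> low score
}
ranked = sorted(chunks, key=lambda c: cosine_similarity(question, chunks[c]), reverse=True)
print(ranked)  # chunk_a ranks first
```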
## Project Structure

```
mini-rag-app/
├── README.md                 # This file
├── requirements.txt          # Python dependencies
├── .env.example              # Example environment variables
├── docs/                     # Your documents go here
├── notebook/                 # Jupyter notebook for document processing
│   └── ingest_and_build.ipynb
├── app/
│   ├── __init__.py
│   ├── main.py               # FastAPI application
│   ├── retriever.py          # Document retrieval with FAISS
│   └── utils.py              # Helper functions
├── Dockerfile                # For containerization
└── docker-compose.yml        # For easy Docker deployment
```
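`app/utils.py` holds the helper functions. A typical helper in a pipeline like this is a text chunker that splits documents into overlapping pieces before embedding; the function below is a hypothetical sketch of that idea, not the repository's actual code:

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks for embedding.

    Overlap keeps sentences that straddle a chunk boundary visible in
    both neighbouring chunks, which helps retrieval.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break
    return chunks

print(chunk_text("abcdefghij", chunk_size=4, overlap=1))
# ['abcd', 'defg', 'ghij']
```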
## Deployment

For production, consider using:

- Gunicorn with multiple workers
- Nginx as a reverse proxy
- A process manager such as PM2 or systemd
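For example, Gunicorn can supervise Uvicorn worker processes (the worker count here is only a starting point to tune for your machine, not a recommendation):

```bash
gunicorn app.main:app \
  --workers 4 \
  --worker-class uvicorn.workers.UvicornWorker \
  --bind 0.0.0.0:8000
```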
## Configuration

Copy `.env.example` to `.env` and adjust as needed:

```env
# Model configuration
EMBEDDING_MODEL=all-MiniLM-L6-v2
CHAT_MODEL=ggml-gpt4all-j-v1.3-groovy

# Server configuration
HOST=0.0.0.0
PORT=8000
```
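Inside the app, these variables would typically be read with `os.getenv`, falling back to the `.env.example` defaults. A sketch of that pattern — the actual variable handling in `app/main.py` may differ, and loading the `.env` file itself usually needs a library such as python-dotenv:

```python
import os

# Defaults mirror .env.example; environment variables override them.
EMBEDDING_MODEL = os.getenv("EMBEDDING_MODEL", "all-MiniLM-L6-v2")
CHAT_MODEL = os.getenv("CHAT_MODEL", "ggml-gpt4all-j-v1.3-groovy")
HOST = os.getenv("HOST", "0.0.0.0")
PORT = int(os.getenv("PORT", "8000"))  # ports are numbers; env values are strings

print(f"Serving on {HOST}:{PORT} with embeddings from {EMBEDDING_MODEL}")
```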
## License

This project is licensed under the MIT License - see the LICENSE file for details.
## Acknowledgments

- [sentence-transformers](https://www.sbert.net/) for the embedding model
- [FAISS](https://github.com/facebookresearch/faiss) for efficient similarity search
- [GPT4All](https://gpt4all.io/) for the local language model
- [FastAPI](https://fastapi.tiangolo.com/) for the web framework