A Retrieval-Augmented Generation (RAG) API that lets users upload PDF documents and ask questions about their content in natural language. The system uses OpenAI embeddings and Elasticsearch for efficient document retrieval, combined with LLM-powered question answering.
**PDF Document Processing**
- Upload multiple PDF files concurrently
- Automatic text extraction and chunking
- Vector embeddings generation using OpenAI's embedding model
- Efficient storage and indexing in Elasticsearch
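The extract-and-chunk step can be sketched as a simple overlapping-window splitter. This is an illustration only; the project's actual chunk size, overlap, and splitting strategy are assumptions here:

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split extracted PDF text into overlapping windows for embedding.

    chunk_size/overlap are illustrative values, not the project's settings.
    """
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        piece = text[start:start + chunk_size]
        if piece.strip():  # skip whitespace-only windows
            chunks.append(piece)
    return chunks
```

Each chunk is then embedded with OpenAI's embedding model and indexed in Elasticsearch; the overlap keeps sentences that straddle a boundary retrievable from either side.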
**Question Answering**
- Natural language question processing
- Semantic search using vector embeddings
- Context-aware answers using RAG
- Source chunks provided with answers
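Semantic search against Elasticsearch typically uses a kNN query over a `dense_vector` field. A sketch of what such a request body might look like — the `embedding`, `text`, and `source` field names are assumptions, not the project's actual schema:

```python
def build_knn_query(query_vector: list[float], k: int = 4) -> dict:
    """Build an Elasticsearch kNN search body for semantic retrieval.

    Field names ('embedding', 'text', 'source') are assumed, not confirmed.
    """
    return {
        "knn": {
            "field": "embedding",
            "query_vector": query_vector,
            "k": k,
            "num_candidates": 10 * k,  # wider candidate pool improves recall
        },
        "_source": ["text", "source"],  # chunk text plus provenance
    }
```

The retrieved chunk texts are placed into the LLM prompt as context, which is also how source chunks can be returned alongside each answer.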
**System Health Monitoring**
- Elasticsearch cluster health checks
- Index status monitoring
- Real-time system status reporting
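A cluster health check against Elasticsearch's standard `_cluster/health` endpoint might look like the following sketch (the API's own health route and response shape are not shown here):

```python
import json
import urllib.request

def cluster_health(base_url: str = "http://localhost:9200") -> dict:
    """Fetch cluster health from Elasticsearch's _cluster/health endpoint."""
    with urllib.request.urlopen(f"{base_url}/_cluster/health") as resp:
        return json.load(resp)

def is_healthy(health: dict) -> bool:
    """Treat green and yellow as healthy: single-node dev clusters commonly
    report yellow because replica shards cannot be assigned."""
    return health.get("status") in ("green", "yellow")
```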
**Prerequisites**
- Docker and Docker Compose
- OpenAI API Key
- (Optional) DeepSeek API Key for an alternative LLM provider
- Clone the repository:

```bash
git clone git@github.com:TulioChiodi/rag_challenge_api.git
cd rag_challenge_api
```

- Create a `.env` file in the project root by copying `.env.example`.
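The template can be copied like so; the keys to fill in presumably include the OpenAI API key and, optionally, the DeepSeek key listed in the prerequisites — check `.env.example` for the exact variable names:

```shell
cp .env.example .env
```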
The project includes three services:
- API Service (FastAPI): Handles document processing and RAG queries
- UI Service (Streamlit): Provides a user-friendly web interface
- Elasticsearch: Stores and indexes document embeddings
- Build and start all services:

```bash
docker compose up --build
```

- Access the services:
  - Web UI: http://localhost:8501
  - API documentation: http://localhost:8000/docs
  - Elasticsearch: http://localhost:9200
Once the service is running, you can access:
- Swagger UI documentation: http://localhost:8000/docs
- ReDoc documentation: http://localhost:8000/redoc
You can interact with the system either through the Web UI or directly via the API:
- Open the Streamlit interface in your browser: http://localhost:8501
- Use the intuitive interface to:
- Upload PDF documents using the file upload widget
- View processing status and document information
- Ask questions and see answers with source context
- Monitor system health
- Upload PDF documents:

```bash
curl -X POST "http://localhost:8000/documents" \
  -H "accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -F "files=@manual1.pdf" \
  -F "files=@manual2.pdf"
```

- Ask questions about the documents:
```bash
curl -X POST "http://localhost:8000/question" \
  -H "accept: application/json" \
  -H "Content-Type: application/json" \
  -d '{"question": "What are the maintenance procedures?"}'
```

For local development, we'll use a Python virtual environment and run Elasticsearch in Docker:
- Create a virtual environment:

```bash
python -m venv venv
```

- Activate it:

```bash
source venv/bin/activate
```

- Install project dependencies:

```bash
# For API development
pip install -r requirements.txt

# For UI development (Streamlit)
pip install -r ui-requirements.txt
```

- Set up Elasticsearch:
```bash
# Pull the image
docker pull elasticsearch:9.0.1

# Run Elasticsearch container
docker run -d \
  -p 9200:9200 -p 9300:9300 \
  -e "discovery.type=single-node" \
  -e "xpack.security.enabled=false" \
  -e "ES_JAVA_OPTS=-Xms1g -Xmx1g" \
  --name es-dev \
  elasticsearch:9.0.1
```

- Run the services:
```bash
# Terminal 1: Run the FastAPI application
python -m src.main

# Terminal 2: Run the Streamlit UI
streamlit run streamlit_app/app.py
```

The services will be available at:
- API: http://localhost:8000
- UI: http://localhost:8501
- Elasticsearch: http://localhost:9200
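With the services running, the API can also be called from Python. A minimal sketch using only the standard library, with the request shape taken from the curl examples above:

```python
import json
import urllib.request

def build_question_request(question: str,
                           base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build the POST /question request shown in the curl example."""
    return urllib.request.Request(
        f"{base_url}/question",
        data=json.dumps({"question": question}).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

# With the API running:
# with urllib.request.urlopen(build_question_request("What are the maintenance procedures?")) as r:
#     print(json.load(r))
```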
**Future improvements**
- Prometheus + Grafana
  - System metrics collection and visualization
  - Performance monitoring
  - Resource usage tracking
- Conversational interface
  - Chat history persistence
  - Context-aware follow-up questions
- Improve retrieval query quality
  - Two-step prompts for query preprocessing
  - Agentic RAG
- Multilayer storage
  - File metadata (name, description)
  - Graph database integration for relationships
- Hybrid search on Elasticsearch
  - Combine semantic and keyword search
  - Boost results based on metadata
- Semantic chunker
  - Experimental implementation
  - Parameter tuning for optimal chunks
- Database control endpoints
  - Recreate database
  - Remove files (whole document)
  - Retrieve file list
  - Change document content/metadata
Contributions and feedback on these improvements are welcome!