Local RAG Chatbot with Ollama

This project is a local implementation of a Retrieval-Augmented Generation (RAG) chatbot, originally based on the tutorial from Hugging Face. The implementation has been modified to run entirely locally using Ollama for model inference.

Features

🚀 Fully Local: No API keys or internet connection required after initial setup
🐱 Cat Facts Knowledge Base: Comes pre-loaded with interesting cat facts
🔍 Semantic Search: Finds relevant information using vector similarity
💬 Interactive Chat: Ask questions and get answers based on the knowledge base

Prerequisites

Python 3.8+
Ollama installed and running
Required Python packages (install via pip install -r requirements.txt)

Setup

Install Ollama Download and install Ollama from ollama.ai
Start the Ollama server
```
ollama serve
```

Pull required models (in a new terminal):

ollama pull nomic-embed-text
ollama pull llama3

Install Python dependencies:
```
pip install -r requirements.txt
```

Usage

Run the chatbot:
```
python main.py
```
When prompted, type your question about cats and press Enter
Type 'quit', 'exit', or 'q' to exit the program

How It Works

Data Loading: The script loads cat facts from cat-facts.txt
Embedding Generation: Each fact is converted into a vector embedding using nomic-embed-text
Query Processing: When you ask a question:
- The question is converted to an embedding
- The system finds the most similar facts using cosine similarity
- The relevant context is sent to the language model (llama3)
- The model generates a response based on the retrieved context

Customization

Using Different Models

You can change the models in main.py by modifying these lines:

EMBEDDING_MODEL = 'nomic-embed-text'  # Other options: 'all-minilm', 'bge-small', etc.
LANGUAGE_MODEL = 'llama3'  # Other options: 'mistral', 'llama2', etc.

Adding Your Own Knowledge Base

Replace cat-facts.txt with your own text file
Each line should contain a single fact or piece of information
The system will automatically process the new file when you run main.py

Troubleshooting

If you get model not found errors, make sure you've pulled the models with ollama pull
Ensure the Ollama server is running before starting the script
For large knowledge bases, embedding generation might take some time on the first run

Credits

Based on the tutorial: Make Your Own RAG with Hugging Face

License

This project is open source and available under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
.gitignore		.gitignore
README.md		README.md
cat-facts.txt		cat-facts.txt
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Local RAG Chatbot with Ollama

Features

Prerequisites

Setup

Usage

How It Works

Customization

Using Different Models

Adding Your Own Knowledge Base

Troubleshooting

Credits

License

About

Uh oh!

Releases

Packages

Languages

osvaldoleiva/simple-rag-tutorial

Folders and files

Latest commit

History

Repository files navigation

Local RAG Chatbot with Ollama

Features

Prerequisites

Setup

Usage

How It Works

Customization

Using Different Models

Adding Your Own Knowledge Base

Troubleshooting

Credits

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages