RAG System with Ollama & LangChain

πŸ“Œ Project Overview

This repository contains a Retrieval-Augmented Generation (RAG) implementation built with LangChain, ChromaDB, and Ollama.

The project demonstrates how a local LLM can answer user questions by retrieving relevant context from a small set of documents stored in a vector database.

This implementation focuses on core RAG concepts rather than full-scale optimization or sector-specific experimentation.

βš™οΈ Technologies Used

  • LangChain – RAG pipeline and LLM interface
  • ChromaDB – Vector store for similarity search
  • Ollama – Local LLM runtime
  • Ollama Embeddings – Vector embeddings for documents
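
For orientation, these components map onto Python imports roughly as follows (a minimal sketch assuming the langchain-community integration packages; exact module paths vary between LangChain releases):

```python
# Minimal import sketch (assumes the langchain-community integrations;
# module paths differ slightly across LangChain versions).
from langchain_community.embeddings import OllamaEmbeddings  # document/query embeddings via Ollama
from langchain_community.vectorstores import Chroma          # ChromaDB vector store wrapper
from langchain_community.llms import Ollama                  # local LLM served by the Ollama runtime
```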

🧠 How the RAG System Works

  1. A set of local documents is defined in the script
  2. Documents are converted into embeddings using OllamaEmbeddings
  3. Embeddings are stored in ChromaDB
  4. User questions are matched against the vector store
  5. The most relevant document is retrieved
  6. The retrieved context is injected into the LLM prompt
  7. The LLM generates an answer based on the retrieved context (see the sketch below)
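
A condensed sketch of these seven steps is shown below. It assumes the langchain-community wrappers and a locally pulled Ollama model (the model name, document text, and prompt wording are illustrative, not taken from the actual script):

```python
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_community.llms import Ollama

# Steps 1-3: embed the local documents and store them in ChromaDB
documents = ["Zeynep Col has lived in NYC for 10 years."]      # see Example Documents below
embeddings = OllamaEmbeddings(model="llama3")                  # model name is an assumption
vectorstore = Chroma.from_texts(documents, embedding=embeddings)

# Steps 4-5: retrieve the most relevant document for the user's question
question = "How long has Zeynep Col lived in NYC?"
context = vectorstore.similarity_search(question, k=1)[0].page_content

# Steps 6-7: inject the retrieved context into the prompt and generate an answer
prompt = f"Answer the question using only this context:\n{context}\n\nQuestion: {question}"
llm = Ollama(model="llama3")
print(llm.invoke(prompt))
```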

πŸ§ͺ Example Documents

  • Zeynep Col has lived in NYC for 10 years.
  • Zeynep Col is an imaginary LLM engineer in the movie 'The Matrix'.
  • New York City's subway system is the oldest in the world.
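
In the script these documents would be defined as a plain Python list (the variable name here is an assumption):

```python
# The toy knowledge base that gets embedded into ChromaDB.
documents = [
    "Zeynep Col has lived in NYC for 10 years.",
    "Zeynep Col is an imaginary LLM engineer in the movie 'The Matrix'.",
    "New York City's subway system is the oldest in the world.",
]
```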

Runtime Flow & Output

▢️ Runtime Flow

When the script is executed, the following steps occur in order:

  • Available Ollama models are listed from the local Ollama runtime
  • The user selects a model interactively
  • The user enters questions in a continuous loop
  • The system retrieves the most relevant document from ChromaDB
  • The retrieved context is injected into the LLM prompt
  • The LLM generates and prints the final response (see the sketch below)
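
A sketch of this interactive flow follows. It assumes the ollama CLI is on the PATH for listing models and rebuilds the vector store over the example documents; variable names and prompt wording are illustrative. The labeled prints also correspond to the output structure described in the next section.

```python
import subprocess
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_community.llms import Ollama

# List locally installed models (relies on the `ollama` CLI being on the PATH).
subprocess.run(["ollama", "list"], check=True)
model_name = input("Model to use: ").strip()

# Build the vector store over the example documents.
documents = [
    "Zeynep Col has lived in NYC for 10 years.",
    "Zeynep Col is an imaginary LLM engineer in the movie 'The Matrix'.",
    "New York City's subway system is the oldest in the world.",
]
vectorstore = Chroma.from_texts(documents, embedding=OllamaEmbeddings(model=model_name))
llm = Ollama(model=model_name)

# Continuous question loop: retrieve, build the prompt, generate, and print each stage.
while True:
    question = input("\nQuestion (empty to quit): ").strip()
    if not question:
        break
    context = vectorstore.similarity_search(question, k=1)[0].page_content
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    print("Question:", question)
    print("Retrieved context:", context)
    print("Prompt sent to LLM:\n" + prompt)
    print("Response:", llm.invoke(prompt))
```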

πŸ“Š Output Section

During execution, the program prints:

  • The user question
  • The retrieved RAG context
  • The final prompt sent to the LLM
  • The LLM-generated response

This output flow makes it easy to observe how retrieval affects the final answer.

πŸ“Š Output Image

(Screenshot of an example run; the image is included in the repository.)

🏁 Summary

This runtime-focused README documents the interactive behavior and output structure of the RAG system.
It complements the main README by explaining how the system behaves during execution and how retrieved context influences LLM responses.

🀝 Contributing

Contributions are welcome!

πŸ“‘ Contact

For any queries or collaborations, feel free to reach out!

🌐 GitHub: zeynepcol
πŸ‘€ LinkedIn: zeynep-col
