A local-first RAG system for talking to long-form content. Point it at YouTube playlists, podcasts, or blogs, and it ingests, chunks, indexes, and lets you have real conversations about the material — all running against a local LLM by default.
I kept saving hours of video and podcast content I never had time to revisit. Existing tools either shipped my data to a cloud provider or locked me into one ecosystem, so I built something that runs entirely on my own machine and lets me ask "what did this guy say about magnesium?" instead of scrubbing through a 3-hour episode.
- Python 3 · FastAPI · WebSockets
- LM Studio (default) for local inference, with OpenAI as an optional drop-in
- Custom chunking + retrieval pipeline (no heavy vector DB dependency)
- pdfplumber / PyPDF2 for document ingest
- youtube-transcript-api for video sources
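The stack above skips a vector database entirely. One way that kind of retrieval can work — sketched here under my own assumptions, not the project's actual scoring — is plain lexical overlap: score each chunk by how often the query's terms appear in it, no embeddings or vector store involved.

```python
import re
from collections import Counter

def tokenize(text: str) -> list[str]:
    """Lowercase and split on non-alphanumerics."""
    return re.findall(r"[a-z0-9]+", text.lower())

def rank_chunks(query: str, chunks: list[str], top_k: int = 3) -> list[str]:
    """Return the top_k chunks scored by query-term frequency.
    A simplified stand-in for the project's custom retrieval pipeline."""
    query_terms = set(tokenize(query))

    def score(chunk: str) -> int:
        counts = Counter(tokenize(chunk))
        return sum(counts[t] for t in query_terms)

    return sorted(chunks, key=score, reverse=True)[:top_k]
```

Real implementations usually add IDF weighting and length normalization (BM25-style), but even this naive version illustrates why a dedicated vector DB isn't strictly required for grounded Q&A over a bounded corpus.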
- Pulls transcripts from a YouTube channel, playlist, or arbitrary podcast/blog source
- Chunks content with overlap, runs an extraction pass to pull out the substantive claims, and builds a lightweight retrievable index
- Serves a chat UI over WebSockets so you can ask questions and get answers grounded in the source material with citations back to the original timestamps
- Swaps between a local LM Studio model and OpenAI with a single setting — same code path either way
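The overlap chunking mentioned above can be sketched like this — a simplified, character-based illustration; the sizes and split strategy are my assumptions, not the project's actual parameters:

```python
def chunk_text(text: str, size: int = 800, overlap: int = 200) -> list[str]:
    """Split text into fixed-size chunks that overlap, so a claim
    straddling a boundary still appears whole in at least one chunk."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    chunks = []
    step = size - overlap  # how far the window advances each iteration
    for start in range(0, len(text), step):
        chunk = text[start:start + size]
        if chunk:
            chunks.append(chunk)
        if start + size >= len(text):
            break  # last window already reached the end
    return chunks
```

Each downstream chunk then gets the extraction pass and an index entry pointing back to its source timestamp.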
- `HOW_IT_WORKS.md` walks through the architecture and the design choices behind the no-vector-DB retrieval approach
- `WEB_INTERFACE.md` documents the chat UI
- `RAG_PLAN_OPTIMIZED_NO_VECTORS.md` and `RAG_PLAN_UPDATED.md` are the design docs that drove the build
Local-only: runs against LM Studio on `localhost:1234` by default. Demo available on request.
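Since LM Studio serves an OpenAI-compatible API on `localhost:1234`, the single-setting swap can boil down to pointing the same client code at a different base URL. A minimal sketch (the function name and the placeholder key are mine, not the project's):

```python
import os

def llm_config(provider: str = "lmstudio") -> dict:
    """Return base URL / key settings for the chosen backend.
    Both backends speak the OpenAI chat-completions wire format,
    so the calling code path is identical either way."""
    if provider == "lmstudio":
        # LM Studio's local server; it doesn't check the key,
        # so any placeholder value works here.
        return {"base_url": "http://localhost:1234/v1", "api_key": "local"}
    if provider == "openai":
        return {"base_url": "https://api.openai.com/v1",
                "api_key": os.environ.get("OPENAI_API_KEY", "")}
    raise ValueError(f"unknown provider: {provider}")
```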