dRAG 🐉

chat with your docs using RAG (retrieval-augmented generation)! drop in any documentation link and start asking questions in natural language.

what it does

dRAG crawls your documentation, splits it into chunks, embeds them, and gives you a chatbot that answers questions about your docs with links back to the source pages. no more ctrl+f hell!

key features

easy ingestion: just drop a url, we handle the rest
smart crawling: automatically detects and validates documentation sites
vector search: embeds chunks with openai embeddings and retrieves them by vector similarity (pgvector)
context-aware responses: provides answers with links to source documentation
multiple doc sets: maintain separate chatbots for different documentation
async processing: handles large documentation sets efficiently

tech stack

backend: fastapi + python (async all the way!)
vector store: postgres + pgvector
embeddings: openai
llm: anthropic's claude
crawler: crawlee
task orchestration: prefect

api usage

ingest new documentation

curl -X POST http://localhost:8000/docs/ingest \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://docs.example.com",
    "name": "example_docs",
    "max_pages": 100
  }'

chat with your docs

curl -X POST http://localhost:8000/chat \
  -H "Content-Type: application/json" \
  -d '{
    "identifier": {
      "id_or_name": "example_docs"
    },
    "message": "how do i get started?"
  }'
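
the same two calls can be made from python. below is a minimal sketch using only the standard library, assuming the server runs on localhost:8000 as in the curl examples. the helper names (`ingest_payload`, `chat_payload`, `build_request`, `send`) are made up for illustration and are not part of dRAG, and since the response format isn't described here, `send` just assumes the body is json and returns it as-is.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # same host/port as the curl examples


def ingest_payload(url, name, max_pages=100):
    """body for POST /docs/ingest, mirroring the curl example."""
    return {"url": url, "name": name, "max_pages": max_pages}


def chat_payload(id_or_name, message):
    """body for POST /chat; the doc set is referenced via a nested identifier."""
    return {"identifier": {"id_or_name": id_or_name}, "message": message}


def build_request(path, payload, base_url=BASE_URL):
    """build (but do not send) a json POST request."""
    return urllib.request.Request(
        base_url + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=30):
    """send the request and parse the response as json (assumed, not documented here)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

with the server up, `send(build_request("/docs/ingest", ingest_payload("https://docs.example.com", "example_docs")))` starts an ingest, and `send(build_request("/chat", chat_payload("example_docs", "how do i get started?")))` asks a question against it.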

how it works

validation: first checks if the url points to valid documentation
crawling: recursively crawls the documentation site, respecting rate limits
processing: converts html to text, preserving important structure
chunking: splits content into chunks sized for retrieval
embedding: generates embeddings for semantic search
storage: saves chunks and metadata in postgres
retrieval: uses vector similarity to find relevant context
generation: combines retrieved context with claude for accurate responses
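
to make the chunking, embedding, retrieval, and generation steps above concrete, here is a toy sketch in plain python. it is not dRAG's actual code: the chunker is a simple fixed-size word window with overlap, `toy_embed` is a hashed bag-of-words stand-in for the openai embeddings, an in-memory list stands in for postgres + pgvector, and the prompt format is invented. every function name and parameter here is hypothetical.

```python
import hashlib
import math


def chunk_words(text, size=50, overlap=10):
    """split text into windows of `size` words; neighbours share `overlap` words."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be in [0, size)")
    words = text.split()
    chunks = []
    for start in range(0, len(words), size - overlap):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # this window already reached the end of the text
    return chunks


def toy_embed(text, dims=64):
    """stand-in embedding: hashed bag-of-words, scaled to unit length."""
    vec = [0.0] * dims
    for word in text.lower().split():
        digest = hashlib.md5(word.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dims] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def cosine(a, b):
    """cosine similarity; inputs are unit length, so this is just the dot product."""
    return sum(x * y for x, y in zip(a, b))


def build_store(pages, size=50, overlap=10):
    """pages: dict of source url -> plain text. returns one row per chunk."""
    store = []
    for source, text in pages.items():
        for chunk in chunk_words(text, size, overlap):
            store.append({"text": chunk, "source": source, "embedding": toy_embed(chunk)})
    return store


def retrieve(query, store, k=2):
    """return the k (chunk text, source url) pairs most similar to the query."""
    q = toy_embed(query)
    ranked = sorted(store, key=lambda row: cosine(q, row["embedding"]), reverse=True)
    return [(row["text"], row["source"]) for row in ranked[:k]]


def build_prompt(question, hits):
    """put retrieved chunks and their source links into a prompt for the llm."""
    context = "\n\n".join(
        f"[{i}] {text}\nsource: {source}" for i, (text, source) in enumerate(hits, 1)
    )
    return (
        "answer using only the context below and cite sources by number.\n\n"
        f"{context}\n\nquestion: {question}"
    )
```

keeping the source url next to every chunk is what lets the final answer link back to the page it came from. in a real setup the sort in `retrieve` would be replaced by a similarity query against the vector store.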

contributing

check out CONTRIBUTING.md for development setup and guidelines!
