A Streamlit-based Retrieval-Augmented Generation (RAG) chatbot that answers questions from websites, PDFs, or raw text using embeddings, FAISS vector search, and a Groq-powered LLM.
- Website-based Q&A
- PDF-based Q&A
- Text-based Q&A
- Chat-style UI with history
- New Chat (session reset)
- Apply button (runs only on click)
- Context-only answers (reduces hallucinations)
- Deployment-ready (Streamlit Cloud / AWS)
- User selects Website / PDF / Text
- Content is ingested and split into chunks
- Chunks are embedded and stored in FAISS
- User asks a question
- Relevant chunks are retrieved
- LLM generates an answer strictly from context
- Answer is shown in Streamlit UI
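The flow above can be sketched end-to-end. This is a minimal illustrative stand-in, not the project's actual code: a toy bag-of-words embedding replaces the real embedding model, a brute-force cosine search replaces FAISS, and the LLM call is represented only by the context-restricted prompt it would receive.

```python
import math
from collections import Counter

def chunk(text, size=40):
    """Split raw text into overlapping word chunks (stand-in for a real splitter)."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size // 2)]

def embed(text):
    """Toy bag-of-words vector; the real pipeline uses a sentence-embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(question, chunks, k=2):
    """Brute-force nearest neighbours; FAISS does the same lookup at scale."""
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

def build_prompt(question, context_chunks):
    """Context-only prompt so the LLM answers strictly from retrieved text."""
    context = "\n".join(context_chunks)
    return (
        "Answer ONLY from the context below. If the answer is not in the "
        f"context, say you don't know.\n\nContext:\n{context}\n\nQuestion: {question}"
    )

doc = ("FAISS is a library for efficient similarity search. " * 5
       + "Groq serves large language models with low latency. " * 5)
chunks = chunk(doc)
top = retrieve("What is FAISS used for?", chunks)
prompt = build_prompt("What is FAISS used for?", top)
```

In the real app, `prompt` would be sent to the Groq-hosted LLM and the response rendered in the Streamlit chat UI.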
```
chatbot/
│
├── app.py                        # Streamlit UI (frontend)
├── main.py                       # Backend orchestrator
├── requirements.txt
├── README.md
├── .env
├── .gitignore
│
├── embeddings/                   # FAISS vector database (auto-created)
├── logs/                         # Application logs
├── uploaded_pdfs/                # Uploaded PDF files (optional use)
│
└── src/
    ├── __pycache__/
    │
    ├── components/
    │   ├── __init__.py
    │   └── ragchatbot.py         # RAG + Groq LLM logic
    │
    ├── datatransformer/
    │   ├── __init__.py
    │   ├── webdatatransfer.py    # Website text extraction
    │   ├── textdatatransfer.py   # Text & PDF text splitting
    │   └── pdfdatatransfer.py    # (Optional PDF logic)
    │
    └── utils/
        ├── __init__.py
        ├── dataembedding.py      # Embedding creation
        └── dataingestion.py      # Website ingestion logic
```
```
git clone https://github.com/kumar-kiran-24/chatbot
cd chatbot
pip install -r requirements.txt
streamlit run app.py
```

Create a `.env` file in the project root containing your Groq API key:

```
GROQ_API_KEY=your_groq_api_key_here
```
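The key can be loaded at startup before the Groq client is created. A minimal sketch using only the standard library (the project may instead rely on `python-dotenv`'s `load_dotenv()`):

```python
import os

def load_env(path=".env"):
    """Parse simple KEY=value lines from a .env file into os.environ.

    Existing environment variables are not overwritten, so values set in
    the deployment environment (e.g. Streamlit Cloud secrets) take priority.
    """
    try:
        with open(path) as f:
            for line in f:
                line = line.strip()
                if line and not line.startswith("#") and "=" in line:
                    key, _, value = line.partition("=")
                    os.environ.setdefault(key.strip(), value.strip())
    except FileNotFoundError:
        pass  # fall back to variables already set in the environment

load_env()
api_key = os.getenv("GROQ_API_KEY")  # None if the key is not configured
```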