A smart digital assistant that lets you "talk" to your PDF documents and get instant answers — without reading through every page.
🔗 Live Demo: pdf-rag-chatbot-five.vercel.app
- 📤 Upload any PDF — Support for single or multiple PDF documents
- 💬 Conversational Q&A — Ask questions in natural language and get context-aware answers
- 🔍 RAG-Powered Retrieval — Uses Retrieval-Augmented Generation to find the most relevant chunks before answering
- ⚡ Fast Vector Search — FAISS-powered semantic search for low-latency results
- 🧠 LLM Integration — Leverages large language models via LangChain for intelligent responses
- 🌐 Modern UI — Clean, responsive Next.js frontend
| Layer | Technology |
|---|---|
| Frontend | Next.js (TypeScript) |
| Backend | FastAPI (Python) |
| LLM | Groq (fast LLM inference) |
| Embeddings | Cohere Embeddings |
| AI Framework | LangChain |
| Vector DB | FAISS (Facebook AI Similarity Search) |
| Deployment | Vercel (frontend), Render (backend) |
```
pdf-chatbot/
├── backend/
│   ├── app/
│   │   ├── services/
│   │   │   └── ai_service.py   # RAG pipeline logic (embedding, retrieval, LLM)
│   │   ├── main.py             # FastAPI app entry point
│   │   ├── models.py           # Pydantic request/response models
│   │   └── routes.py           # API route definitions
│   ├── venv/                   # Python virtual environment
│   ├── .env                    # Environment variables (API keys)
│   ├── .gitignore
│   ├── requirements.txt        # Python dependencies
│   └── runtime.txt             # Python runtime version
│
└── frontend/
    ├── src/
    │   └── app/
    │       ├── globals.css
    │       ├── layout.tsx
    │       └── page.tsx        # Main UI page
    └── .gitignore
```
- Node.js >= 18
- Python >= 3.9
- A Groq API key for LLM inference
- A Cohere API key for embeddings
```bash
git clone https://github.com/Resham011/pdf-rag-chatbot.git
cd pdf-rag-chatbot
```

```bash
cd backend
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt
```

Create a `.env` file in the `backend/` directory:

```bash
GROQ_API_KEY=your_groq_api_key_here
COHERE_API_KEY=your_cohere_api_key_here
```

Start the FastAPI server:

```bash
uvicorn app.main:app --reload --port 8000
```

The backend will be available at http://localhost:8000.
```bash
cd frontend
npm install
```

Create a `.env.local` file in the `frontend/` directory:

```bash
NEXT_PUBLIC_API_URL=http://localhost:8000
```

Start the development server:

```bash
npm run dev
```

The app will be available at http://localhost:3000.
- Upload — User uploads a PDF via the frontend.
- Parse & Chunk — The backend extracts text and splits it into overlapping chunks.
- Embed — Each chunk is converted into a vector embedding using Cohere Embeddings.
- Index — Embeddings are stored in a FAISS vector index.
- Query — User asks a question; it is embedded (via Cohere) and the top-k most relevant chunks are retrieved.
- Generate — The retrieved context + question are sent to a Groq-powered LLM (via LangChain) to produce an accurate, grounded answer.
```
PDF Upload → Text Extraction → Chunking → Embedding → FAISS Index
                                                          ↓
User Question → Embedding → Similarity Search → Top-K Chunks → LLM → Answer
```
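The retrieval steps above can be sketched end to end without the real dependencies. In this toy sketch, a bag-of-words counter and brute-force cosine similarity stand in for Cohere embeddings and FAISS; `chunk`, `embed`, and `top_k` are illustrative names, not functions from this repo:

```python
# Dependency-free sketch of chunk → embed → index → retrieve.
# A word-count "embedding" replaces Cohere; a brute-force scan replaces FAISS.
import math
from collections import Counter

def chunk(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into overlapping character chunks."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def top_k(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Rank chunks by similarity to the question (what FAISS does at scale)."""
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

docs = ["FAISS stores vector embeddings", "Groq serves the LLM",
        "The PDF is split into chunks"]
print(top_k("where are embeddings stored", docs, k=1))
# → ['FAISS stores vector embeddings']
```

The retrieved chunks would then be concatenated into the LLM prompt together with the user's question, which is the "Generate" step above.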
| Method | Endpoint | Description |
|---|---|---|
| POST | `/upload` | Upload and process a PDF file |
| POST | `/ask` | Ask a question about the uploaded PDF |
| GET | `/health` | Health check |
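A hypothetical stdlib-only client for the `/ask` endpoint, assuming the backend is running at http://localhost:8000; the `question` request field is a guess, not confirmed from `routes.py`:

```python
# Sketch of a Python client for POST /ask (field names are assumptions).
import json
import urllib.request

API_URL = "http://localhost:8000"

def build_ask_payload(question: str) -> bytes:
    """Serialize the question as the JSON body for POST /ask."""
    return json.dumps({"question": question}).encode("utf-8")

def ask(question: str, api_url: str = API_URL) -> dict:
    """POST a question to /ask and return the parsed JSON response."""
    req = urllib.request.Request(
        f"{api_url}/ask",
        data=build_ask_payload(question),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())
```

Upload would work the same way but as a multipart form POST to `/upload`; check the FastAPI interactive docs at http://localhost:8000/docs for the exact schemas.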
- Frontend is deployed on Vercel.
- Backend is deployed on Render.
Make sure to set the appropriate environment variables in your deployment platform.
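For quick reference, the variables from the setup steps, split by platform (the Render URL below is a placeholder):

```
# Render (backend)
GROQ_API_KEY=your_groq_api_key_here
COHERE_API_KEY=your_cohere_api_key_here

# Vercel (frontend)
NEXT_PUBLIC_API_URL=https://<your-backend>.onrender.com
```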
Contributions, issues, and feature requests are welcome!
- Fork the repository
- Create a new branch: `git checkout -b feature/your-feature`
- Commit your changes: `git commit -m 'Add your feature'`
- Push to the branch: `git push origin feature/your-feature`
- Open a Pull Request
This project is open source. See the LICENSE file for details.
Resham011
GitHub: @Resham011