PDF Question-Answering System with CAG

A Python application that uses a Groq-hosted LLM, LangChain, and Cache-Augmented Generation (CAG) to process PDFs and answer questions about their content.

Features

PDF Processing: Extract and process text from PDF documents
Vector Embeddings: Create semantic embeddings for efficient document retrieval
Cache-Augmented Generation: Caching layer that keeps frequently accessed information available so repeated lookups are faster
Groq Integration: Fast LLM inference using Groq's API
Interactive Q&A: Ask natural language questions about your PDF content

Setup

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables: Create a .env file with your Groq API key:
```
GROQ_API_KEY=your_groq_api_key_here
```
Place your PDF file in the project directory and name it mypdf.pdf
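
Before the first run, it can help to confirm that both prerequisites are in place. The sketch below is a hypothetical helper, not part of the project: `load_env_file` and `check_setup` are names invented here, and it parses the .env file with the standard library only, since this README does not say which loader main.py uses.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Copy KEY=VALUE lines from a .env file into os.environ.

    Variables that are already set in the environment are left untouched.
    """
    env_path = Path(path)
    if not env_path.is_file():
        return
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and anything that is not KEY=VALUE.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip().strip('"').strip("'")
        os.environ.setdefault(key.strip(), value)


def check_setup(env_path=".env", pdf_path="mypdf.pdf"):
    """Return a list of human-readable setup problems (empty if none)."""
    load_env_file(env_path)
    problems = []
    if not os.environ.get("GROQ_API_KEY"):
        problems.append("GROQ_API_KEY is not set")
    if not Path(pdf_path).is_file():
        problems.append(f"{pdf_path} not found")
    return problems
```

Calling `check_setup()` from the project directory returns an empty list when the API key and mypdf.pdf are both present.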

Usage

Run the main application:
```
python main.py
```

Then ask questions about your PDF content interactively.
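
An interactive session of this kind usually comes down to a read-answer-print loop. The sketch below shows one possible shape. `run_qa_loop` and the `exit`/`quit` commands are assumptions made for illustration; the actual prompt and exit behavior of main.py are not documented here.

```python
def run_qa_loop(answer_fn, read=input, write=print):
    """Ask for questions until the user types 'exit' or 'quit', or input ends.

    answer_fn: any callable that maps a question string to an answer string.
    read/write: injectable so the loop can be driven without a terminal.
    Returns the number of questions that were answered.
    """
    answered = 0
    while True:
        try:
            question = read("Question (or 'exit'): ").strip()
        except (EOFError, KeyboardInterrupt):
            break
        if question.lower() in {"exit", "quit"}:
            break
        if not question:
            continue  # ignore empty input and prompt again
        write(answer_fn(question))
        answered += 1
    return answered
```

Passing `read` and `write` as parameters keeps the loop independent of the terminal, so the same function works in scripts and in automated checks.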

Project Structure

main.py - Main application entry point
pdf_processor.py - PDF text extraction and processing
cag_system.py - Cache-Augmented Generation implementation
vector_store.py - Vector database management
config.py - Configuration settings
requirements.txt - Python dependencies

How it Works

Document Ingestion: The PDF is processed and split into chunks
Embedding Creation: Text chunks are converted to vector embeddings
Vector Storage: Embeddings are stored in ChromaDB for fast retrieval
Caching Layer: CAG system caches frequently accessed information
Question Processing: User questions are embedded and matched against the stored chunk embeddings
Answer Generation: Relevant context is sent to Groq LLM for answer generation
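
The steps above can be sketched end to end with the standard library only. This is a minimal illustration, not the code in cag_system.py or vector_store.py: word counts stand in for real vector embeddings, a plain list stands in for ChromaDB, and `llm` is any callable standing in for the Groq request. The README does not say what the caching layer stores, so the sketch assumes it caches final answers keyed by the normalized question. All function and class names are hypothetical.

```python
import hashlib
import math
import re
from collections import Counter


def split_into_chunks(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
        start += step
    return chunks


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q_vec = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q_vec, embed(c)), reverse=True)
    return ranked[:k]


class AnswerCache:
    """Assumed caching policy: store answers keyed by the normalized question."""

    def __init__(self):
        self._store = {}
        self.hits = 0

    @staticmethod
    def _key(question):
        normalized = " ".join(question.lower().split())
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def get_or_compute(self, question, compute):
        key = self._key(question)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        answer = compute(question)
        self._store[key] = answer
        return answer


def answer_question(question, chunks, cache, llm, k=2):
    """Retrieve context, build a prompt, and call llm unless the answer is cached."""

    def compute(q):
        context = "\n---\n".join(retrieve(q, chunks, k))
        prompt = f"Answer using only this context:\n{context}\n\nQuestion: {q}"
        return llm(prompt)

    return cache.get_or_compute(question, compute)
```

With this policy, asking the same question twice (ignoring case and extra whitespace) calls `llm` only once; the second answer comes from `AnswerCache`.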

wish-team/hp-cag
