A cloud-native, LLM-powered query-retrieval system designed to perform contextual analysis on large, unstructured documents. Built for scalable, production-ready deployments.
- Retrieval-Augmented Generation (RAG) pipeline using `faiss-cpu` for efficient vector indexing and a `sentence-transformers` CrossEncoder for high-precision reranking (see the sketches after this list)
- Llama 3 LLM integration via the Groq API for final answer synthesis
- Fine-tuned, role-based system prompt for domain-specific query responses
- Lazy-loading pattern for large AI models to optimize memory and performance
- FastAPI backend with REST endpoints for document ingestion and query answering
- Docker-based deployment for reproducibility and scalability
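The retrieve-then-rerank flow could look like the minimal sketch below. The model names (`all-MiniLM-L6-v2`, `cross-encoder/ms-marco-MiniLM-L-6-v2`) are assumptions for illustration; the repository's actual choices live in `main.py`.

```python
import faiss
from sentence_transformers import SentenceTransformer, CrossEncoder

# Assumed model names, for illustration only.
embedder = SentenceTransformer("all-MiniLM-L6-v2")
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

chunks = ["clause one ...", "clause two ...", "clause three ..."]

# In-memory FAISS index over L2-normalized embeddings
# (inner product on unit vectors equals cosine similarity).
emb = embedder.encode(chunks, convert_to_numpy=True)
faiss.normalize_L2(emb)
index = faiss.IndexFlatIP(emb.shape[1])
index.add(emb)

def retrieve(query: str, k: int = 10, top_n: int = 3) -> list[str]:
    q = embedder.encode([query], convert_to_numpy=True)
    faiss.normalize_L2(q)
    _, ids = index.search(q, min(k, len(chunks)))
    candidates = [chunks[i] for i in ids[0]]
    # Cross-encoder scores every (query, chunk) pair for a precise final order.
    scores = reranker.predict([(query, c) for c in candidates])
    ranked = sorted(zip(scores, candidates), key=lambda p: p[0], reverse=True)
    return [chunk for _, chunk in ranked[:top_n]]
```

Normalizing embeddings and using an inner-product index is one common way to get cosine-similarity search out of FAISS; a flat index keeps everything in memory, which suits per-document, request-scoped indexes.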
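Answer synthesis through Groq might then follow this sketch. The chat-completions call shape matches the Groq Python SDK, but the model id and prompt text are placeholders:

```python
import os
from groq import Groq

client = Groq(api_key=os.environ["GROQ_API_KEY"])

def synthesize(query: str, context_chunks: list[str]) -> str:
    context = "\n\n".join(context_chunks)
    completion = client.chat.completions.create(
        model="llama3-70b-8192",  # assumed model id; check Groq's current model list
        messages=[
            # Role-based system prompt: keeps the model on-domain without
            # re-sending long instructions in every user turn.
            {"role": "system",
             "content": "You are a document analyst. Answer strictly from the provided context."},
            {"role": "user",
             "content": f"Context:\n{context}\n\nQuestion: {query}"},
        ],
    )
    return completion.choices[0].message.content
```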
The application is deployed on Railway, with interactive API documentation available via Swagger UI.
```
├── main.py              # FastAPI entry point
├── requirements.txt     # Dependencies
├── Dockerfile           # Container build setup
├── start.py / start.sh  # Application startup scripts
├── test_*.py            # API test scripts
└── deploy.sh / deploy.bat  # Deployment scripts
```
- Clone the repository

  ```bash
  git clone https://github.com/yourusername/intelligent-document-query-engine.git
  cd intelligent-document-query-engine
  ```

- Install dependencies

  ```bash
  pip install -r requirements.txt
  ```

- Set environment variables: create a `.env` file containing:

  ```
  GROQ_API_KEY=your_groq_api_key
  ```

- Run locally

  ```bash
  uvicorn main:app --reload
  ```
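Once the server is up, you can exercise the query API from Python. The endpoint path and payload below are hypothetical; the real contract is documented in the Swagger UI at `/docs`:

```python
import requests

# Hypothetical endpoint and payload shape: consult /docs for the real schema.
resp = requests.post(
    "http://localhost:8000/query",
    json={"question": "What is the notice period in this contract?"},
)
resp.raise_for_status()
print(resp.json())
```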
This project supports Railway deployment (Docker-based):

```bash
railway up
```

or locally via Docker:

```bash
docker build -t doc-query-engine .
docker run -p 8000:8000 doc-query-engine
```

- Lazy loading of embedding and reranker models (see the sketch after this list)
- FAISS in-memory index creation for efficient retrieval
- Batched query processing for speed
- Role-based prompts to reduce token usage
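A minimal sketch of the lazy-loading pattern, using `functools.lru_cache` as a load-once guard (model names are assumptions, as above):

```python
from functools import lru_cache

from sentence_transformers import SentenceTransformer, CrossEncoder

# Load-once guards: nothing is downloaded or held in memory until the
# first request actually needs a model, which keeps container startup fast.
@lru_cache(maxsize=1)
def get_embedder() -> SentenceTransformer:
    return SentenceTransformer("all-MiniLM-L6-v2")  # assumed model name

@lru_cache(maxsize=1)
def get_reranker() -> CrossEncoder:
    return CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")  # assumed
```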
- Backend: Python, FastAPI
- Vector Indexing: FAISS
- LLM API: Groq (Llama 3)
- Reranking: SentenceTransformers CrossEncoder
- Deployment: Docker, Railway