An end-to-end Retrieval-Augmented Generation (RAG) system built to query complex insurance documents. Combines OpenAI embeddings, ChromaDB for vector search, a cross-encoder for reranking, and GPT-3.5 for grounded, context-aware answers.
- 📄 PDF-based policy ingestion with text + table extraction (`pdfplumber`)
- 🧱 Sliding-window chunking with metadata tagging
- 🔎 Semantic search using OpenAI `text-embedding-ada-002` embeddings
- 🧠 Cross-encoder reranking (`ms-marco/stsb-roberta-base`)
- 🤖 GPT-3.5-turbo response generation using retrieved chunks only
- 🔁 ChromaDB-powered persistent cache to reduce redundant calls
- 🛡️ Hallucination-resistant by enforcing grounded generation
- 📸 Screenshot support for search and answer stages (for reporting)
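The ingestion side (pdfplumber extraction of text and tables, then sliding-window chunking with metadata) can be sketched roughly as below. This is a minimal illustration: the function names, window size, overlap, and metadata fields are assumptions, not the exact code in `main_pipeline.ipynb`.

```python
# Minimal ingestion sketch -- names, window size, and metadata fields are illustrative.
import pdfplumber

def extract_pages(pdf_path):
    """Pull raw text (plus flattened tables) from each page of the policy PDF."""
    pages = []
    with pdfplumber.open(pdf_path) as pdf:
        for i, page in enumerate(pdf.pages):
            text = page.extract_text() or ""
            # Append any tables as pipe-delimited rows so they remain searchable text.
            for table in page.extract_tables():
                rows = [" | ".join(cell or "" for cell in row) for row in table]
                text += "\n" + "\n".join(rows)
            pages.append({"page": i + 1, "text": text})
    return pages

def sliding_window_chunks(pages, window=300, overlap=50):
    """Split page text into overlapping word windows, tagging each chunk with metadata."""
    chunks = []
    step = window - overlap
    for page in pages:
        words = page["text"].split()
        for start in range(0, max(len(words), 1), step):
            chunk_text = " ".join(words[start:start + window])
            if chunk_text.strip():
                chunks.append({
                    "text": chunk_text,
                    "metadata": {"page": page["page"], "start_word": start},
                })
    return chunks
```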
Project structure:

```
├── notebooks/
│   └── main_pipeline.ipynb        # End-to-end RAG pipeline
├── data/
│   └── Principal-Sample-Life-Insurance-Policy.pdf
├── screenshots/
│   ├── query1_search.png
│   ├── query1_answer.png
│   └── ...
├── RAG_Insurance_Chatbot_Documentation.docx
├── requirements.txt
└── README.md
```
Clone the repository and install the dependencies:

```bash
git clone https://github.com/yourusername/insurance-rag-chatbot.git
cd insurance-rag-chatbot
pip install -r requirements.txt
```

Set your OpenAI API key in your environment:

```bash
export OPENAI_API_KEY="your-api-key-here"
```

Open `main_pipeline.ipynb` and run each cell to:
- Load PDF
- Generate embeddings
- Query and rerank results
- Generate grounded answers
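The query path behind these steps (embed the question, retrieve candidates from ChromaDB, rerank with the cross-encoder, then have GPT-3.5-turbo answer from the retrieved chunks only) looks roughly like the sketch below. The collection name, persistence path, prompt wording, and exact cross-encoder checkpoint are illustrative assumptions rather than the notebook's code, and the ChromaDB-backed response cache is omitted for brevity.

```python
# Minimal retrieve -> rerank -> generate sketch (not the notebook's exact code).
import chromadb
from openai import OpenAI
from sentence_transformers import CrossEncoder

openai_client = OpenAI()  # reads OPENAI_API_KEY from the environment
chroma = chromadb.PersistentClient(path="chroma_store")            # assumed path
collection = chroma.get_or_create_collection("insurance_chunks")   # assumed name
reranker = CrossEncoder("cross-encoder/stsb-roberta-base")         # assumed checkpoint

def answer(query: str, top_k: int = 10, top_n: int = 3) -> str:
    # 1. Embed the query with the same model used for the document chunks.
    vector = openai_client.embeddings.create(
        model="text-embedding-ada-002", input=[query]
    ).data[0].embedding

    # 2. Semantic search: pull top_k candidate chunks from ChromaDB.
    results = collection.query(query_embeddings=[vector], n_results=top_k)
    candidates = results["documents"][0]

    # 3. Rerank candidates with the cross-encoder and keep the best top_n.
    scores = reranker.predict([(query, doc) for doc in candidates])
    ranked = [doc for _, doc in sorted(zip(scores, candidates), reverse=True)]
    context = "\n\n".join(ranked[:top_n])

    # 4. Grounded generation: the model is instructed to use the excerpts only.
    response = openai_client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {
                "role": "system",
                "content": "Answer using ONLY the provided policy excerpts. "
                           "If the answer is not in them, say so.",
            },
            {"role": "user", "content": f"Excerpts:\n{context}\n\nQuestion: {query}"},
        ],
    )
    return response.choices[0].message.content
```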
Example questions to try:

- “What is the claim process for nominees of the policy?”
- “How is the policy surrender value calculated, and when is it applicable?”
- “Can a member convert their group life insurance to an individual policy after termination?”
This project is licensed under the MIT License.