MedicalRAG.py

A multimodal Retrieval-Augmented Generation (RAG) pipeline for analyzing veterinary medical documents, with a specific focus on dog health. The script combines Google Gemini language models, the unstructured document parser, and a FAISS vector store to answer questions about a document and summarize its contents.

Features

PDF Parsing: Extracts text, tables, and images from PDF files using the unstructured library.
Summarization: Uses Google Gemini LLMs to summarize each extracted element (text, table, and image).
Image Analysis: Encodes extracted images and passes them to Gemini's vision model to generate content descriptions.
Vector Store: Stores all summaries and metadata in a FAISS vector database for efficient similarity search and retrieval.
Question Answering: Answers user queries about dog health by retrieving relevant context from the vector database and providing multimodal responses (text, tables, and images).
Colab Integration: Designed for seamless use in Google Colab, including API key management via Colab’s userdata.
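
The Vector Store and Question Answering features rest on one idea: search runs over the summaries, while the original element and its type travel along as metadata. The sketch below illustrates that idea in plain Python. It is not the script's code: the bag-of-words `embed` function and the brute-force `SummaryIndex` class are hypothetical stand-ins for a real embedding model and for FAISS, and the sample records are made up for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words vector; a real pipeline would call an embedding model."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class SummaryIndex:
    """Searches over summaries but returns the original element and its type."""

    def __init__(self):
        self.records = []

    def add(self, summary, original, kind):
        self.records.append(
            {"vector": embed(summary), "summary": summary,
             "original": original, "kind": kind}
        )

    def search(self, query, k=2):
        q = embed(query)
        ranked = sorted(
            self.records, key=lambda r: cosine(q, r["vector"]), reverse=True
        )
        return ranked[:k]


# Hypothetical sample data: one record per element type.
index = SummaryIndex()
index.add("Tartar is hardened plaque on the teeth.", "<full text chunk>", "text")
index.add("Table of periodontal disease stages.", "<table html>", "table")
index.add("Photo of a dog toothbrush.", "<base64 image>", "image")

top = index.search("what is tartar", k=1)[0]
print(top["kind"], "->", top["summary"])
```

Keeping the element type next to each summary is what lets the answer step treat text, tables, and images differently after retrieval.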

Dependencies

This project relies on the following key libraries and tools:

Python Libraries:
- LangChain
- Google Generative AI
- FAISS
- unstructured

System Dependencies:
- Tesseract OCR
- Poppler Utils
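
A quick way to confirm these dependencies are present before running the pipeline is a small check like the one below. This is a sketch under assumptions: the import names `langchain`, `faiss`, and `unstructured`, and the executable names `tesseract` and `pdftoppm` (one of the tools shipped with Poppler Utils), are the usual ones but may differ in your environment. The Google Generative AI package is left out of the list because its import name depends on which client library the script uses.

```python
import importlib.util
import shutil


def missing_dependencies(modules, binaries):
    """Return the names from each list that cannot be found in this environment."""
    missing_modules = [m for m in modules if importlib.util.find_spec(m) is None]
    missing_binaries = [b for b in binaries if shutil.which(b) is None]
    return missing_modules, missing_binaries


# Assumed top-level import names and executables; adjust to your setup.
modules = ["langchain", "faiss", "unstructured"]
binaries = ["tesseract", "pdftoppm"]  # pdftoppm is part of Poppler Utils

missing_modules, missing_binaries = missing_dependencies(modules, binaries)
print("Missing Python modules:", missing_modules or "none")
print("Missing system tools:", missing_binaries or "none")
```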

Usage

This script is designed to be used in a Google Colab environment.
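
Because the script expects Colab, the Gemini API key is typically read from Colab's secrets. The helper below is a minimal sketch of that lookup with a fallback to an environment variable for runs outside Colab. The function name `load_api_key` and the secret name `GOOGLE_API_KEY` are assumptions for illustration; use whatever name the secret is stored under.

```python
import os


def load_api_key(name="GOOGLE_API_KEY"):
    """Read a secret from Colab's userdata, falling back to an environment variable."""
    try:
        from google.colab import userdata  # only importable inside Colab
    except ImportError:
        userdata = None

    if userdata is not None:
        try:
            return userdata.get(name)
        except Exception:
            # Secret missing or notebook access not granted; try the environment.
            pass

    key = os.environ.get(name)
    if not key:
        raise RuntimeError(f"{name} is not set in Colab secrets or the environment")
    return key
```

Failing early with a clear error is a deliberate choice here: a missing key otherwise tends to surface later as an opaque authentication failure in the summarization step.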

Install Dependencies: Ensure all required Python libraries and system dependencies (Tesseract, Poppler) are installed in your environment.
Extract Elements: The script will automatically extract text, tables, and images from a specified veterinary PDF.
Summarize and Store: Each extracted element is summarized using Gemini models and stored in a FAISS vector store.
Ask Questions: You can then ask questions about the document. The script retrieves the most relevant context and generates a detailed, evidence-based answer.
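
The last step, turning retrieved context into a question for the model, can be sketched as follows. This is an illustration rather than the script's actual prompt: the function `build_prompt`, the `kind` and `summary` keys, and the prompt wording are all hypothetical. It shows only the text side of the prompt; passing image data to a vision model is not covered.

```python
def build_prompt(question, retrieved):
    """Assemble a text prompt from retrieved items grouped by element type."""
    sections = {"text": [], "table": [], "image": []}
    for item in retrieved:
        sections.setdefault(item["kind"], []).append(item["summary"])

    parts = ["Answer the question using only the context below."]
    for kind, summaries in sections.items():
        if summaries:  # skip element types with nothing retrieved
            parts.append(f"{kind.upper()} CONTEXT:")
            parts.extend(f"- {s}" for s in summaries)
    parts.append(f"QUESTION: {question}")
    return "\n".join(parts)


# Hypothetical retrieval results for illustration.
retrieved = [
    {"kind": "text", "summary": "Tartar is hardened plaque on the teeth."},
    {"kind": "image", "summary": "Photo of tartar on a canine tooth."},
]
print(build_prompt("What is tartar?", retrieved))
```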

Example

Given a veterinary PDF, you can ask a question like:

"What is tartar?"

The script retrieves the relevant text, tables, and images and returns a detailed, model-generated answer. It can also display annotated images to support its response.

Contributing

For any suggestions or issues, please open an issue in the repository.

