VERIRAG: Verification-Enhanced Retrieval-Augmented Generation

A Self-Disagreement-Aware Retrieval-Augmented Generation system that detects conflicting evidence in documents and provides multi-view answers with calibrated confidence scores. Unlike traditional RAG systems that collapse disagreement into falsely confident responses, VERIRAG explicitly surfaces conflicts, quantifies uncertainty, and shows where sources disagree.

Features

Automatic Conflict Detection - Identifies contradictions at the claim level
Disagreement Graphs - Visualizations showing which claims support or conflict
Calibrated Confidence - Adjusts confidence (high/medium/low) based on source agreement
Multi-View Synthesis - Presents dominant view + alternative perspectives
Quantitative Metrics - Disagreement density, conflict ratio, entropy scores
Multi-Format Support - Works with PDF, DOCX, PPTX, TXT, MD, CSV
Dual-Model Architecture - Fast extraction (Local Llama3) + quality synthesis (Local Phi3 Mini)
Transparent Reasoning - See exactly why the system is confident or uncertain

Conflict Detection Process

Claim Extraction: Breaks down each retrieved document chunk into atomic factual claims
Pairwise Analysis: Compares claims to detect entailment, contradiction, or neutrality
Graph Construction: Builds a network where nodes are claims and edges represent relationships
Metric Computation: Calculates disagreement density, conflict ratio, and confidence entropy
Confidence Calibration: Adjusts system confidence based on quantitative disagreement metrics

System Architecture

Query → Vector Retrieval (Top-K Chunks, via LlamaIndex) → Claim Extraction (via Llama3) → Relationship Analysis (via Llama3) → Disagreement Graph Construction (NetworkX + Metrics) → Multi-View Synthesis (via Phi3:mini) → Calibrated Answer + Visualizations

Use Cases

Research & Academia: Detect conflicting findings, verify claims, and surface research gaps
Education & Study: Highlight disputed concepts to focus on deeper conceptual understanding
Legal & Medical: Identify precedent splits, guideline disagreements, and decision risk
Journalism & Verification: Cross-check sources, flag contradictions, and expose framing bias

How to Run

Make sure you have Python 3.8+ installed.
Clone this repository on your local machine.
Install the required dependencies:

pip install llama-index llama-index-llms-ollama llama-index-embeddings-huggingface sentence-transformers numpy pandas matplotlib seaborn networkx scikit-learn pypdf python-pptx python-docx nltk transformers torch

Set up the Corpus folder in the project directory, add the relevant files to this folder; the system will be working with this folder. Name the folder 'documents'.
Install Ollama & set up llama3:latest & phi3:mini.
Open and run the cells of the VERIRAG System.ipynb Jupyter Notebook, ask the relevant queries.

Contributing

Contributions are welcome!

License

Distributed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.md		README.md
VERIRAG System.ipynb		VERIRAG System.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VERIRAG: Verification-Enhanced Retrieval-Augmented Generation

Features

Conflict Detection Process

System Architecture

Use Cases

How to Run

Contributing

License

About

Uh oh!

Releases

Packages

Languages

License

krishang118/VERIRAG

Folders and files

Latest commit

History

Repository files navigation

VERIRAG: Verification-Enhanced Retrieval-Augmented Generation

Features

Conflict Detection Process

System Architecture

Use Cases

How to Run

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages