This repository contains three datasets designed for evaluating and benchmarking large language models (LLMs) on legal question answering (QA) and related tasks.
The first is a synthetic QA dataset built from real legal documents: each question–answer pair is synthesized from publicly available legal texts.
- QA Pairs: Each example includes a question, an answer, and a link to the source document.
- Source Documents: PDF files of the source legal texts
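A single QA pair from this dataset might look like the following sketch. The field names (`question`, `answer`, `source_url`) and the values are illustrative assumptions, not the dataset's documented schema:

```python
import json

# Illustrative shape of one synthetic QA example. The field names
# and contents below are assumptions for demonstration only.
example = {
    "question": "What is the statute of limitations for breach of contract?",
    "answer": "It varies by jurisdiction; many U.S. states allow four to six years.",
    "source_url": "https://example.org/legal-doc.pdf",  # hypothetical link
}

print(json.dumps(example, indent=2))
```

In practice, check the actual files in the repository for the real field names before writing any loading code.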
The second is a smaller version of the LegalBench dataset, reformatted for easier benchmarking.
- Tasks: 129 legal reasoning tasks
- Examples: 10 per task
- Format: Reformatted for benchmark compatibility
- License: Includes only tasks with reuse-friendly licenses
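With 129 tasks and 10 examples each, the full set contains 1,290 examples. A minimal way to verify per-task counts is sketched below; the directory layout (`<root>/<task_name>/examples.jsonl`) is an assumption for illustration, not the dataset's documented structure:

```python
import json
import tempfile
from pathlib import Path

def count_examples(root: Path) -> int:
    """Count non-empty JSONL lines across all task folders under root.

    Assumes a hypothetical layout of one folder per task, each holding
    an examples.jsonl file; adapt to the repository's actual layout.
    """
    total = 0
    for task_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        examples_file = task_dir / "examples.jsonl"
        if examples_file.is_file():
            with examples_file.open() as f:
                total += sum(1 for line in f if line.strip())
    return total

# Demo with a tiny synthetic layout: 2 tasks x 10 examples each.
root = Path(tempfile.mkdtemp())
for task in ("task_a", "task_b"):
    task_dir = root / task
    task_dir.mkdir()
    with (task_dir / "examples.jsonl").open("w") as f:
        for i in range(10):
            f.write(json.dumps({"input": f"q{i}", "label": "yes"}) + "\n")

print(count_examples(root))  # 2 tasks x 10 = 20; the full set would give 1290
```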
The third is a subset of the LegalBench-RAG dataset, focused on retrieval-augmented QA tasks.
- QA Pairs: 200 examples
- Domains: ContractNLI, CUAD, MAUD, PrivacyQA
- Format: CSV files with prompt, expected answer, and reference context
- Use Case: Useful for testing legal document QA with retrieval-based context
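Reading one of these CSV files could look like the minimal sketch below. The column names used here (`prompt`, `expected_answer`, `context`) are assumptions based on the description above; check the actual file headers before relying on them:

```python
import csv
import io

# In-memory stand-in for one of the CSV files. The column names and
# row contents are assumptions for demonstration, not the real data.
sample = io.StringIO(
    "prompt,expected_answer,context\n"
    '"Does the NDA survive termination?","Yes","Section 7: obligations survive termination."\n'
)

rows = list(csv.DictReader(sample))
for row in rows:
    print(row["prompt"], "->", row["expected_answer"])
```

For real files, replace the `io.StringIO` stand-in with `open("path/to/file.csv", newline="")`.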
These datasets support legal QA evaluation, model training, and benchmarking. Please refer to each dataset's source license for usage terms.