Provider Database for SmarterRouter

Builds provider.db - a comprehensive benchmark database that powers intelligent model routing in SmarterRouter.

This project automatically collects benchmark scores from 36+ sources, applies intelligent heuristics for uncovered models, and produces a SQLite database with 100% coverage of all OpenRouter models. The database integrates seamlessly with SmarterRouter's RouterEngine to enable automatic model selection based on reasoning, coding, and general knowledge capabilities.

✨ Features

100% Model Coverage - All 436+ OpenRouter models have benchmark data
Real + Estimated Scores - 74% real benchmarks + 26% intelligent heuristics
SmarterRouter Compatible - Exact schema match: model_id, reasoning_score, coding_score, general_score, elo_rating
Dynamic Authority Weighting - Sources weighted by tier (Tier 1: 1.0, Tier 2: 0.9, Tier 3: 0.8) with consensus-based outlier penalization
Security Hardened - SQL injection protection, input validation, rate limiting
Vision/Tool Keyword Support - Automatic alias generation for capability detection
Configuration Driven - YAML configuration for API endpoints, rate limits, heuristics
Structured Logging - Comprehensive logging with operation tracking
Docker Ready - Single-command build with all dependencies included
Automated Scheduling - Crontab examples for twice-daily updates (10am/10pm EST)
Production Grade - 64 passing tests, validation commands, health checks

🚀 Quick Start

# Build Docker image
docker build -t provider-db-builder .

# Generate database (set OPENROUTER_API_KEY for full model list)
mkdir -p data
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest
# Or with API key: docker run --rm -v $(pwd)/data:/app/data -e OPENROUTER_API_KEY=sk-xxx provider-db-builder:latest

# Validate for SmarterRouter
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest validate

# Alternative using Docker Compose
docker compose build
docker compose run provider-db
# With API key: OPENROUTER_API_KEY=sk-xxx docker compose run provider-db

📊 Coverage

Metric	Value
Total OpenRouter Models	436
With Real Benchmarks	324 (74%)
With Heuristics	112 (26%)
ELO Range	1010-1505
Score Range	0-100 (all validated)
Test Coverage	64 tests passing
Security Fixes	SQL injection, input validation, rate limiting

🔧 CLI Commands

Local Installation

python -m router.provider_db build      # Build database
python -m router.provider_db stats      # Show statistics
python -m router.provider_db health     # Check health
python -m router.provider_db validate   # SmarterRouter compatibility
python -m router.provider_db inspect <model_id>  # View model scores

Docker Commands

# Build database (set OPENROUTER_API_KEY for full model list)
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest
# Or with API key: docker run --rm -v $(pwd)/data:/app/data -e OPENROUTER_API_KEY=sk-xxx provider-db-builder:latest

# Validate database
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest validate

# Show statistics
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest stats

# Check health
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest health

# Inspect specific model
docker run --rm -v $(pwd)/data:/app/data provider-db-builder:latest inspect <model_id>

# Using Docker Compose
docker compose build
docker compose run provider-db              # Build database
# With API key: OPENROUTER_API_KEY=sk-xxx docker compose run provider-db
docker compose run provider-db validate     # Validate
docker compose run provider-db inspect <model_id>  # Inspect

Helper Script

# Make script executable
chmod +x docker-build.sh

# Build image and run database build (set OPENROUTER_API_KEY for full model list)
./docker-build.sh build ./data
# With API key: OPENROUTER_API_KEY=sk-xxx ./docker-build.sh build ./data

# Open interactive shell in container
./docker-build.sh shell

# Show help
./docker-build.sh help

📦 What's Inside

40+ Benchmark Sources including:

LMSYS Chatbot Arena (ELO)
LiveBench (reasoning - combined sources)
BigCodeBench, EvalPlus (coding)
MMLU, MMLU-Pro (general knowledge)
SWE-bench, Aider, HumanEval, LiveCodeBench (coding variants)
GSM8K, ARC, BBH, MathVista, AGIEval (reasoning variants)
FrontierMath, AIME, SciCode (advanced math/coding)
MEGA-Bench, MixEval-X (multimodal)
Chinese benchmarks (C-Eval, C-MMLU, Chinese reasoning, Chinese coding, Chinese ELO)
Tool Use, Vision
TruthfulQA, HellaSwag (commonsense/factual)
GPQA, StateVal (domain-specific)
AILuminate, Domain-Specific, HELM
Hendrycks MATH (advanced math reasoning)
HellaSwag (commonsense reasoning)
TruthfulQA (factual accuracy)
Safety benchmarks (harmful content refusal)
Multilingual benchmarks (C-Eval, C-MMLU)

Smart Heuristics:

45+ provider baselines (OpenAI, Anthropic, Meta, etc.)
Size-based modifiers (405B, 70B, 30B, 13B, 7B, 3B, 1B)
Variant detection (reasoning, coding, vision, search, guard)
Category detection (search engines, research models, safety models)

Security & Reliability:

SQL injection protection with parameterized queries
Input validation and sanitization
Rate limiting for API calls
Critical source error handling
Structured logging and monitoring

🔄 Integration with SmarterRouter

Build provider.db using Docker
Copy to SmarterRouter's data directory
SmarterRouter automatically queries benchmark scores
Router applies formula: (benchmark × 1.5 × Q) + (elo × 1.0 × Q) + (profile × 0.8 × Q) + (inference × 0.4 × Q)
Models selected based on capability + hardware profile + user preferences

🏗️ Architecture

┌─────────────────┐     ┌──────────────────┐     ┌─────────────────┐
│  OpenRouter API │────▶│  33 Benchmark    │────▶│   Heuristic     │
│  (model list)   │     │   Sources        │     │   Estimator    │└─────────────────┘     └──────────────────┘     └─────────────────┘
                                                              │
                                                              ▼
┌─────────────────┐     ┌──────────────────┐     ┌─────────────────┐
│ SmarterRouter   │◀────│  provider.db     │◀────│   Builder      │
│  RouterEngine   │     │  (SQLite)        │     │   (Docker)     │
└─────────────────┘     └──────────────────┘     └─────────────────┘

📁 Repository Structure

provider/
├── Dockerfile                  # Builds provider-db-builder image
├── requirements.txt            # Python dependencies
├── config.yaml                 # Configuration file (API endpoints, rate limits)
├── data/                       # Output directory for provider.db (generated)
├── router/provider_db/
│   ├── builder.py              # Main orchestration
│   ├── database.py             # SQLite operations (security hardened)
│   ├── models.py               # Pydantic schemas
│   ├── model_mapper.py         # Name → canonical ID
│   ├── cli.py                  # Command-line interface
│   ├── utils.py                # Utilities (rate limiting, validation)
│   ├── logging_config.py       # Structured logging configuration
│   ├── sources/                # 33 benchmark fetchers
│   └── tests/                  # Test suite (64 tests)
└── docs/                       # Documentation
    ├── DEEPDIVE.md             # Technical deep dive (consolidated)
    └── CRONTAB_EXAMPLE.txt     # Automation examples

🧪 Testing

pytest router/provider_db/tests/ -v
# 64 tests passing (55 original + 9 security/feature tests)

# Run security-focused tests
pytest router/provider_db/tests/test_fixes.py -v

📝 License

MIT - see LICENSE file

🔒 Security

See SECURITY.md for security policy and vulnerability reporting.

🤝 Contributing

See CONTRIBUTING.md for guidelines.

Perfect for: SmarterRouter users, AI infrastructure teams, and anyone wanting automatic model selection based on capability metrics. The database is generated twice daily and committed to GitHub for easy consumption.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
data		data
docs		docs
generated_fetchers		generated_fetchers
research		research
router/provider_db		router/provider_db
.dockerignore		.dockerignore
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
config.yaml		config.yaml
docker-build.sh		docker-build.sh
docker-compose.yml		docker-compose.yml
models.html		models.html
provider.db		provider.db
requirements.txt		requirements.txt
research_benchmark_search.py		research_benchmark_search.py
research_benchmark_search_v2.py		research_benchmark_search_v2.py
research_datasets.py		research_datasets.py
test_api_connectivity.py		test_api_connectivity.py
test_builder.py		test_builder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Provider Database for SmarterRouter

✨ Features

🚀 Quick Start

📊 Coverage

🔧 CLI Commands

Local Installation

Docker Commands

Helper Script

📦 What's Inside

🔄 Integration with SmarterRouter

🏗️ Architecture

📁 Repository Structure

🧪 Testing

📝 License

🔒 Security

🤝 Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Provider Database for SmarterRouter

✨ Features

🚀 Quick Start

📊 Coverage

🔧 CLI Commands

Local Installation

Docker Commands

Helper Script

📦 What's Inside

🔄 Integration with SmarterRouter

🏗️ Architecture

📁 Repository Structure

🧪 Testing

📝 License

🔒 Security

🤝 Contributing

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages