End-to-end AI evals orchestration platform for comparing LLM outputs across providers with transcription, structured logging, human review, and Supabase-backed decision tracking.
Updated Mar 10, 2026 - TypeScript
Production-grade Safe GenAI Agent Orchestrator with intent routing, hallucination guard, tool orchestration, evaluation pipeline, and multi-screen Next.js dashboard.