Evaluate the performance of LLMs for Q&A in any domain.
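A minimal sketch of what such domain-agnostic Q&A evaluation might look like: score a model's answers against references with exact match and token-level F1 (the standard lexical Q&A metrics). Here `ask_model` and the `(question, reference)` pair format are assumptions, not this project's actual API.

```python
from collections import Counter

def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer."""
    pred, ref = prediction.lower().split(), reference.lower().split()
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(pred), overlap / len(ref)
    return 2 * precision * recall / (precision + recall)

def evaluate(qa_pairs, ask_model):
    """Average exact-match and F1 over (question, reference_answer) pairs."""
    em = f1 = 0.0
    for question, reference in qa_pairs:
        answer = ask_model(question)  # hypothetical LLM client
        em += float(answer.strip().lower() == reference.strip().lower())
        f1 += token_f1(answer, reference)
    n = len(qa_pairs)
    return {"exact_match": em / n, "f1": f1 / n}
```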
Automated search framework for rubric-based reward modeling. Features Evolutionary RTD (population search with elitism and successive halving) and an Iterative RTD baseline. Supports tail-focused objectives, multi-role LLM backends, and rank-based preferences.
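A hedged sketch of the evolutionary loop the description names: a population search over candidate rubrics, where successive halving spends cheap evaluations on the full pool and doubles the budget only for survivors, and elitism carries the best rubrics forward unchanged. `score_rubric` and `mutate` are hypothetical stand-ins; the framework's real interfaces may differ.

```python
import random

def evolve(seed_rubrics, score_rubric, mutate,
           generations=5, elite_frac=0.25, halving_rounds=3, base_budget=8):
    population = list(seed_rubrics)
    for _ in range(generations):
        # Successive halving: evaluate everyone on a small budget,
        # keep the top half, re-score survivors with twice the budget.
        candidates, budget = population, base_budget
        for _ in range(halving_rounds):
            scored = sorted(candidates,
                            key=lambda r: score_rubric(r, budget),
                            reverse=True)
            candidates = scored[: max(1, len(scored) // 2)]
            budget *= 2
        # Elitism: the best survivors pass through unchanged; the rest
        # of the population is refilled with mutations of the elites.
        elites = candidates[: max(1, int(len(population) * elite_frac))]
        population = elites + [mutate(random.choice(elites))
                               for _ in range(len(population) - len(elites))]
    return population[0]
```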
AI-powered student assignment evaluator written in Rust. Supports code, PDF, and DOCX files. Uses local or remote LLMs to grade submissions based on configurable criteria, and exports results to Excel.
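The project itself is written in Rust; the sketch below illustrates the same pipeline in Python (to keep one language across these examples): read a submission, ask an LLM to score each configurable criterion, and export the results to Excel via openpyxl. `call_llm` and the criteria/weights format are assumptions for illustration only.

```python
from pathlib import Path
from openpyxl import Workbook

# Assumed criteria format: name -> maximum points.
CRITERIA = {"correctness": 50, "style": 25, "documentation": 25}

def grade_submission(path: Path, call_llm) -> dict:
    # Plain read works for code files; PDF/DOCX would need text extraction.
    text = path.read_text(errors="replace")
    scores = {}
    for criterion, max_points in CRITERIA.items():
        prompt = (f"Grade this submission on '{criterion}' from 0 to "
                  f"{max_points}. Reply with a number only.\n\n{text}")
        scores[criterion] = float(call_llm(prompt))  # local or remote LLM
    return scores

def export_to_excel(results: dict, out: str = "results.xlsx") -> None:
    """Write one row per student: name followed by per-criterion scores."""
    wb = Workbook()
    ws = wb.active
    ws.append(["student", *CRITERIA])
    for student, scores in results.items():
        ws.append([student, *(scores[c] for c in CRITERIA)])
    wb.save(out)
```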