ELS JUDGE

A production-ready code evaluation application. It uses a Textual-based Terminal User Interface (TUI) to evaluate and improve code using multiple large language models. The engine sends your code and instructions simultaneously to two different models and displays a side-by-side comparison of the improvements directly in your terminal.

Features

TUI-based Workflow: Easy-to-use terminal interface via Textual.
Parallel Judge Execution: AI models are executed in parallel via asyncio, speeding up evaluations without blocking.
Multi-Model Support: Configured to use two state-of-the-art models for diverse opinions:
- Primary: zai/glm-4.5-flash (ZhipuAI/Zai)
- Secondary: gemini/gemini-flash-latest (Google Gemini)

Technologies & Dependencies

Core Technologies

Python 3.11 - Programming language
Textual - Terminal User Interface (TUI) framework for interactive CLI
AsyncIO - Asynchronous programming for parallel model execution
Git - Version control with worktree and branch management
Docker - Containerization for deployment

Python Libraries & Frameworks

Pydantic - Data validation and schema management
Pydantic Settings - Configuration management with environment variable support
SQLAlchemy - Object-Relational Mapping (ORM) for database operations
LiteLLM - Unified interface for multiple LLM providers
psycopg2-binary - PostgreSQL database adapter
Rich - Terminal formatting, tables, and rich output rendering

AI Models

ZhipuAI GLM-4.5-Flash - Primary AI model for code analysis
Google Gemini Flash - Secondary AI model for comparative analysis

Infrastructure

PostgreSQL - Primary database (with SQLite option for development)
Git Worktrees - Isolated environments for model-specific changes
Environment Variables - Configuration management via .env files

Quick Start

1. Clone the repository

git clone https://github.com/codedbyelif/els-judge.git
cd els-judge

2. Configure API keys

Create a .env file in the project root and add your API keys:

ZAI_API_KEY=your_key...
ZHIPUAI_API_KEY=your_key...
GEMINI_API_KEY=AIzaSy...

3. Run Locally

Create a virtual environment and run the application:

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
bash start.sh

4. Run with Docker (Optional)

You can also run the application using Docker. Note that if you are using Docker, you still have to pass the API keys appropriately.

docker build -t els-judge .
docker run -it --env-file .env els-judge

(Note that the Docker image is configured to run ./start.sh automatically in the container.)

How to Use the TUI

Run the app with bash start.sh or python cli.py.
Enter the Target Files you want to modify (e.g., core/config.py engine/reviewers.py).
Write what you want improved in the "What should be improved?" input box.
Click "Analyze with 2 AI Models" or press CTRL+R.
Check out the consolidated Markdown report comparing the generated diffs on the right side of the screen.

Architecture Patterns

This project was inspired by Microsoft's open-source LLM-as-Judge framework.

Key Patterns

Pattern	Implementation	Explanation
Parallel Execution	`engine/dispatcher.py`	`asyncio.gather` is used to invoke all 2 LLMs simultaneously, keeping latency to a minimum.
Orchestrator Pattern	`engine/dispatcher.py`	A single entry point that manages submission, gathering, diff analyzing, and report generation.
Unified Results	`engine/aggregator.py` & `reporter`	Consolidates individual model suggestions into one cohesive, viewable Markdown summary.

Project Structure

ai-code-judge/
  cli.py               # Textual TUI entry point
  start.sh             # Branded launcher script
  Dockerfile           # Docker specification
  requirements.txt     # Python dependencies
  core/                # Settings, DB, git management
  engine/              # Litellm integration, diff analyzers, report generation
  models/              # Domain components (if any)
  schemas/             # Pydantic schemas for data validation

Built by codedbyelif | ELS JUDGE v1.0

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
core		core
engine		engine
models		models
schemas		schemas
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
ai-code-judge.code-workspace		ai-code-judge.code-workspace
cli.py		cli.py
output.log		output.log
package-lock.json		package-lock.json
patch_readme.py		patch_readme.py
requirements.txt		requirements.txt
start.sh		start.sh
tui.png		tui.png
tui2.png		tui2.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ELS JUDGE

Features

Technologies & Dependencies

Core Technologies

Python Libraries & Frameworks

AI Models

Infrastructure

Quick Start

1. Clone the repository

2. Configure API keys

3. Run Locally

4. Run with Docker (Optional)

How to Use the TUI

Architecture Patterns

Key Patterns

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ELS JUDGE

Features

Technologies & Dependencies

Core Technologies

Python Libraries & Frameworks

AI Models

Infrastructure

Quick Start

1. Clone the repository

2. Configure API keys

3. Run Locally

4. Run with Docker (Optional)

How to Use the TUI

Architecture Patterns

Key Patterns

Project Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages