# Benchmark Script for local models #19

**Summary**  
Add `scripts/benchmark.py` to compare local Hugging Face (HF) models on a fixed corpus of diffs.

**Scope**  
- Inputs: a small corpus of representative diffs (JSONL).  
- Outputs: CSV with `model, tokens_in, tokens_out, latency_ms, provider_device`.  
- Optional: save a plot to `docs/benchmarks/`.
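
A minimal sketch of the input and output shapes listed above, using only the standard library. The JSONL field name `diff`, the helper names `load_corpus` and `write_results`, and the sample values are assumptions for illustration; only the CSV column names come from this issue.

```python
import csv
import io
import json

# Column names taken from the Scope list in this issue.
CSV_FIELDS = ["model", "tokens_in", "tokens_out", "latency_ms", "provider_device"]


def load_corpus(lines):
    """Parse JSONL lines into dicts, skipping blank lines.

    Assumes one JSON object per line with a "diff" field (hypothetical schema).
    """
    return [json.loads(line) for line in lines if line.strip()]


def write_results(rows, out):
    """Write result dicts to a file-like object as CSV with a header row."""
    writer = csv.DictWriter(out, fieldnames=CSV_FIELDS)
    writer.writeheader()
    writer.writerows(rows)


# Illustrative data only, not real measurements.
sample_jsonl = ['{"diff": "--- a/x.py\\n+++ b/x.py\\n+print(1)"}', ""]
corpus = load_corpus(sample_jsonl)

buffer = io.StringIO()
write_results(
    [{"model": "example-model", "tokens_in": 12, "tokens_out": 5,
      "latency_ms": 41.7, "provider_device": "cpu"}],
    buffer,
)
csv_text = buffer.getvalue()
```

In the real script, `lines` would likely come from an open corpus file and `out` from an open CSV file; the in-memory buffer just keeps the sketch self-contained.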

**Tasks**
- [ ] Implement benchmark harness with warmup and N runs.  
- [ ] Detect the device (CUDA/MPS/CPU) and any memory optimization flags in use.  
- [ ] Add sample corpus + instructions.  
- [ ] Document results format in README.
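
One possible shape for the first two tasks, offered as a sketch rather than the intended implementation. The function names `detect_device` and `benchmark`, the default `warmup` and `runs` values, and the stand-in `fake_generate` are all assumptions. Device detection assumes PyTorch and falls back to `"cpu"` when it is not installed; memory optimization flags are not covered here.

```python
import statistics
import time


def detect_device():
    """Return "cuda", "mps", or "cpu"; falls back to "cpu" if torch is missing."""
    try:
        import torch
    except ImportError:
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    # torch.backends.mps only exists in newer PyTorch builds, so guard the lookup.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


def benchmark(fn, inputs, warmup=1, runs=3):
    """Time fn on every input after `warmup` untimed passes over the inputs.

    Returns per-call latencies in milliseconds (runs * len(inputs) entries).
    """
    for _ in range(warmup):
        for item in inputs:
            fn(item)
    latencies_ms = []
    for _ in range(runs):
        for item in inputs:
            start = time.perf_counter()
            fn(item)
            latencies_ms.append((time.perf_counter() - start) * 1000.0)
    return latencies_ms


# Hypothetical stand-in for a real model call; it only records its inputs.
calls = []


def fake_generate(diff):
    calls.append(diff)
    return diff.upper()


latencies = benchmark(fake_generate, ["diff one", "diff two"], warmup=1, runs=3)
device = detect_device()
median_ms = statistics.median(latencies)
```

Because CUDA work is launched asynchronously, a real harness would probably need to synchronize the device (for example with `torch.cuda.synchronize()`) before reading the clock, otherwise the measured latency may understate the actual generation time.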

**Acceptance criteria**
- [ ] Running the script produces a CSV and a short summary line per model.  
- [ ] Works without cloud keys (local models only by default).
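
The first criterion could be met with something like the following. The exact wording of the summary line is not specified in this issue, so the format and the `summary_line` helper are assumptions, and the numbers are illustrative rather than measured.

```python
import statistics


def summary_line(model, latencies_ms, device):
    """Format a one-line summary for a model (hypothetical format)."""
    median = statistics.median(latencies_ms)
    return f"{model}: median {median:.1f} ms over {len(latencies_ms)} runs on {device}"


# Illustrative values only.
line = summary_line("example-model", [40.0, 42.0, 44.0], "cpu")
print(line)  # example-model: median 42.0 ms over 3 runs on cpu
```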


