[Perf] Expand apples-to-apples kernel suite and scorecard

## Goal
Improve runtime competitiveness beyond single-kernel parity by expanding apples-to-apples benchmark coverage to more kernel shapes.

## Scope
Add at least 8 additional kernel shapes, each implemented in both L0 and C and run through a shared harness:
1. integer arithmetic chain
2. bitwise-heavy kernel
3. branch-heavy kernel
4. memory load/store roundtrip
5. pointer arithmetic loop
6. function call chain
7. mixed arithmetic+branch kernel
8. small struct/aggregate pass
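
To make the shared-harness idea concrete, here is a rough sketch of what the C side could look like for kernel 1. Everything in it is an assumption for illustration: the names `kernel_fn`, `kernel_entry`, `kernel_int_chain`, `time_kernel_ns`, and `g_sink` are hypothetical, the arithmetic in the kernel is arbitrary, and the L0 counterpart is not shown. It uses C11 `timespec_get` for portability; a monotonic clock would probably be a better choice in the real harness.

```c
#include <stdint.h>
#include <time.h>

/* Hypothetical common signature: every kernel takes an iteration count
 * and returns a checksum so the work cannot be optimized away. */
typedef uint64_t (*kernel_fn)(uint64_t iters);

/* Hypothetical registry entry so one loop can drive all kernels. */
typedef struct {
    const char *name;
    kernel_fn   fn;
} kernel_entry;

/* Volatile sink: storing the result here keeps the call observable. */
volatile uint64_t g_sink;

/* Example shape 1: integer arithmetic chain. Each step depends on the
 * previous value; unsigned overflow wraps, which is well defined in C. */
uint64_t kernel_int_chain(uint64_t iters) {
    uint64_t x = 1;
    for (uint64_t i = 0; i < iters; i++) {
        x = (x * 31u + i) - (x / 3u);
    }
    return x;
}

/* Time a single run of a kernel and return elapsed wall-clock nanoseconds. */
double time_kernel_ns(kernel_fn fn, uint64_t iters) {
    struct timespec t0, t1;
    timespec_get(&t0, TIME_UTC);
    g_sink = fn(iters);
    timespec_get(&t1, TIME_UTC);
    return (double)(t1.tv_sec - t0.tv_sec) * 1e9
         + (double)(t1.tv_nsec - t0.tv_nsec);
}
```

A driver would fill an array of `kernel_entry` values, call `time_kernel_ns` N times per entry, and hand the samples to the reporting step. Keeping one signature for all eight shapes is a design choice to keep the timing code identical across kernels and languages.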

## Acceptance
- [ ] All kernels benchmarked in CI.
- [ ] Median-of-N reporting per kernel.
- [ ] Results page includes per-kernel winner and geometric mean.

