-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
area/benchmarkBenchmark methodology and automationBenchmark methodology and automationarea/perfCompiler/runtime performance workstreamsCompiler/runtime performance workstreamspriority/highHigh priorityHigh prioritytype/taskImplementation taskImplementation task
Description
Goal
Improve runtime competitiveness beyond single-kernel parity by expanding apples-to-apples coverage.
Scope
Add at least 8 additional kernel shapes, each with L0+C equivalents and a shared harness:
- integer arithmetic chain
- bitwise-heavy kernel
- branch-heavy kernel
- memory load/store roundtrip
- pointer arithmetic loop
- function call chain
- mixed arithmetic+branch kernel
- small struct/aggregate pass
Acceptance
- All kernels benchmarked in CI.
- Median-of-N reporting per kernel.
- Results page includes per-kernel winner and geometric mean.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
area/benchmarkBenchmark methodology and automationBenchmark methodology and automationarea/perfCompiler/runtime performance workstreamsCompiler/runtime performance workstreamspriority/highHigh priorityHigh prioritytype/taskImplementation taskImplementation task