Introduce compiler hints and pragmas to guide instruction scheduling,
loop unrolling, and memory access behavior.
The goal is to improve kernel performance without changing the underlying algorithms.
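
The kinds of hints meant here can be sketched in CUDA as follows. This is a minimal illustration, not a reference implementation: the kernel name `saxpy_hinted` and the constants `BLOCK_SIZE` and `ITEMS_PER_THREAD` are hypothetical, and whether any given hint actually helps depends on the compiler version and target GPU.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical tuning constants, chosen only for illustration.
constexpr int BLOCK_SIZE = 256;
constexpr int ITEMS_PER_THREAD = 4;

// __launch_bounds__ states the largest block size the kernel is launched with,
// which lets the compiler budget registers for that size.
// __restrict__ promises that x and y do not alias; together with const it may
// allow loads of x to go through the read-only data cache.
__global__ void __launch_bounds__(BLOCK_SIZE)
saxpy_hinted(int n, float a, const float* __restrict__ x, float* __restrict__ y)
{
    const int base = blockIdx.x * blockDim.x + threadIdx.x;
    const int stride = gridDim.x * blockDim.x;

    // The trip count is a compile-time constant, so this requests full unrolling.
    #pragma unroll
    for (int k = 0; k < ITEMS_PER_THREAD; ++k) {
        const int i = base + k * stride;  // neighbouring threads touch neighbouring elements
        if (i < n) {
            y[i] = a * x[i] + y[i];
        }
    }
}

int main()
{
    const int n = 1 << 20;
    float* x = nullptr;
    float* y = nullptr;
    cudaMallocManaged(&x, n * sizeof(float));
    cudaMallocManaged(&y, n * sizeof(float));
    for (int i = 0; i < n; ++i) { x[i] = 1.0f; y[i] = 2.0f; }

    // Each block covers BLOCK_SIZE * ITEMS_PER_THREAD elements.
    const int perBlock = BLOCK_SIZE * ITEMS_PER_THREAD;
    const int blocks = (n + perBlock - 1) / perBlock;
    saxpy_hinted<<<blocks, BLOCK_SIZE>>>(n, 3.0f, x, y);
    cudaDeviceSynchronize();

    // The algorithm is unchanged, so every element should equal 3 * 1 + 2.
    bool ok = true;
    for (int i = 0; i < n; ++i) {
        if (y[i] != 5.0f) { ok = false; break; }
    }
    printf("%s\n", ok ? "all elements match" : "mismatch");

    cudaFree(x);
    cudaFree(y);
    return ok ? 0 : 1;
}
```

The arithmetic is the same as in an unhinted version; only the information given to the compiler differs, which is the constraint stated above.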
Planned Benchmarks
- Instruction count changes between baseline and hinted kernels
- Occupancy impact of each hint
- Performance deltas relative to the unhinted baseline
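
One possible way to collect the occupancy and timing numbers is sketched below, assuming the CUDA runtime's occupancy query and event timers are acceptable measurement tools for this project. The kernels `scale_baseline` and `scale_unrolled` and the helper `report` are hypothetical. Instruction counts are not measured in this sketch; they could be compared by inspecting the generated SASS, for example with `cuobjdump -sass`.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical baseline: grid-stride loop with unrolling explicitly disabled.
__global__ void scale_baseline(int n, float a, float* data)
{
    const int stride = gridDim.x * blockDim.x;
    #pragma unroll 1
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n; i += stride) {
        data[i] *= a;
    }
}

// Hypothetical hinted variant: same work, with an unroll factor of 4 requested.
__global__ void scale_unrolled(int n, float a, float* __restrict__ data)
{
    const int stride = gridDim.x * blockDim.x;
    #pragma unroll 4
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n; i += stride) {
        data[i] *= a;
    }
}

// Prints theoretical occupancy and the elapsed time of one launch.
template <typename Kernel>
void report(const char* name, Kernel kernel, int n, float* d_data)
{
    const int blockSize = 256;  // arbitrary launch configuration for the sketch
    const int gridSize = 128;

    // Theoretical occupancy: resident threads per SM divided by the hardware maximum.
    int blocksPerSM = 0;
    cudaOccupancyMaxActiveBlocksPerMultiprocessor(&blocksPerSM, kernel, blockSize, 0);
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);
    const float occupancy =
        static_cast<float>(blocksPerSM * blockSize) / prop.maxThreadsPerMultiProcessor;

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    kernel<<<gridSize, blockSize>>>(n, 1.0f, d_data);  // warm-up launch, not timed
    cudaEventRecord(start);
    kernel<<<gridSize, blockSize>>>(n, 1.0f, d_data);  // timed launch
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("%-16s occupancy %.2f  time %.3f ms\n", name, occupancy, ms);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
}

int main()
{
    const int n = 1 << 24;
    float* d_data = nullptr;
    cudaMalloc(&d_data, n * sizeof(float));
    cudaMemset(d_data, 0, n * sizeof(float));

    report("scale_baseline", scale_baseline, n, d_data);
    report("scale_unrolled", scale_unrolled, n, d_data);

    cudaFree(d_data);
    return 0;
}
```

A single timed launch is noisy; a real benchmark would likely average many launches and record the GPU model and compiler version alongside the numbers.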
Learning Objectives
- Compiler behavior analysis
- Pragmas and attributes
- Micro-optimizations