fix(utils): add GPU warm-up for profiling by YutongJau · Pull Request #62 · aliyun/aicb

YutongJau · 2026-02-02T19:36:46Z

Motivation

The first execution of measure_model includes significant overhead from CUDA context initialization and lazy kernel loading, which heavily skews the profiling data. For example, the Emb layer was measured at roughly 35,000 ms on a cold run versus a stable 550 ms after warm-up. These skewed measurements become inaccurate inputs for the AIOB simulator.

Changes

Implemented a 10-step warm-up loop in utils/utils.py to ensure the GPU is fully initialized before profiling.
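
For readers who have not seen the diff, the warm-up pattern described above can be sketched roughly as follows. This is not the code from utils/utils.py: the function name `time_with_warmup`, its parameters, and the `_default_sync` helper are hypothetical and used only for illustration. It assumes the profiled work can be wrapped in a zero-argument callable, and it uses `torch.cuda.synchronize()` only when PyTorch and a CUDA device are present.

```python
import time
from typing import Callable, List, Optional

try:  # torch is optional so the sketch also runs on CPU-only machines
    import torch
except ImportError:
    torch = None


def _default_sync() -> None:
    """Block until queued GPU work finishes; does nothing without CUDA."""
    if torch is not None and torch.cuda.is_available():
        torch.cuda.synchronize()


def time_with_warmup(
    step: Callable[[], object],
    warmup_steps: int = 10,   # matches the 10-step warm-up in this PR
    timed_steps: int = 5,     # hypothetical default, not from the PR
    sync: Optional[Callable[[], None]] = None,
) -> List[float]:
    """Run `step` untimed `warmup_steps` times, then time `timed_steps` runs (ms)."""
    sync = sync or _default_sync

    # Warm-up: absorbs one-time costs such as CUDA context creation
    # and lazy kernel loading, so they do not land in the measurements.
    for _ in range(warmup_steps):
        step()
    sync()

    timings_ms: List[float] = []
    for _ in range(timed_steps):
        start = time.perf_counter()
        step()
        sync()  # wait for asynchronous GPU work before stopping the clock
        timings_ms.append((time.perf_counter() - start) * 1000.0)
    return timings_ms
```

Synchronizing before reading the clock matters because CUDA kernel launches are asynchronous; without it, the timer may stop before the GPU work has finished.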

Impact

Eliminates cold-start outliers and improves profiling accuracy.

Verification (Environment: NVIDIA RTX 4090):

Metric (Unit: ms)	No Warm-up	10-step Warm-up	Status
Emb	35,479.4	550.0	Stabilized
Param	4,974.6	1,496.5	Stabilized
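
As a quick sanity check on the table, the reduction factor for each metric can be computed directly from the reported numbers. The dictionary and variable names below are illustrative only; the values are copied from the table above.

```python
# Reported timings in milliseconds: (no warm-up, 10-step warm-up)
reported_ms = {
    "Emb": (35479.4, 550.0),
    "Param": (4974.6, 1496.5),
}

# Ratio of the cold measurement to the warmed-up measurement
speedups = {name: cold / warm for name, (cold, warm) in reported_ms.items()}

for name, ratio in speedups.items():
    print(f"{name}: {ratio:.1f}x lower after warm-up")
# Emb: 64.5x lower after warm-up
# Param: 3.3x lower after warm-up
```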

CLAassistant · 2026-02-02T19:36:53Z

All committers have signed the CLA.

fix(utils): add GPU warm-up for profiling

a5b5d6e

YutongJau wants to merge 1 commit into aliyun:master from YutongJau:fix/gpu-warmup
