Fix GRPO weight-sync hangs and HCCL resource exhaustion on NPU #253

Triggered via pull request on March 5, 2026 at 07:03
Status: Cancelled
Total duration: 1d 0h 0m 2s

citest.yaml

on: pull_request

Annotations

1 error
unittest
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s