Skip to content

fix ddp bug when --overlap-grad-reduce and --num-optim > 1#3693

Open
wplf wants to merge 3 commits intoNVIDIA:mainfrom
wplf:fix-ddp-bug-with-overlap
Open

fix ddp bug when --overlap-grad-reduce and --num-optim > 1#3693
wplf wants to merge 3 commits intoNVIDIA:mainfrom
wplf:fix-ddp-bug-with-overlap

Conversation

@wplf
Copy link
Member

@wplf wplf commented Mar 4, 2026

Thanks for zyeric's issue, you can see more details in #3670.

Bug reproduce and fix

image

Any member of core-adlr and core-nemo will be able to merge your PR.

@wplf wplf requested review from a team as code owners March 4, 2026 09:36
@copy-pr-bot
Copy link

copy-pr-bot bot commented Mar 4, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@svcnvidia-nemo-ci svcnvidia-nemo-ci requested a review from a team March 4, 2026 09:36
@wplf wplf marked this pull request as draft March 4, 2026 09:45
@wplf wplf closed this Mar 4, 2026
@wplf wplf reopened this Mar 4, 2026
@wplf wplf marked this pull request as ready for review March 4, 2026 14:23
@Phlip79 Phlip79 added the Final Review PR is in the "final review" stage label Mar 4, 2026
@wplf
Copy link
Member Author

wplf commented Mar 5, 2026

Hi @zyeric, Please review this PR and thank you for your contribution again.

@wplf
Copy link
Member Author

wplf commented Mar 5, 2026

/claude review

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Final Review PR is in the "final review" stage

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants