-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Normalize tool_calls and gate parser tool-calls to tool-enabled requests
complexity: low
Final Review
PR is in the "final review" stage
#3710
opened Mar 4, 2026 by
i-riyad
Loading…
6 tasks
Claude to add complexity label
complexity: low
Expert Review
Apply this label to indicate that your PR is ready for expert review.
[Main][feat] Support CUDA Graph capture offloading modules
complexity: medium
enhancement
New feature or request
chore: CLI launch internal CI
Expert Review
Apply this label to indicate that your PR is ready for expert review.
fix ddp bug when --overlap-grad-reduce and --num-distributed-optimi for dev
#3694
opened Mar 4, 2026 by
wplf
Loading…
6 tasks
fix ddp bug when --overlap-grad-reduce and --num-optim > 1
Final Review
PR is in the "final review" stage
#3693
opened Mar 4, 2026 by
wplf
Loading…
Modify mfsdp default data-parallel-sharding-strategy for dev
#3692
opened Mar 4, 2026 by
wplf
Loading…
6 tasks
fix: skip FSDP DTensor boundary validation under fake process group
Final Review
PR is in the "final review" stage
Fix: Defensively close GPU device FDs in dataloader worker processes
#3684
opened Mar 4, 2026 by
hexinw-nvidia
•
Draft
Improve error logging when invalid number of tokens is requested.
complexity: low
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Fix split state dict main
Final Review
PR is in the "final review" stage
#3676
opened Mar 3, 2026 by
kunlunl
Loading…
6 tasks
refactor: migrate TransformerConfig validations to __post_init__ (Part of #3568)
community-request
Final Review
PR is in the "final review" stage
#3675
opened Mar 3, 2026 by
CodersAcademy006
Loading…
Enable DSA CP/absorbed/THD paths with TileLang fused ops
community-request
#3674
opened Mar 3, 2026 by
HollowMan6
•
Draft
6 tasks done
Fix split_state_dict function for MoE models
community-request
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.