Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Normalize tool_calls and gate parser tool-calls to tool-enabled requests complexity: low Final Review PR is in the "final review" stage
#3710 opened Mar 4, 2026 by i-riyad Loading…
6 tasks
Claude to add complexity label complexity: low Expert Review Apply this label to indicate that your PR is ready for expert review.
#3709 opened Mar 4, 2026 by Phlip79 Loading…
6 tasks
Core 0.16
Extract the changes from Jorge's branch
#3701 opened Mar 4, 2026 by tdene Draft
6 tasks
Fix config.softmax_scale not being considered Final Review PR is in the "final review" stage
#3698 opened Mar 4, 2026 by janEbert Loading… Core 0.16
chore: CLI launch internal CI Expert Review Apply this label to indicate that your PR is ready for expert review.
#3695 opened Mar 4, 2026 by ko3n1g Loading…
6 tasks
Core 0.16
fix ddp bug when --overlap-grad-reduce and --num-optim > 1 Final Review PR is in the "final review" stage
#3693 opened Mar 4, 2026 by wplf Loading…
Modify mfsdp default data-parallel-sharding-strategy for dev
#3692 opened Mar 4, 2026 by wplf Loading…
6 tasks
Add Engram model structure integration (v1)
#3689 opened Mar 4, 2026 by ilml Draft
3 tasks
fix: skip FSDP DTensor boundary validation under fake process group Final Review PR is in the "final review" stage
#3686 opened Mar 4, 2026 by Victarry Loading…
6 tasks
Core 0.16
Add doc for layerwise distributed optimizer
#3682 opened Mar 3, 2026 by BoxiangW Draft
6 tasks
Improve error logging when invalid number of tokens is requested. complexity: low Expert Review Apply this label to indicate that your PR is ready for expert review.
#3680 opened Mar 3, 2026 by yobibyte Loading… Core 0.16
Fix split state dict main Final Review PR is in the "final review" stage
#3676 opened Mar 3, 2026 by kunlunl Loading…
6 tasks
Fix split_state_dict function for MoE models community-request Expert Review Apply this label to indicate that your PR is ready for expert review.
#3667 opened Mar 3, 2026 by eternally-z Loading…
6 tasks
Core 0.16
Minor inference changes for NemoRL
#3666 opened Mar 3, 2026 by ArEsKay3 Draft
6 tasks
ProTip! Exclude everything labeled bug with -label:bug.