forked from pytorch/torchtitan
Pull requests: NousResearch/torchtitan
Pull requests list
fix: fix NaN loss in MoE models with CPU offload enabled #2247
#43 opened Jan 16, 2026 by xrsrke
Implement SimKO to add entropy in TopK token sampling during RL
#13 opened Oct 29, 2025 by ighoshsubho