generated from amazon-archives/__template_MIT-0
-
Notifications
You must be signed in to change notification settings - Fork 175
Pull requests: awslabs/awsome-distributed-training
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Instance Compatibility Framework — multi-instance profiles and documentation for all test cases
#1015
opened Mar 11, 2026 by
nkumaraws
Loading…
Add NCCL send/recv ring benchmark for multi-GPU testing
#1013
opened Mar 10, 2026 by
paulogallotti
Loading…
feat: add script to delete specific HyperPod nodes and sync Terraform state
#1011
opened Mar 10, 2026 by
paragao
Loading…
Add NeMo RL GRPO training with fault tolerance (NVRx) on EKS
#1010
opened Mar 9, 2026 by
dmvevents
Loading…
6 tasks
Add DeepSpeed 103B GPT pretraining benchmark and standardize containers for B200
#1009
opened Mar 6, 2026 by
paragao
Loading…
Add optional Training Plan support for HyperPod instance groups
#1004
opened Feb 26, 2026 by
newabdosheham
Loading…
Syntax improvements and code quality enhancements for EFA node exporter
#966
opened Feb 17, 2026 by
KeitaW
Loading…
ProTip!
What’s not been updated in a month: updated:<2026-02-11.