Bugfix: fix memory allocation issues during multi-GPU training #67

LittleNyima · 2026-01-14T10:57:20Z

During multi-GPU training, the current implementation loads the text encoders of every rank onto the same GPU, which causes CUDA OOM errors. The fix is to initialize the accelerator first and only then move the model to CUDA, so that each rank places its text encoder on its own device.
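For illustration, here is a minimal sketch of that ordering. It is not the diff from this PR. It assumes that "the accelerator" refers to Hugging Face Accelerate's `Accelerator`. The names `load_text_encoder`, `build_encoder`, and `main` are hypothetical, and `torch.nn.Linear` only stands in for the real text encoder.

```python
def load_text_encoder(accelerator, build_encoder):
    """Build the encoder, then move it to this rank's own device.

    `accelerator` must already be initialized, so that `accelerator.device`
    resolves to the device assigned to the current process rather than a
    bare "cuda" that every rank would share.
    """
    encoder = build_encoder()  # hypothetical factory; builds the model off-GPU
    return encoder.to(accelerator.device)


def main():
    # Imported lazily so the helper above can be read and tested without a GPU stack.
    import torch
    from accelerate import Accelerator

    # 1. Initialize the accelerator first: this binds the process to its device.
    accelerator = Accelerator()

    # 2. Only then load the model and place it on that device.
    #    torch.nn.Linear is a placeholder for the real text encoder (assumption).
    encoder = load_text_encoder(accelerator, lambda: torch.nn.Linear(8, 8))
    accelerator.print(f"text encoder placed on {accelerator.device}")
    return encoder
```

When run through a multi-process launcher such as `accelerate launch`, `main()` would execute once per rank, and each rank would move its copy of the encoder to its own `accelerator.device` rather than to a device shared by all ranks.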

Bugfix: fix memory allocation issues during multi-GPU training

45a2975

LittleNyima mentioned this pull request Jan 14, 2026

Training LoRA using FSDP config #57

Open
