-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
I followed the steps on my server with 120GB CPU-memory, but the model is said to have an OOM error and the training process is killed, unfortunately.
Before the processed is terminated, we found that the 120G CPU-memory is fully consumed but the GPU memory is not used at all.
I wonder how large the size of the CPU-memory is sufficient to run the model.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels