Could you please help me out?
When I use --batch_num=32, I cannot run the code on a single GPU; my GPU is a Tesla P100-SXM2 16G, and I can only run the code with --batch_num=1. Since the inner loop calls torch.autograd.grad with create_graph=True and retain_graph=True, and M=10, GPU memory gets allocated too fast. It is also very tricky to run MAML on multiple GPUs.
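To illustrate, here is a minimal sketch of the inner-loop pattern I am referring to (the toy model, `inner_lr`, and `fast_weights` below are placeholders of mine, not the repo's actual code):

```python
import torch
import torch.nn as nn

# Toy setup -- a hypothetical model and task data, just to show the pattern.
model = nn.Linear(10, 1)
loss_fn = nn.MSELoss()
inner_lr = 0.01
M = 10  # inner-loop adaptation steps

x, y = torch.randn(32, 10), torch.randn(32, 1)

fast_weights = list(model.parameters())
for step in range(M):
    pred = x @ fast_weights[0].t() + fast_weights[1]
    loss = loss_fn(pred, y)
    # create_graph=True (which implies retain_graph=True) keeps each inner
    # step's graph alive so the meta-gradient can differentiate through it;
    # all M graphs accumulate on the GPU, so memory grows with both M and
    # the task batch size.
    grads = torch.autograd.grad(loss, fast_weights, create_graph=True)
    fast_weights = [w - inner_lr * g for w, g in zip(fast_weights, grads)]
```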
I am wondering: in your implementation, what batch size did you use, and how did you address the growing GPU memory allocation in the inner for loop? Did you use a single GPU or multiple GPUs?
Thanks a lot!