about the hyperparameters

Hi ! I'm interesting in your awesome work and I want to reproduce the results in the paper. However, it seems that the hyperpaprameters(eg. the BASE_LR=0.0000125) in Base-DiffusionInst.yaml was changed and is not the same with the value mention in the paper (with a learning rate of 2.5e-5 and a weight decay of 1e-4.) Since I only have one RTX 3090 with 24G RAM, can you give me some advice about the  choice of learning rate and decay coefficient？Besides, does the schedule 5x in the paper refer to increasing the number of training iters by five times?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

about the hyperparameters #19

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

about the hyperparameters #19

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions