Skip to content

about the hyperparameters #19

@Devil-Ideal

Description

@Devil-Ideal

Hi ! I'm interesting in your awesome work and I want to reproduce the results in the paper. However, it seems that the hyperpaprameters(eg. the BASE_LR=0.0000125) in Base-DiffusionInst.yaml was changed and is not the same with the value mention in the paper (with a learning rate of 2.5e-5 and a weight decay of 1e-4.) Since I only have one RTX 3090 with 24G RAM, can you give me some advice about the choice of learning rate and decay coefficient?Besides, does the schedule 5x in the paper refer to increasing the number of training iters by five times?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions