
Training problem #11

@chenyinlin1


Hello, and thank you for your outstanding work. I am having trouble reproducing the training part of the code and am not getting the expected training results. With the original, unmodified code, all losses are NaN, as shown below.
image

I tried modifying the loss function slightly. The losses are no longer NaN, but it seems that no backpropagation is taking place.
image

All parameters are the default training parameters, except that batch_size was changed from 36 to 24.
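
For anyone trying to narrow this down, here is a minimal, framework-agnostic sketch of the two symptoms described above: a loss that is not finite, and parameters that do not change after an optimizer step. The name `diagnose_step` and its arguments are hypothetical and not part of this repository. The two parameter lists are assumed to be flattened snapshots of the model weights, converted to plain Python floats, taken just before and just after one training step.

```python
import math


def diagnose_step(loss, params_before, params_after, tol=0.0):
    """Classify a single training step from plain Python floats.

    loss          -- scalar loss value for the step (hypothetical input)
    params_before -- flattened parameter values before the optimizer step
    params_after  -- flattened parameter values after the optimizer step
    tol           -- minimum absolute change that counts as an update
    """
    # NaN and +/-inf both indicate a numerically broken loss.
    if not math.isfinite(loss):
        return "non-finite loss"

    # If no parameter moved, gradients are likely missing or all zero.
    changed = any(
        abs(after - before) > tol
        for before, after in zip(params_before, params_after)
    )
    return "ok" if changed else "no parameter update"


if __name__ == "__main__":
    print(diagnose_step(float("nan"), [1.0, 2.0], [1.0, 2.0]))
    print(diagnose_step(0.5, [1.0, 2.0], [1.0, 2.0]))
    print(diagnose_step(0.5, [1.0, 2.0], [0.9, 2.0]))
```

A result of "no parameter update" together with a finite loss would match the second screenshot. It may point to the modified loss being detached from the computation graph, though that is only a guess without seeing the change.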


Labels: bug, good first issue
