Skip to content

grad become nan in first iter update #1

@MingChaoXu

Description

@MingChaoXu

image

after my first backword, the grad of shared model parameter become nan, what's the reason?

backword in this place:
https://github.com/brianlan/complex-grad-norm/blob/master/src/gradnorm.py#L90

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions