Skip to content

Use η0 / (1 + λ η0 t)^0.75 by default instead of ...^2/3. #12

@npinto

Description

@npinto

The learning rate has the form η0 / (1 + λ η0 t)^0.75 where λ is the regularization constant.

See:
http://leon.bottou.org/projects/sgd

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions