Implementation and exploration of different optimization algorithms.
One of the complaints about ADAM is that the adjusted learning rates (see the third math line in https://docs.google.com/presentation/d/1UFmncokDwUC4xLChHOfhe26MJucMyrjPjP9iLcKo30A/edit#slide=id.g5871948b3d_0_86) can still be extreme (very high or very low). Let's try to confirm this:
Modify your step_adam code to compute the smallest and largest adjusted rates across the network at every step, and plot them as curves (X axis is the step number, Y axis is the adjusted LR value). Plot two curves: one for the min and one for the max. What are your conclusions?
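Below is a minimal sketch of one possible way to do this, not the course's reference implementation. It assumes a NumPy-based step_adam that takes parameter and gradient dicts plus a state dict (the function signature, state layout, and helper names init_adam_state and plot_adjusted_lr are illustrative assumptions), and it treats the per-parameter adjusted rate as lr / (sqrt(v_hat) + eps), following the standard Adam update.

```python
# Sketch only: record the min/max per-parameter adjusted learning rate at each
# Adam step and plot both curves over time. Names and layout are assumptions.
import numpy as np
import matplotlib.pyplot as plt

def init_adam_state(params):
    return {"t": 0,
            "m": {k: np.zeros_like(v) for k, v in params.items()},
            "v": {k: np.zeros_like(v) for k, v in params.items()},
            "min_lr": [], "max_lr": []}

def step_adam(params, grads, state, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    state["t"] += 1
    t = state["t"]
    step_min, step_max = np.inf, -np.inf
    for name, g in grads.items():
        m = state["m"][name] = beta1 * state["m"][name] + (1 - beta1) * g
        v = state["v"][name] = beta2 * state["v"][name] + (1 - beta2) * g * g
        m_hat = m / (1 - beta1 ** t)              # bias-corrected first moment
        v_hat = v / (1 - beta2 ** t)              # bias-corrected second moment
        adjusted = lr / (np.sqrt(v_hat) + eps)    # per-parameter adjusted LR
        step_min = min(step_min, adjusted.min())
        step_max = max(step_max, adjusted.max())
        params[name] -= adjusted * m_hat
    state["min_lr"].append(step_min)
    state["max_lr"].append(step_max)

def plot_adjusted_lr(state):
    steps = range(1, len(state["min_lr"]) + 1)
    plt.plot(steps, state["min_lr"], label="min adjusted LR")
    plt.plot(steps, state["max_lr"], label="max adjusted LR")
    plt.xlabel("step")
    plt.ylabel("adjusted learning rate")
    plt.yscale("log")   # min and max can differ by orders of magnitude
    plt.legend()
    plt.show()
```

Calling plot_adjusted_lr(state) after training should show how far apart the two extremes drift; a log scale on the Y axis makes this easier to read if the gap spans several orders of magnitude.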