With semi-frequent eculid restarts it would be good if we can restart training from a checkpoint.
I think it used to work, and maybe I have deactivated it at some point. It looks like checkpoints are saved but not used when starting training.
@ctorney could you take a look? Branch master.