Open
Description
Hello all,
I'm seeking clarification on how Ithaca's evaluation/checkpointing system works.
From my understanding, the evaluate function should calculate the evaluation metrics and store the checkpoint's pickle file on disk. However, I'm uncertain about when this function is called.
Currently, when I execute the code, it only generates a log file containing the training loss and accuracy. The log does not include the validation loss, and no checkpoint is produced.
Also, when I try to run:
python3 experiment.py --config=config.py --jaxline_mode=eval --logtostderr
it says:
Checkpoint None invalid or already evaluated, waiting.
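The wording of that message suggests the eval mode polls for a saved checkpoint and waits when it finds none. Below is a minimal sketch of that polling pattern, to make the question concrete. It is not jaxline's or Ithaca's actual code: `latest_checkpoint`, `poll_once`, the `.pkl` naming, and the `step` key are all hypothetical names chosen for illustration.

```python
import os
import pickle


def latest_checkpoint(ckpt_dir):
    """Return the newest .pkl file in ckpt_dir, or None if there is none.

    Hypothetical helper: the real checkpoint layout may differ.
    """
    if not os.path.isdir(ckpt_dir):
        return None
    paths = [
        os.path.join(ckpt_dir, name)
        for name in os.listdir(ckpt_dir)
        if name.endswith(".pkl")
    ]
    return max(paths, key=os.path.getmtime) if paths else None


def poll_once(ckpt_dir, already_evaluated):
    """Check once for a checkpoint that has not been evaluated yet.

    Returns a status string. A real eval loop would call this repeatedly,
    sleeping between calls and computing metrics where the checkpoint is loaded.
    """
    path = latest_checkpoint(ckpt_dir)
    if path is None or path in already_evaluated:
        # Nothing new on disk: this is the "waiting" branch.
        return f"Checkpoint {path} invalid or already evaluated, waiting."
    with open(path, "rb") as f:
        state = pickle.load(f)  # assumed to be a dict with a "step" entry
    already_evaluated.add(path)
    return f"Evaluating checkpoint {path} saved at step {state['step']}"
```

In this sketch, calling `poll_once` on an empty directory returns the same text as the message above, because there is no checkpoint to evaluate until some other process has written one to disk.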
Thank you for your time and assistance.
Best regards,
Alessandro