Thanks for the excellent work.
I have followed the readme to run python main.py --dataset cifar100 --num-labeled 50 --out Results --num-super 10 --arch wideresnet --lambda_oem 0.1 --lambda_socr 1.0 --batch-size 64 --lr 0.03 --expand-labels --seed 0 --mu 2
The final performance after epoch 512 is

Does the Table 1 include the best performance on test set or the mean of last 20 epochs?
If run with the amp, the result is

Is it normal?