When reproducing Table 3 from https://arxiv.org/pdf/2304.04512, everything went well until I ran python eval.py --dataset=disentangling -t. After fixing the dataset usage, it gave me 77.78% and 82.78% instead of the 43.27% and 71.93%, respectively, reported in Table 3.
What could be causing this discrepancy?
Thanks a lot for helping figure this out!