Hi, I follow the notebook from https://colab.research.google.com/drive/1QKOtYP7bMpdgDJEipDxaJqOchv7oQ-_l#scrollTo=F0vEOd_nmpTR and find that training with the st model is 100x slower than expected. DataLoader takes up to 70 seconds to load a single batch instead of the expected sub-second performance shown in tutorials. ``` Epoch 0: 100%|██████████| 200/200 [56:16<00:00, 0.06it/s, v_num=0] ``` Has anyone experienced similar performance issues?