Hi! It's a nice job!
It seems use data augmentation for nuscenes datasets, maintaining 7724 samples (total batch sizes=16), which is more than the original number of nuscenes samples.
I want to know the gpu nums you used and how long to pretrain the swin-base Transformer.
Looking forward to your reply! Thank you!