Hi, Thanks for sharing the nice work!
I’m curious about the differences between the pretrained models
For example,
BiFuseV2.pth : trained on Matterport3D-all
BiFuseV2_SpatialAudioGen.tph : trained on Matterport3D-all + tuned on SpatialAudioGen-all (p)
BiFuseV2_st3d.pth : trained on Matterport3D-all + tuned on Structured3D-all (p)
Is that correct?