Thank you for the excellent work.
Is the code for Additive-MoF publicly available?
Alternatively, could you provide details on the implementation?
For example, do you L2 normalize both the CLIP features and DINO features before passing them through the adapter, and any other specific details would be greatly appreciated.