This repository was archived by the owner on May 1, 2025. It is now read-only.

Description
In the paper, you mention: "With the proposed φ, the similarity in a loose cluster (larger φ) are down-scaled,
pulling embeddings closer to the prototype". I am wondering why down-scaling the similarity forces the embeddings closer to the prototype.
Could you please explain this in more detail? Thanks!
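
To make the question concrete, here is a minimal sketch of what I think is being described. It assumes a softmax-style contrastive loss in which the similarity to each prototype is divided by that prototype's own φ; the function name `proto_nce_loss` and all the numbers are made up for illustration and are not taken from the paper or this repository. It compares the loss for the same raw similarities when the positive cluster is tight (small φ) versus loose (large φ).

```python
import math

def proto_nce_loss(sims, phis, pos):
    """Cross-entropy over prototypes with per-prototype scaling (assumed form).

    sims: similarities between one embedding and each prototype
    phis: concentration (temperature-like) value per prototype
    pos:  index of the prototype the embedding is assigned to
    """
    # Each similarity is divided by its own phi, so a larger phi shrinks the logit.
    logits = [s / p for s, p in zip(sims, phis)]
    # Numerically stable log-sum-exp.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[pos]

# Same raw similarities in both cases (illustrative values only).
sims = [0.8, 0.2, 0.1]

tight = proto_nce_loss(sims, [0.1, 0.2, 0.2], pos=0)  # small phi for the positive
loose = proto_nce_loss(sims, [0.4, 0.2, 0.2], pos=0)  # large phi for the positive

print(f"tight cluster loss: {tight:.4f}")
print(f"loose cluster loss: {loose:.4f}")
```

If this reading is right, the loose cluster ends up with the higher loss at the same similarity, so the embedding would have to move closer to its prototype to reach the same loss value. Is that the intended mechanism?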