question about  concentration around a prototype

In the paper, you have mentioned "With the proposed φ, the similarity in a loose cluster (larger φ) are down-scaled,
pulling embeddings closer to the prototype", but i am wondering why the  down-scaled similarity can force them get closer? 
Could you please explain it more detailedly? Thanks!