whitening operation, which is implemented by a non-parametric layer normalization operator without scaling and bias
You mentioned that the whitening operation is non-parametric. But it seems you implemented it by norm operation from the original paper which is not non-parametric.
if self.feat_after_norm:
if 'CLIP' in self.pred_feat:
x_tgt = self.feature_model.visual.ln_post(x_tgt)