Your paper states: "The parameters are initialized according to [12] and the weights of fully connected layer are using Xavier initialization [10]."
However, the PyTorch code for ImageNet uses only the default initialization for the fully connected layer, not Xavier initialization.
So, was Xavier initialization actually used for the fc layer on ImageNet?
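
To make the difference concrete, here is a minimal sketch of what I would expect if Xavier initialization were applied to the fc layer. It assumes the final classifier is a plain `nn.Linear`. The layer sizes (512 inputs, 1000 classes) and the variable names are illustrative only and are not taken from the repository.

```python
import math

import torch
import torch.nn as nn

torch.manual_seed(0)

# Illustrative sizes only (assumption): 512 features in, 1000 ImageNet classes out.
fan_in, fan_out = 512, 1000

# Left untouched, nn.Linear draws its weights from U(-1/sqrt(fan_in), 1/sqrt(fan_in)).
fc_default = nn.Linear(fan_in, fan_out)

# Xavier (Glorot) uniform draws from U(-b, b) with b = sqrt(6 / (fan_in + fan_out)).
fc_xavier = nn.Linear(fan_in, fan_out)
nn.init.xavier_uniform_(fc_xavier.weight)
nn.init.zeros_(fc_xavier.bias)  # zero bias is a common choice, assumed here

default_bound = 1.0 / math.sqrt(fan_in)
xavier_bound = math.sqrt(6.0 / (fan_in + fan_out))

print(f"default bound: {default_bound:.4f}, "
      f"max |w|: {fc_default.weight.abs().max().item():.4f}")
print(f"xavier bound:  {xavier_bound:.4f}, "
      f"max |w|: {fc_xavier.weight.abs().max().item():.4f}")
```

With these sizes the two schemes give visibly different weight ranges, so whichever one was used for the reported ImageNet results may matter when reproducing them.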