Curious about the question in the title. Did you run into an instability or inefficiency with F.normalize() that led you to use the other norm function instead?