The training loss for the generation task seems to be converging very slowly. After 80,000 steps, the loss is still around 1.4. To help me verify if my training is on the right track, would it be possible for you to share the loss curve from your own experiments about model SphereAR-B?
