Questions about Training loss curve

The training loss for the generation task seems to be converging very slowly: after 80,000 steps it is still around 1.4. To help me verify that my training is on the right track, could you share the loss curve from your own SphereAR-B experiments?

<img width="1098" height="682" alt="Image" src="https://github.com/user-attachments/assets/1fee6683-4913-4768-95b1-1e4536a1a59e" />

Questions about Training loss curve #2
