Question about timestep select with win and lose

<img width="854" height="275" alt="Image" src="https://github.com/user-attachments/assets/8d72ea29-8ced-4a24-b9ae-8d7f4ca8625f" />

`index = torch.randint(
            0, self.num_ddim_timesteps, (x.shape[0],), device=self.device
        ).long()`

maybe the timestep with win and lose should be same.

The curve of my training process is shown below. Why is it that while my lose_diff is increasing, my win_diff is also increasing? In principle, shouldn't lose_diff continuously increase and win_diff continuously decrease

![Image](https://github.com/user-attachments/assets/ac1f5361-da99-45d7-b8b2-f000b97407d9)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about timestep select with win and lose #11

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Question about timestep select with win and lose #11

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions