thanks for your work, I use the scripts to finetune SVD, I don't konw if it can directly use scheduler.add_noise to create noise latent, than use unet to denoise, and I use one video to overfit test the scripts, I find all is welll, except the color, the overfit model generate video color is deeper than the input model, I don't know why