As I understand it, you take the following steps:
- Fine-tune DreamBooth on all frames
- Fine-tune the AnimateDiff motion module
- Extract the ControlNet inputs from the video
- Combine them for inference
Can you share the fine-tuning hyperparameters?
- Do you fine-tune a LoRA or use DreamBooth? How many steps, and what learning rate?
- Do you fine-tune the whole module or only selected layers?
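
To make the question concrete, here is a rough sketch of the values I am asking about, one record per fine-tuning stage. The class and field names (`FinetuneConfig`, `max_train_steps`, `trainable_modules`, and so on) are my own placeholders, not taken from your code, and every value is left as `None` because I do not know it.

```python
from dataclasses import dataclass, asdict
from typing import Optional, Tuple


@dataclass
class FinetuneConfig:
    """Hypothetical record of the hyperparameters asked about above."""
    method: Optional[str] = None              # e.g. "lora" or "dreambooth"
    max_train_steps: Optional[int] = None     # number of optimizer steps
    learning_rate: Optional[float] = None     # lr
    trainable_modules: Optional[Tuple[str, ...]] = None  # whole module or selected layers


def missing_fields(cfg: FinetuneConfig) -> list:
    """Return the names of fields that are still unknown (None)."""
    return [name for name, value in asdict(cfg).items() if value is None]


# One config per stage; both are unknown to me at the moment.
appearance_stage = FinetuneConfig()   # fine-tuning on the frames
motion_stage = FinetuneConfig()       # fine-tuning the motion module

print("appearance stage, still unknown:", missing_fields(appearance_stage))
print("motion stage, still unknown:", missing_fields(motion_stage))
```

Filling in these fields for both stages would be enough for me to try to reproduce your results.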