Mismatch shape when applying and merging a locon model to a SanaTransformer2DModel module

I have trained a locon model based on Efficient-Large-Model/Sana_1600M_1024px_diffusers that I want to apply and merge.
https://huggingface.co/docs/diffusers/en/api/models/sana_transformer2d

Here is a zero initialized rank 4 locon model (untrained)
[0.zip](https://github.com/user-attachments/files/18321301/0.zip)

I'm getting a mismatch size error (expecting [11200, 11200, 3, 3] but got [11200, 1, 3, 3]:
![image](https://github.com/user-attachments/assets/9f06571e-cf86-4379-907f-49a0dfdbfc9b)
![image](https://github.com/user-attachments/assets/242c3d19-469a-4cc6-8157-3e337e71c024)
![image](https://github.com/user-attachments/assets/04792639-1ee1-46c7-b666-3ea88d8b6d55)
![image](https://github.com/user-attachments/assets/1cdd02fa-f3b1-4aa7-aa15-ad99cd8be7bd)
![image](https://github.com/user-attachments/assets/89bac618-0fb7-47ad-ad1b-b22ceb33b849)

```
import torch
from diffusers import SanaTransformer2
from lycoris import create_lycoris_from_weights
step = '0'
transformer = SanaTransformer2DModel.from_pretrained("Efficient-Large-Model/Sana_1600M_1024px_diffusers", subfolder='transformer').to(dtype=torch.float16)
transformer.train(False)
with torch.no_grad():
    lycoris_net, weights = create_lycoris_from_weights(1.0, '0.safetensors', transformer)
    lycoris_net.train(False)
    for lora in lycoris_net.loras:
        lora = lora.to(dtype=torch.float16, device='cuda:1')
    lycoris_net.apply_to()
    lycoris_net.merge_to()

pipe = SanaPipeline.from_pretrained(
  "Efficient-Large-Model/Sana_1600M_1024px_diffusers",
  torch_dtype=torch.float16,
  variant='fp16',
  transformer=transformer,
  vae = None
)
```


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mismatch shape when applying and merging a locon model to a SanaTransformer2DModel module #230

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Mismatch shape when applying and merging a locon model to a SanaTransformer2DModel module #230

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions