Fix incorrect projection dimension in PretransformConditioner #231

dariowsz · 2026-01-20T07:11:12Z

The Issue

PretransformConditioner was applying nn.Linear projection to the wrong dimension. Pretransform outputs are [batch, channels, time], but Linear operates on the last dimension, so it was projecting time instead of channels.

Fix

Added a transpose before projection to convert [batch, channels, time] → [batch, time, channels] so the Linear layer projects channels correctly. Output is now [batch, time, output_dim], matching other text-based conditioners.

Changes

Transpose latents before applying proj_out
Updated mask generation to use the correct dimension

fix: transpose latents before projection in PretransformConditioner

3fa63ee

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix incorrect projection dimension in PretransformConditioner #231

Fix incorrect projection dimension in PretransformConditioner #231

Uh oh!

dariowsz commented Jan 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Fix incorrect projection dimension in PretransformConditioner #231

Are you sure you want to change the base?

Fix incorrect projection dimension in PretransformConditioner #231

Uh oh!

Conversation

dariowsz commented Jan 20, 2026

The Issue

Fix

Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant