difference between z and z_q is a lot for the provided pretrained model. When z_q is used for reconstruction the resultant audio becomes very different.
I wonder is it the problem due to code change for quantizer since 2024, or the model was not trained properly.
Thanks.