Hi, I have a question. At inference time, must the input wavs be exactly 12 s long, or can they be any length?