You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I carefully looked through the code, and I found that during the training stage, the attention mask of the teacher task (ref output) is None since you set the fix_attn_mask as false by default, which is a little bit strange to me.