Thank you for sharing the code. I'd like to ask a question regarding the MHSA module. In the original text, the output description seems to suggest attention weights, but why does the code directly output the result of V multiplied by attention?
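To make the distinction in the question concrete, here is a minimal single-head sketch in plain Python. It is not the repository's MHSA code: the helper names `softmax`, `matmul`, and `attention` and the toy inputs are hypothetical, chosen only for illustration. It computes both quantities being compared: the attention weights, softmax(QK^T / sqrt(d_k)), and the product of those weights with V, which is what standard scaled dot-product attention returns as its output.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def attention(q, k, v):
    """Return (weights, output) for single-head scaled dot-product attention."""
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]          # transpose K
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]
    weights = [softmax(row) for row in scores]    # shape: (n_queries, n_keys)
    output = matmul(weights, v)                   # shape: (n_queries, d_v)
    return weights, output

# Toy input (arbitrary values): 2 tokens, d_k = 2, d_v = 3
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
weights, output = attention(q, k, v)
```

In this sketch `weights` has one row per query and one column per key, with each row summing to 1, while `output` has the feature dimension of V. The two are different tensors with different shapes, so a description that mentions "attention weights" and code that returns the weighted values are referring to different things.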
