You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello author, may I ask why you want to elaborate on this statement in your paper? Why does the model need to use a class token instead of the average token and add x to the output in order for the model to work with the VIT backbone?