-
Notifications
You must be signed in to change notification settings - Fork 2
Academic advising #4
Copy link
Copy link
Open
Description
How is the final MLP layer designed? The decoding generates a tensor of [batch,200,dim], do you use the view function to change it linearly after it becomes [batch,200×dim]? Or do you only make a linear change for the vector of that 200th dim dimension? Because it is the speed of generating the ith moment.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels