-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
How is the final MLP layer designed? The decoding generates a tensor of [batch,200,dim], do you use the view function to change it linearly after it becomes [batch,200×dim]? Or do you only make a linear change for the vector of that 200th dim dimension? Because it is the speed of generating the ith moment.
Metadata
Metadata
Assignees
Labels
No labels