-
Notifications
You must be signed in to change notification settings - Fork 28
Open
Description
Hi, thank you so much for releasing code for this inspiring work. I noticed in lit_gt/config.py, for model "GatedDeltaNet_1.3B", n_head is set to 16. However, it seems like this value is never passed when constructing the key module: GatedDeltaNet(hidden_size=config.n_embd), so n_head is effectively 9.
I'm wondering would you mind sharingthe exact config of this model? Thank you very much for your time and help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels