How to add custom models in NeMo? #13158
Unanswered
gagangayari asked this question in Q&A
Replies: 1 comment
Hi, thanks for your question! Many adjustments to the architecture can be made by tweaking the model's GPTConfig. Using MLA requires swapping out MCore's |
I want to experiment with different model components, e.g. plugging in MLA instead of MHA, or modifying the forward method. I guess this can be specified via MCoreSpec. Could somebody please help me with the complete flow for adding such a customized model and using it for training? If there is a recipe for this, that would be a great help. Thanks!
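For anyone reading along, here is a rough, library-free sketch of the general "config carries a layer spec" pattern that the question and reply both point at. It does not call NeMo or Megatron Core: `ToyGPTConfig`, `LayerSpec`, `MultiHeadAttention`, `MultiLatentAttention`, `latent_dim`, and `build_layers` are all hypothetical stand-ins made up for illustration, and the real GPTConfig and MCore spec objects may look quite different.

```python
# Illustrative only: every class and field here is a hypothetical stand-in,
# not an actual NeMo or Megatron Core API.
from dataclasses import dataclass, field, replace


class MultiHeadAttention:
    """Placeholder for a standard MHA block."""

    def __init__(self, hidden_size: int, num_heads: int):
        self.hidden_size = hidden_size
        self.num_heads = num_heads


class MultiLatentAttention:
    """Placeholder for an MLA block; latent_dim is an assumed extra parameter."""

    def __init__(self, hidden_size: int, num_heads: int, latent_dim: int = 64):
        self.hidden_size = hidden_size
        self.num_heads = num_heads
        self.latent_dim = latent_dim


@dataclass(frozen=True)
class LayerSpec:
    """Names which attention class to build and with which extra kwargs."""

    attention_cls: type = MultiHeadAttention
    attention_kwargs: dict = field(default_factory=dict)


@dataclass(frozen=True)
class ToyGPTConfig:
    """Toy config: architecture sizes plus the spec used to build each layer."""

    num_layers: int = 2
    hidden_size: int = 256
    num_heads: int = 4
    layer_spec: LayerSpec = field(default_factory=LayerSpec)


def build_layers(config: ToyGPTConfig) -> list:
    """Instantiate one attention module per layer from the config's spec."""
    spec = config.layer_spec
    return [
        spec.attention_cls(config.hidden_size, config.num_heads, **spec.attention_kwargs)
        for _ in range(config.num_layers)
    ]


# Default config builds MHA; swapping the spec builds MLA without touching build_layers.
base_config = ToyGPTConfig()
mla_config = replace(
    base_config,
    layer_spec=LayerSpec(MultiLatentAttention, {"latent_dim": 32}),
)

base_layers = build_layers(base_config)
mla_layers = build_layers(mla_config)
print(type(base_layers[0]).__name__, type(mla_layers[0]).__name__)
```

The point of the pattern is that the model-building code stays the same and only the spec changes, so a custom attention class (or one with a modified forward method) can be dropped in through the config rather than by editing the model code.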