Skip to content

Why certain parameters are in multiple places for Nemo 2.0 #13256

Answered by ashors1
paniabhisek asked this question in Q&A
Discussion options

You must be logged in to vote

Hi, thanks for the question, and apologies for the confusion here. These duplicate parameters essentially arise because NeMo handles precision exclusively through the mixed precision plugin, while PyTorch lightning and Megatron have slightly different ways of handling precision. Any precision-related settings in NeMo 2.0 should be specified from MegatronMixedPrecision directly. You can find some examples of various mixed precision plugins here.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@paniabhisek
Comment options

Answer selected by ashors1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants