-
Notifications
You must be signed in to change notification settings - Fork 8
docs: Update docs for Megatron Bridge checkpoint evaluation #388
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Abhishree <[email protected]>
Signed-off-by: Abhishree <[email protected]>
| ## Deploy | ||
|
|
||
| CHECKPOINT_PATH="/checkpoints/llama-3_2-1b-instruct_v2.0" | ||
| CHECKPOINT_PATH="/checkpoints/llama-3_2-1b-instruct_v2.0/iter_0000000" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we get this checkpoint from somewhere? The old one could be downloaded from NGC
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think its available yet on NGC, but let me ask around. But its possible to download from HF and convert to MBridge format. Maybe we can point to that in the meantime ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I think pointing to instructions on converting the model from HF would be sufficient
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added in 6fc329d
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Completed tech pubs review and provided a few copyedits and suggested text revisions.
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Co-authored-by: jgerh <[email protected]> Signed-off-by: Abhishree Thittenamane <[email protected]>
Signed-off-by: Abhishree <[email protected]>
Signed-off-by: Abhishree Thittenamane <[email protected]>
|
/ok to test a517f03 |
Signed-off-by: Abhishree Thittenamane <[email protected]>
Signed-off-by: Abhishree Thittenamane <[email protected]>
No description provided.