deepseek update callback #868

Draft
denys-fridman wants to merge 16 commits into mlcommons:master from denys-fridman:dfridman/deepseek-update-callback

Conversation

@denys-fridman
Contributor

@denys-fridman denys-fridman commented Mar 2, 2026

This MR:

  • Removes redundant logging
  • Uses an official release tag for Megatron-Bridge instead of a commit
  • Adds Megatron-Bridge patches instead of cherry-picking in the Dockerfile
  • Fixes the script for converting HF checkpoint to Megatron-LM

None of the changes affect convergence.

@denys-fridman denys-fridman requested a review from a team as a code owner March 2, 2026 15:01
@github-actions

github-actions bot commented Mar 2, 2026

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

@denys-fridman denys-fridman marked this pull request as draft March 2, 2026 15:02
@ShriyaRishab
Contributor

@denys-fridman Can you provide a description of this change and explain whether it impacts convergence, accuracy, or RCPs?

@denys-fridman
Contributor Author

@ShriyaRishab Added the description. It's still a Draft and is being tested.

@ShriyaRishab
Contributor

@denys-fridman Is this ready?
