[test] fix: repair CP packed SFT functional setup by yaoyu-33 · Pull Request #4374 · NVIDIA-NeMo/Megatron-Bridge

yaoyu-33 · 2026-06-15T23:23:27Z

Summary

Split the CP + packed SFT functional checkpoint setup fix out from PR [test] fix: register internal pytest marker #4359 for faster review.
Create a tiny pretrain checkpoint first, verify it, then use it as checkpoint.pretrained_checkpoint for the SFT run.
Keep checkpoint.load = None so this exercises the intended finetune-from-pretrained path instead of trying to resume.

Validation

git diff --check HEAD~1..HEAD
uv tool run ruff check tests/functional_tests/test_groups/training/test_seqpacking_cp_example.py
uv tool run ruff format --check tests/functional_tests/test_groups/training/test_seqpacking_cp_example.py
uv run --no-sync python -m py_compile tests/functional_tests/test_groups/training/test_seqpacking_cp_example.py
uv run --no-sync pre-commit run --files tests/functional_tests/test_groups/training/test_seqpacking_cp_example.py

Exact GPU functional execution is left to CI.

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>

copy-pr-bot · 2026-06-15T23:23:31Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

yaoyu-33 · 2026-06-15T23:23:55Z

/ok to test dc094b0

claude · 2026-06-15T23:25:21Z

Review

LGTM. The change correctly splits the test into a two-phase pretrain → finetune flow:

Creates a pretrain checkpoint via pretrain() with llama32_1b_pretrain_config
Verifies the pretrain checkpoint exists
Passes it as cfg.checkpoint.pretrained_checkpoint to the SFT step
Sets cfg.checkpoint.load = None so SFT exercises the finetune-from-pretrained path (not resume)

Imports, config fields, and function signatures all check out.

Suggested test cases: No perf tests impacted.

test: fix seqpacking CP SFT checkpoint setup

dc094b0

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>

yaoyu-33 added area:training Training loop, callbacks, and runtime integration bug Something isn't working needs-review PR is ready for code review and waiting on a reviewer labels Jun 15, 2026

copy-pr-bot Bot temporarily deployed to public June 15, 2026 23:24 Inactive

copy-pr-bot Bot temporarily deployed to test June 15, 2026 23:24 Inactive

yaoyu-33 merged commit 579f5c8 into main Jun 15, 2026
20 checks passed

yaoyu-33 deleted the yuya/seqpacking-cp-sft-checkpoint-fix branch June 15, 2026 23:25

copy-pr-bot Bot temporarily deployed to public June 15, 2026 23:34 Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[test] fix: repair CP packed SFT functional setup#4374

[test] fix: repair CP packed SFT functional setup#4374
yaoyu-33 merged 1 commit into
mainfrom
yuya/seqpacking-cp-sft-checkpoint-fix

yaoyu-33 commented Jun 15, 2026

Uh oh!

copy-pr-bot Bot commented Jun 15, 2026

Uh oh!

yaoyu-33 commented Jun 15, 2026

Uh oh!

claude Bot commented Jun 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yaoyu-33 commented Jun 15, 2026

Summary

Validation

Uh oh!

copy-pr-bot Bot commented Jun 15, 2026

Uh oh!

yaoyu-33 commented Jun 15, 2026

Uh oh!

claude Bot commented Jun 15, 2026

Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant