Skip to content

Conversation

@akoumpa
Copy link
Contributor

@akoumpa akoumpa commented Nov 21, 2025

No description provided.

Signed-off-by: Alexandros Koumparoulis <[email protected]>
@copy-pr-bot
Copy link

copy-pr-bot bot commented Nov 21, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@akoumpa akoumpa linked an issue Nov 21, 2025 that may be closed by this pull request
@akoumpa
Copy link
Contributor Author

akoumpa commented Nov 21, 2025

/ok to test 830b528

@ZhiyuLi-Nvidia
Copy link
Contributor

@akoumpa

As discussed offline:

  • qwen3(Qwen3ForCausalLM) --> fails with SP
  • qwen2(Qwen2ForCausalLM) --> works with SP

We can either keep qwen2 or bypass the following functional test
https://github.com/NVIDIA-NeMo/Automodel/blob/main/tests/functional_tests/hf_transformer_finetune/L2_HF_Transformer_PEFT_Benchmark_qwen2_custom.sh

Signed-off-by: Alexandros Koumparoulis <[email protected]>
@akoumpa akoumpa removed the r0.2.0 Add for cherry-pick into release branch 0.2.0 label Nov 21, 2025
Signed-off-by: Alexandros Koumparoulis <[email protected]>
@akoumpa
Copy link
Contributor Author

akoumpa commented Nov 21, 2025

/ok to test 229bff5

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Qwen parallelizer with sequence parallelism

3 participants