Skip to content

[Test] E2E Nemotron-3-Super tests#36803

Open
roikoren755 wants to merge 3 commits intovllm-project:mainfrom
roikoren755:test/nemotron-3-super-e2e
Open

[Test] E2E Nemotron-3-Super tests#36803
roikoren755 wants to merge 3 commits intovllm-project:mainfrom
roikoren755:test/nemotron-3-super-e2e

Conversation

@roikoren755
Copy link
Contributor

@roikoren755 roikoren755 commented Mar 11, 2026

Purpose

Adding 3 E2E tests for Nemotron-3-Super, in BF16, FP8 and NVFP4, with speculative decoding.

Test Plan

Three new tests pass.

Test Result

They do 🎉


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Roi Koren <roik@nvidia.com>
Signed-off-by: Roi Koren <roik@nvidia.com>
@roikoren755 roikoren755 requested a review from mgoin as a code owner March 11, 2026 16:36
@mergify mergify bot added the ci/build label Mar 11, 2026
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request adds end-to-end tests for Nemotron-3-Super models in BF16, FP8, and NVFP4 formats. The changes include new Buildkite pipeline steps and corresponding test configuration files. My review found a potential issue with inconsistencies between filenames and the model_name specified within the new YAML configuration files. This could lead to confusion or incorrect model testing and should be addressed.

@@ -0,0 +1,11 @@
model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

There's an inconsistency between the filename, which contains A20B, and the model_name value, which contains A12B. To maintain consistency and prevent potential confusion, they should match. If A20B is correct, please update the model name as suggested. However, if A12B is the correct model identifier, please rename this file accordingly and update its reference in tests/evals/gsm8k/configs/models-nemotron-3-super.txt.

model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A20B-BF16"

@@ -0,0 +1,11 @@
model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

There's an inconsistency between the filename, which contains A20B, and the model_name value, which contains A12B. To maintain consistency and prevent potential confusion, they should match. If A20B is correct, please update the model name as suggested. However, if A12B is the correct model identifier, please rename this file accordingly and update its reference in tests/evals/gsm8k/configs/models-nemotron-3-super.txt.

model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A20B-FP8"

@@ -0,0 +1,11 @@
model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

There's an inconsistency between the filename, which contains A20B, and the model_name value, which contains A12B. To maintain consistency and prevent potential confusion, they should match. If A20B is correct, please update the model name as suggested. However, if A12B is the correct model identifier, please rename this file accordingly and update its reference in tests/evals/gsm8k/configs/models-nemotron-3-super-b200.txt.

model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A20B-NVFP4"

@mgoin mgoin added ready ONLY add when PR is ready to merge/full CI is needed nvidia labels Mar 11, 2026
Signed-off-by: Roi Koren <roik@nvidia.com>
Comment on lines +85 to +89
- label: Nemotron-3 Super 120B GSM8K (B200)
timeout_in_minutes: 120
device: b200
optional: true
num_devices: 4
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We only have B200 runners with 2 devices, so please update to that config

Comment on lines +77 to +81
- label: Nemotron-3 Super 120B GSM8K (H200)
timeout_in_minutes: 120
device: h200
optional: true
num_devices: 4
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think our h200 resource is still disabled, only does 8 devices. I think you should be able to run on 4 H100 though

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build nvidia ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

2 participants