Skip to content

[Misc] downgrade nvidia-cutlass-dsl to 4.5.0#43230

Open
ZJY0516 wants to merge 1 commit into
vllm-project:mainfrom
ZJY0516:downgrade-cutedsl
Open

[Misc] downgrade nvidia-cutlass-dsl to 4.5.0#43230
ZJY0516 wants to merge 1 commit into
vllm-project:mainfrom
ZJY0516:downgrade-cutedsl

Conversation

@ZJY0516
Copy link
Copy Markdown
Member

@ZJY0516 ZJY0516 commented May 20, 2026

Purpose

4.5.1 breaks the flashinfer GDN kernel on gb200.

How to reproduce:

pytest -s -v tests/gdn/test_prefill_delta_rule.py::test_prefill_kernel_basic[float16-64-seq_lens0-1-1-1-128-1.0-False-True]

Test Plan

# SPDX-License-Identifier: Apache-2.0
# SPDX-FileCopyrightText: Copyright contributors to the vLLM project

from vllm import LLM, SamplingParams

# Sample prompts.
prompts = [
    "Hello, my name is",
    "The president of the United States is",
    "The capital of France is",
    "The future of AI is",
]
# Create a sampling params object.
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)


def main():
    # Create an LLM.
    llm = LLM(model="Qwen/Qwen3.6-35B-A3B")
    # Generate texts from the prompts.
    # The output is a list of RequestOutput objects
    # that contain the prompt, generated text, and other information.
    outputs = llm.generate(prompts, sampling_params)
    # Print the outputs.
    print("\nGenerated Outputs:\n" + "-" * 60)
    for output in outputs:
        prompt = output.prompt
        generated_text = output.outputs[0].text
        print(f"Prompt:    {prompt!r}")
        print(f"Output:    {generated_text!r}")
        print("-" * 60)


if __name__ == "__main__":
    main()

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request downgrades the nvidia-cutlass-dsl[cu13] package from version 4.5.1 to 4.5.0 in the requirements/cuda.txt file. I have no feedback to provide as there were no review comments to evaluate.

@ZJY0516 ZJY0516 added the ready ONLY add when PR is ready to merge/full CI is needed label May 20, 2026
Copy link
Copy Markdown
Collaborator

@MatthewBonanni MatthewBonanni left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-project-automation github-project-automation Bot moved this to Ready in NVIDIA May 20, 2026
@MatthewBonanni MatthewBonanni enabled auto-merge (squash) May 20, 2026 17:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build nvidia ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: Ready

Development

Successfully merging this pull request may close these issues.

2 participants