
[Usage]: How to use 1.3.0rc6 to deploy Qwen3.5 27B on DGX Spark #12000

@luningxie

Description


System Info

System Information:

  • OS: Ubuntu 24.04 (DGX Spark)
  • Python version: 3.13
  • CUDA version: 13.0
  • GPU model(s): GB10
  • Driver version: 580.126.09
  • TensorRT-LLM version: 1.3.0rc6 (Docker)


How would you like to use TensorRT-LLM

I want to run inference with Qwen3.5 27B, but I don't know how to integrate it with TensorRT-LLM.
I have tried both 1.3.0rc5 and 1.3.0rc6; both failed.
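For reference, serving a Hugging Face checkpoint with a TensorRT-LLM release container typically goes through `trtllm-serve`, which exposes an OpenAI-compatible endpoint. A minimal sketch follows; the container tag and the Hugging Face repo id for Qwen3.5 27B are assumptions here, so check the NGC catalog and the Hugging Face hub for the exact names:

```shell
# Pull the release container (tag is an assumption; verify the exact 1.3.0rc6 tag on NGC).
docker pull nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc6

# Launch the container with GPU access and start an OpenAI-compatible server.
# The model id below is a placeholder -- substitute the actual Qwen3.5 27B
# checkpoint repo id or a local path mounted into the container.
docker run --rm -it --gpus all --ipc=host -p 8000:8000 \
  nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc6 \
  trtllm-serve "Qwen/Qwen3.5-27B" --host 0.0.0.0 --port 8000
```

If the model architecture is not yet supported in the installed release, this is where the failure would surface, so including the exact error output from this step in the issue would help maintainers diagnose it.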

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.

Metadata

Labels

  • Model customization <NV>: Adding support for new model architectures or variants
  • question: Further information is requested
