[Bug]: openai/gpt-oss-120b freeze during init_cache pass #12006

### System Info

H100

### Who can help?

_No response_

### Information

- [ ] The official example scripts
- [ ] My own modified scripts

### Tasks

- [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)
- [ ] My own task or dataset (give details below)

### Reproduction

`python3 examples/auto_deploy/build_and_run_ad.py --model openai/gpt-oss-120b --args.yaml-extra examples/auto_deploy/model_registry/configs/dashboard_default.yaml --args.yaml-extra examples/auto_deploy/model_registry/configs/world_size_8.yaml --args.yaml-extra examples/auto_deploy/model_registry/configs/num_hidden_layers_5.yaml`

### Expected behavior

The script should run to completion.

### actual behavior

The run freezes for more than 30 minutes during the init_cache pass.

### additional notes

NA
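
To make the hang easier to confirm in a script or CI job, the reproduction command can be wrapped in a timeout. The following is a minimal sketch, not part of the repository: `run_with_timeout` and `REPRO_CMD` are hypothetical names, and the 1800-second limit is an assumption taken from the roughly 30-minute freeze reported above.

```python
import subprocess
import sys

# Reproduction command from this issue, split into an argument list.
REPRO_CMD = [
    "python3", "examples/auto_deploy/build_and_run_ad.py",
    "--model", "openai/gpt-oss-120b",
    "--args.yaml-extra", "examples/auto_deploy/model_registry/configs/dashboard_default.yaml",
    "--args.yaml-extra", "examples/auto_deploy/model_registry/configs/world_size_8.yaml",
    "--args.yaml-extra", "examples/auto_deploy/model_registry/configs/num_hidden_layers_5.yaml",
]


def run_with_timeout(cmd, timeout_s):
    """Run cmd and return (finished, returncode).

    finished is False (and returncode is None) if the command was still
    running after timeout_s seconds. subprocess.run kills the direct child
    on timeout; processes spawned by that child may need separate cleanup.
    """
    try:
        proc = subprocess.run(cmd, timeout=timeout_s)
        return True, proc.returncode
    except subprocess.TimeoutExpired:
        return False, None


# Example usage (assumed 30-minute limit, run from the repository root):
#   finished, code = run_with_timeout(REPRO_CMD, 1800)
#   if not finished:
#       print("Command did not finish within 30 minutes", file=sys.stderr)
```

A result of `(False, None)` indicates the command was still running at the limit. It does not show where the process is stuck.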

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and checked the [documentation](https://nvidia.github.io/TensorRT-LLM/) and [examples](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples) for answers to frequently asked questions.
