[Bug]: The Qwen3 - Next - 80B - A3B - Instruct model is deployed on 8 cards of the 910B with 64G memory, and out - of - memory (OOM) occurs.

### Your current environment

### vllm-ascend
**version**:v0.11.0rc2-openeuler

### Docker Compose

<img width="1423" height="588" alt="Image" src="https://github.com/user-attachments/assets/aaf9433b-9a83-4b31-b594-97062796e068" />

### 🐛 Describe the bug

### Docker Container Logs

<img width="1047" height="728" alt="Image" src="https://github.com/user-attachments/assets/dc5ce106-1472-43dd-b8ac-06bae1a62a26" />

<img width="1051" height="736" alt="Image" src="https://github.com/user-attachments/assets/d2f4ee48-845f-48f7-b6f9-51a54371a6eb" />

<img width="1025" height="714" alt="Image" src="https://github.com/user-attachments/assets/cc068d08-eafb-4431-83e1-1a0b8a119556" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bug]: The Qwen3 - Next - 80B - A3B - Instruct model is deployed on 8 cards of the 910B with 64G memory, and out - of - memory (OOM) occurs. #4379

Your current environment

vllm-ascend

Docker Compose

🐛 Describe the bug

Docker Container Logs

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Bug]: The Qwen3 - Next - 80B - A3B - Instruct model is deployed on 8 cards of the 910B with 64G memory, and out - of - memory (OOM) occurs. #4379

Description

Your current environment

vllm-ascend

Docker Compose

🐛 Describe the bug

Docker Container Logs

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions