V1.17.0-cu129 运行qwen3-next-80B-A3B-Instruct报错——'Qwen3NextConfig' object has no attribute 'dtype'-4张L20

### System Info / 系統信息

ubuntu  22.04 
docker 

### Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece？

- [x] docker / docker
- [ ] pip install / 通过 pip install 安装
- [ ] installation from source / 从源码安装

### Version info / 版本信息

V1.17.0-cu129

### The command used to start Xinference / 用以启动 xinference 的命令

xinference launch  --model-engin vllm --model-name Qwen3-Next-Instruct --size-in-billon 80 --quantization none --model-format pytorch   --model-path /opt/xxxx/Qwen3-Next-80B-A3B-Instruct  --n-gpu 4 --n-worker 1 --max-model-len 3000 --gpu-memory-utilization 0.8

### Reproduction / 复现过程

直接加载模型后，运行到一段时间后报错

'Qwen3NextConfig' object has no attribute 'dtype'

比较奇怪的是，我是用vllm的运行引擎，因为transformers版本不够，但是在运行到这个报错之前，调用的方法是

/transformers/configuration_util.py，是在这报出的错误。


我看了看模型的config.json文件，里面是有“torch_type”:"bfloat16"的字段，和别的模型是一样的，但是别的模型能够正常加载，这个就不行。

### Expected behavior / 期待表现

能够成功加载模型

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

V1.17.0-cu129 运行qwen3-next-80B-A3B-Instruct报错——'Qwen3NextConfig' object has no attribute 'dtype'-4张L20 #4516

System Info / 系統信息

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece？

Version info / 版本信息

The command used to start Xinference / 用以启动 xinference 的命令

Reproduction / 复现过程

Expected behavior / 期待表现

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

V1.17.0-cu129 运行qwen3-next-80B-A3B-Instruct报错——'Qwen3NextConfig' object has no attribute 'dtype'-4张L20 #4516

Description

System Info / 系統信息

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece？

Version info / 版本信息

The command used to start Xinference / 用以启动 xinference 的命令

Reproduction / 复现过程

Expected behavior / 期待表现

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions