[New Model]: Step-Audio-R1.1

### The model to consider.

https://huggingface.co/stepfun-ai/Step-Audio-R1.1

### The closest model vllm-omni already supports.

_No response_

### What's your difficulty of supporting the model you want?

_No response_

### Use case and motivation

Step-Audio R1.1 (Realtime) is a major upgrade to Step-Audio-R1, designed for interactive spoken dialogue with both real-time responsiveness and strong reasoning capability.

Unlike conventional streaming speech models that trade intelligence for latency, R1.1 enables thinking while speaking, achieving high intelligence without sacrificing speed.

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://vllm-omni.readthedocs.io), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[New Model]: Step-Audio-R1.1 #862

The model to consider.

The closest model vllm-omni already supports.

What's your difficulty of supporting the model you want?

Use case and motivation

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[New Model]: Step-Audio-R1.1 #862

Description

The model to consider.

The closest model vllm-omni already supports.

What's your difficulty of supporting the model you want?

Use case and motivation

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions