[New Model]: Step-Audio-R1.1 #862

@fake0fan

The model to consider.

https://huggingface.co/stepfun-ai/Step-Audio-R1.1

The closest model vllm-omni already supports.

No response

What's your difficulty of supporting the model you want?

No response

Use case and motivation

Step-Audio R1.1 (Realtime) is a major upgrade to Step-Audio-R1, designed for interactive spoken dialogue with both real-time responsiveness and strong reasoning capability.

Unlike conventional streaming speech models that trade intelligence for latency, R1.1 enables thinking while speaking, achieving high intelligence without sacrificing speed.

Before submitting a new issue...

  • Make sure you have already searched for relevant issues and asked the chatbot at the bottom-right corner of the documentation page, which can answer many frequently asked questions.
