Feature Request: Add FunASR as speech recognition backend

Hi! Ichigo is an interesting local realtime voice AI project.

I'd like to suggest adding [FunASR](https://github.com/modelscope/FunASR) (16K+ stars) as an ASR backend option:

- **170x real-time GPU speed** — minimal latency for real-time voice interaction
- **Streaming ASR**: Paraformer-streaming model designed for real-time with sub-second latency
- **Built-in VAD + punctuation**: No separate preprocessing needed
- **50+ languages**: SenseVoice model with automatic language detection
- **Local/offline**: Runs entirely locally, aligning with Ichigo's local-first approach
- **OpenAI-compatible API**: `funasr-server --device cuda` serves at `/v1/audio/transcriptions`

```python
from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall")
result = model.generate(input="audio.wav")
```

Happy to help with integration!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Add FunASR as speech recognition backend #233

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Feature Request: Add FunASR as speech recognition backend #233

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions