Skip to content

Feature Request: Add FunASR as speech recognition backend #233

@LauraGPT

Description

@LauraGPT

Hi! Ichigo is an interesting local realtime voice AI project.

I'd like to suggest adding FunASR (16K+ stars) as an ASR backend option:

  • 170x real-time GPU speed — minimal latency for real-time voice interaction
  • Streaming ASR: Paraformer-streaming model designed for real-time with sub-second latency
  • Built-in VAD + punctuation: No separate preprocessing needed
  • 50+ languages: SenseVoice model with automatic language detection
  • Local/offline: Runs entirely locally, aligning with Ichigo's local-first approach
  • OpenAI-compatible API: funasr-server --device cuda serves at /v1/audio/transcriptions
from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall")
result = model.generate(input="audio.wav")

Happy to help with integration!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    Status
    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions