Skip to content

Feature Request: Add FunASR/SenseVoice as STT backend #385

@LauraGPT

Description

@LauraGPT

Feature Request

Dograh as an open-source voice AI platform could benefit from FunASR/SenseVoice as an STT backend.

Why

  • OpenAI-compatible APIfunasr-server provides /v1/audio/transcriptions endpoint
  • Real-time WebSocket streaming — built-in streaming server at ws://localhost:10095
  • Self-hosted — fully offline, no external APIs
  • 50+ languages with automatic language detection
  • 170x real-time on GPU — non-autoregressive, very fast
  • Speaker diarization — built-in cam++ model for multi-speaker scenarios
  • Emotion detection — SenseVoice classifies speech emotion

Quick Setup

pip install funasr
funasr-server --device cuda

Streaming Mode

python -m funasr.bin.ws_server --device cuda
# WebSocket endpoint: ws://localhost:10095

Happy to help with integration!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions