Skip to content

Feature Request: Add FunASR Speech-to-Text component #13426

@LauraGPT

Description

@LauraGPT

Feature Request

Langflow is an excellent visual AI workflow builder. A Speech-to-Text component powered by FunASR would enable audio processing workflows.

Use cases:

  • Voice input → STT → LLM processing → TTS output
  • Audio file transcription → text processing pipeline
  • Meeting recording → transcription → summarization

Why FunASR?

  • OpenAI-compatible API: funasr-server provides /v1/audio/transcriptions endpoint
  • SenseVoice: Ultra-fast ASR (234M params), 50+ languages
  • Complete pipeline: VAD + ASR + punctuation + speaker diarization in one call
  • Self-hosted: Fully local, no API key needed

Quick start:

pip install funasr vllm fastapi uvicorn python-multipart
funasr-server --device cuda
# /v1/audio/transcriptions at localhost:8000

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions