Hi! EchoKit is a great open-source voice agent platform.
For the ASR/STT component, SenseVoice could significantly reduce latency:
Why SenseVoice?
- Non-autoregressive: the complete transcription comes from a single forward pass
- 5x faster than Whisper, which is critical for real-time voice agents
- 234M params, so it is lightweight
- OpenAI-compatible API: funasr-server --device cuda serves at /v1/audio/transcriptions
- 50+ languages, auto-detection
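To make the OpenAI-compatible endpoint concrete, here is a minimal sketch of a client that posts an audio chunk to /v1/audio/transcriptions using only the Python standard library. The host and port (localhost:8000), the "model" field value, and the helper names build_transcription_request and transcribe are assumptions for illustration; the multipart field names ("file", "model") and the "text" key in the JSON reply follow OpenAI's transcription API, which the server is described as mirroring.

```python
import json
import urllib.request
import uuid

# Assumption: placeholder address; use whatever funasr-server reports on startup.
BASE_URL = "http://localhost:8000"


def build_transcription_request(audio_bytes, filename="chunk.wav",
                                model="iic/SenseVoiceSmall", base_url=BASE_URL):
    """Build a multipart/form-data POST shaped like OpenAI's transcription call."""
    boundary = uuid.uuid4().hex
    body = b"".join([
        # "model" form field (the value the server expects is an assumption)
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="model"\r\n\r\n'
         f"{model}\r\n").encode(),
        # "file" form field carrying the raw audio bytes
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
         f"Content-Type: application/octet-stream\r\n\r\n").encode(),
        audio_bytes,
        b"\r\n",
        f"--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        base_url + "/v1/audio/transcriptions",
        data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(audio_bytes, **kwargs):
    """Send the request and return the transcript (needs a running server)."""
    req = build_transcription_request(audio_bytes, **kwargs)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.loads(resp.read())["text"]
```

With a server running, transcribe(open("chunk.wav", "rb").read()) would return the transcript string. Keeping request construction separate from the network call makes the payload easy to inspect without a GPU box at hand.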
Quick integration
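from funasr import AutoModel
# Load SenseVoiceSmall together with the FSMN VAD model, which segments longer audio
model = AutoModel(model="iic/SenseVoiceSmall", vad_model="fsmn-vad")
# audio_chunk is the audio to transcribe, e.g. a path to an audio file
result = model.generate(input=audio_chunk)
text = result[0]["text"]  # generate() returns a list of dicts; the transcript is under "text"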
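One practical detail for a voice agent: SenseVoice's raw output usually prefixes the transcript with tags such as <|en|><|NEUTRAL|><|Speech|>. The sketch below strips them with a regular expression before the text is handed to the LLM. The helper name plain_text and the sample result are illustrative assumptions, and the result shape (a list of dicts with a "text" key) is assumed from typical FunASR output; FunASR also ships its own rich_transcription_postprocess helper, which may be the better choice in practice.

```python
import re

# Matches SenseVoice-style tags such as <|en|> or <|NEUTRAL|>
TAG = re.compile(r"<\|[^|]*\|>")


def plain_text(result):
    """Join the 'text' fields of a generate()-style result with tags removed."""
    return " ".join(TAG.sub("", item["text"]).strip() for item in result)


# Hypothetical sample shaped like a model.generate() result
sample = [{"key": "chunk",
           "text": "<|en|><|NEUTRAL|><|Speech|><|woitn|>turn on the lights"}]
print(plain_text(sample))
```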
Links