Hi! Ichigo is an interesting local realtime voice AI project.
I'd like to suggest adding FunASR (16K+ stars) as an ASR backend option:
- 170x real-time GPU speed: minimal latency for real-time voice interaction
- Streaming ASR: Paraformer-streaming model designed for real-time use with sub-second latency
- Built-in VAD + punctuation: No separate preprocessing needed
- 50+ languages: SenseVoice model with automatic language detection
- Local/offline: Runs entirely locally, aligning with Ichigo's local-first approach
- OpenAI-compatible API: `funasr-server --device cuda` serves at `/v1/audio/transcriptions`

Direct Python usage:

from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall")
result = model.generate(input="audio.wav")

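For the server route, a client call could look roughly like the sketch below. It assumes the endpoint accepts the same request shape as OpenAI's transcription API (a multipart form with `file` and `model` fields). The helper name `build_transcription_request`, the base URL `http://localhost:8000`, and the `model` value are placeholders for illustration, not confirmed FunASR defaults.

```python
import uuid
import urllib.request


def build_transcription_request(base_url, filename, audio_bytes, model="sensevoice"):
    """Build (but do not send) a multipart POST for /v1/audio/transcriptions.

    Hypothetical helper: the default `model` value is a placeholder.
    """
    boundary = uuid.uuid4().hex

    # Text form field "model", as in OpenAI-style transcription requests.
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()

    # Binary form field "file" carrying the raw audio bytes.
    file_part = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode() + audio_bytes + b"\r\n"

    closing = f"--{boundary}--\r\n".encode()

    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",
        data=model_part + file_part + closing,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


# Assumed local address; adjust to wherever the server actually listens.
request = build_transcription_request("http://localhost:8000", "audio.wav", b"RIFF")
```

To send it, pass `request` to `urllib.request.urlopen` and parse the JSON response. If the server mirrors the OpenAI format, the transcript should be under a `text` key, but that is worth verifying against the actual response.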
Happy to help with integration!