Clarify STT model modes #5569
Merged
1574cfb to 6f00dd2
Cursor Bugbot has reviewed your changes using default effort and found 1 potential issue.
Reviewed by Cursor Bugbot for commit 6f00dd2.
6f00dd2 to d6a6956
Show batch and realtime badges for Soniox, AssemblyAI, ElevenLabs, and Mistral model variants, and keep realtime aliases safe for batch transcription.
d6a6956 to 8a9764e

Note
Medium Risk
Touches the live WebSocket relay and provider model resolution; ElevenLabs streaming behavior changes for proxied binary audio, though batch misuse of realtime IDs is explicitly guarded.
Overview
STT settings now expose more realtime variants and label models with Batch vs Realtime badges for AssemblyAI, ElevenLabs, Mistral, and Soniox (not only Soniox). Provider pickers add `u3-rt-pro`, `scribe_v2_realtime`, and `voxtral-mini-transcribe-realtime-2602`, with clearer display names (e.g. Universal-3 Pro Streaming).

Batch transcription maps realtime model IDs to each provider's batch default (`scribe_v2_realtime` → `scribe_v2`, Mistral realtime → `voxtral-mini-2602`) so choosing a streaming model does not break file/batch paths.

Live proxy adds an optional client binary message mapper on the WebSocket relay (single and stereo split). For ElevenLabs, inbound binary audio is rewritten to JSON `input_audio_chunk` with base64 before upstream send, matching ElevenLabs' text-based streaming API.

Reviewed by Cursor Bugbot for commit 8a9764e. Bugbot is set up for automated code reviews on this repo.
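
For readers skimming the overview, the two behavior changes can be sketched roughly as follows. This is an illustration in Python, not the code from this PR: the helper names (`resolve_batch_model`, `map_binary_to_elevenlabs`, `prepare_upstream`) are hypothetical, the mapping table contains only the two pairs named above, and the JSON field names `message_type` and `audio_base_64` are assumptions about the ElevenLabs message shape rather than something this description confirms.

```python
import base64
import json

# Realtime-to-batch fallbacks named in the PR description. Defaults for
# other providers are not stated there, so they are deliberately omitted.
REALTIME_TO_BATCH = {
    "scribe_v2_realtime": "scribe_v2",
    "voxtral-mini-transcribe-realtime-2602": "voxtral-mini-2602",
}


def resolve_batch_model(model_id):
    """Hypothetical helper: return a model ID that is safe for batch use.

    Realtime IDs are swapped for the batch default; anything else
    passes through unchanged.
    """
    return REALTIME_TO_BATCH.get(model_id, model_id)


def map_binary_to_elevenlabs(frame):
    """Hypothetical mapper: wrap a raw audio frame in a JSON text message.

    The field names below are assumptions, not taken from the PR.
    """
    return json.dumps({
        "message_type": "input_audio_chunk",
        "audio_base_64": base64.b64encode(frame).decode("ascii"),
    })


def prepare_upstream(frame, mapper=None):
    """Hypothetical relay step: apply the optional mapper to a client frame.

    With no mapper the binary frame is forwarded untouched.
    """
    return mapper(frame) if mapper is not None else frame
```

Keeping the mapper optional means providers that accept binary frames are unaffected; in this sketch only a relay configured with `map_binary_to_elevenlabs` would send text messages upstream.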