Improve client-side Live Mode voice activity detection and transcription responsiveness
- Add speechFrames counter to track continuous speech duration (more responsive than pause detection alone)
- Reduce minPauseFramesForTranscription from 8 to 5 for faster chunk detection
- Add minSpeechFramesBeforeSending to require a minimum speech duration (0.4 s) before transcribing
- Implement auto-send every 8 seconds for long monologues to avoid huge text blocks
- Add timeSinceLastSend tracking to force periodic sends during continuous speech
- Tune VAD thresholds for better sensitivity: speechThreshold -35 dB, noiseThreshold -45 dB
- Add console logging for debugging (chunk size, speech detection, transcription progress)
- Fix VAD edge case: handle the ambiguous zone between the speech and silence states correctly
Result: Client-side Live Mode now transcribes chunks more responsively with better natural pause detection
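
For reviewers, the chunking logic described above can be sketched roughly as follows. This is an illustrative sketch, not the code in this commit: the `ChunkDetector` class, the `Decision` type, the `feed` helper, and the 50 ms frame duration (which makes 8 frames equal 0.4 s) are assumptions. Treating the zone between the two thresholds as "keep the previous state" is also an assumption about how the edge case is handled. Only the threshold values, the pause frame count, and the 8-second auto-send interval come from the list above.

```typescript
// Illustrative sketch only; names and frame timing are assumptions, not the commit's code.
interface VadConfig {
  speechThresholdDb: number;              // -35 dB (from the commit message)
  noiseThresholdDb: number;               // -45 dB (from the commit message)
  minPauseFramesForTranscription: number; // 5 (from the commit message)
  minSpeechFramesBeforeSending: number;   // assumed: 8 frames = 0.4 s at 50 ms per frame
  autoSendIntervalMs: number;             // 8000 ms (from the commit message)
  frameDurationMs: number;                // assumed: 50 ms per analysis frame
}

const defaultConfig: VadConfig = {
  speechThresholdDb: -35,
  noiseThresholdDb: -45,
  minPauseFramesForTranscription: 5,
  minSpeechFramesBeforeSending: 8,
  autoSendIntervalMs: 8000,
  frameDurationMs: 50,
};

type Decision = "none" | "send-on-pause" | "send-periodic";

class ChunkDetector {
  private cfg: VadConfig;
  private speechFrames = 0;
  private pauseFrames = 0;
  private isSpeaking = false;
  private timeSinceLastSendMs = 0;

  constructor(cfg: VadConfig = defaultConfig) {
    this.cfg = cfg;
  }

  // Feed one frame's level in dB; returns whether the buffered chunk should be sent.
  processFrame(levelDb: number): Decision {
    const c = this.cfg;

    // Hysteresis: levels between the two thresholds keep the previous state (assumption).
    if (levelDb >= c.speechThresholdDb) this.isSpeaking = true;
    else if (levelDb <= c.noiseThresholdDb) this.isSpeaking = false;

    if (this.isSpeaking) {
      this.speechFrames++;
      this.pauseFrames = 0;
    } else {
      this.pauseFrames++;
    }

    // Only count time toward the periodic send while a chunk is in progress.
    if (this.speechFrames > 0) this.timeSinceLastSendMs += c.frameDurationMs;
    else this.timeSinceLastSendMs = 0;

    const enoughSpeech = this.speechFrames >= c.minSpeechFramesBeforeSending;
    const pauseReached = this.pauseFrames >= c.minPauseFramesForTranscription;

    if (pauseReached) {
      // A natural pause ends the chunk; blips shorter than the minimum are discarded.
      this.reset();
      return enoughSpeech ? "send-on-pause" : "none";
    }
    if (enoughSpeech && this.isSpeaking && this.timeSinceLastSendMs >= c.autoSendIntervalMs) {
      // Long monologue: force a send even though no pause occurred.
      this.reset();
      return "send-periodic";
    }
    return "none";
  }

  private reset(): void {
    this.speechFrames = 0;
    this.pauseFrames = 0;
    this.timeSinceLastSendMs = 0;
  }
}

// Helper for experimentation: feed `count` frames at the same level and collect decisions.
function feed(detector: ChunkDetector, levelDb: number, count: number): Decision[] {
  const out: Decision[] = [];
  for (let i = 0; i < count; i++) out.push(detector.processFrame(levelDb));
  return out;
}
```

Under these assumed settings, eight frames of speech followed by five frames of silence produce one pause-triggered send, while uninterrupted speech triggers a periodic send once 8 seconds have accumulated.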
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <[email protected]>