You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|**Second press**| stop ⇢ [transcribe ⇢ copy to clipboard] ⇢ types the output into any focused app |
142
+
|**Second press**| stop ⇢ [finalize transcription ⇢ copy to clipboard] ⇢ types any remaining output into any focused app |
142
143
143
-
Otherwise, if in --flux (beta), **just speak**.
144
+
### 🎙️ Streaming Mode (Default)
145
+
146
+
VOXD uses **streaming transcription** by default, which means:
147
+
148
+
-**Real-time typing**: Text appears incrementally as you speak, not after you stop recording
149
+
-**Chunk-based processing**: Audio is processed in overlapping chunks (default: 3 seconds) for continuous transcription
150
+
-**Incremental updates**: Text is typed word-by-word or phrase-by-phrase as it's transcribed (typically every 2 seconds or 3 words)
151
+
-**Seamless experience**: You see your words appear in real-time, making it feel like natural voice-typing
152
+
153
+
**How it works:**
154
+
1. Press hotkey to start → VOXD begins recording and transcribing
155
+
2. As you speak → Text appears incrementally in your focused application
156
+
3. Press hotkey again → Finalizes any remaining transcription and copies to clipboard
157
+
158
+
This streaming behavior is enabled by default in CLI (`voxd`), GUI (`voxd --gui`), and Tray (`voxd --tray`) modes. The old "record-then-transcribe" behavior is no longer used.
159
+
160
+
**Note:** If in `--flux` mode (beta), **just speak** - no hotkey needed, voice activity detection triggers recording automatically.
144
161
145
162
### Autostart
146
163
For practical reasons (always ready to type & low system footprint), it is advised to enable voxd user daemon:
@@ -307,6 +324,15 @@ llamacpp_server_timeout: 30
307
324
# Selected models per provider (automatically updated by VOXD)
0 commit comments