Improve streaming #4

twitchard · 2025-04-18T10:48:42Z

New --instant-mode Flag: Introduced an --instant-mode flag (config: tts.instantMode) that can be combined with streaming (--streaming) and single generation (--num-generations=1) to potentially reduce TTS synthesis latency.
Improved Streaming Playback: Audio playback during streaming (--play all or --play first with --streaming) now pipes audio data directly to the detected audio player's standard input (stdin). This enables lower latency playback as audio chunks are received, without needing to write temporary snippet files to disk first.
One file per generation, not one file per chunk: In streaming mode, the tool now saves one consolidated audio file per generation (e.g., output_gen123.wav) instead of a separate file for each snippet (e.g., output_gen123.0.wav, output_gen123.1.wav).
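
The streaming playback and one-file-per-generation behavior described above can be illustrated with a minimal Python sketch. It is not the tool's actual implementation: the function name play_and_save, its parameters, and the assumption that chunks arrive as raw bytes of one continuous audio stream (so that plain concatenation yields a valid file) are all hypothetical and chosen for illustration only.

```python
import subprocess
from pathlib import Path
from typing import Iterable, Sequence


def play_and_save(
    chunks: Iterable[bytes],
    player_cmd: Sequence[str],
    out_path: Path,
) -> int:
    """Pipe each chunk to a player's stdin as it arrives and append it to
    a single output file. Returns the total number of bytes saved.

    Hypothetical helper: assumes `chunks` are consecutive pieces of one
    audio stream and that `player_cmd` reads audio from standard input.
    """
    total = 0
    player = subprocess.Popen(player_cmd, stdin=subprocess.PIPE)
    try:
        with open(out_path, "wb") as out:
            for chunk in chunks:
                # Save first, so the file is complete even if playback fails.
                out.write(chunk)
                total += len(chunk)
                try:
                    # No temporary snippet file: the player gets the bytes directly.
                    player.stdin.write(chunk)
                    player.stdin.flush()
                except BrokenPipeError:
                    # The player exited early; keep saving the remaining chunks.
                    pass
    finally:
        try:
            player.stdin.close()
        except BrokenPipeError:
            pass
        player.wait()
    return total
```

With a player that reads from stdin (for example, assuming ffplay is installed, `["ffplay", "-nodisp", "-autoexit", "-"]`), a call such as `play_and_save(chunks, player_cmd, Path("output_gen123.wav"))` would start playback as soon as the first chunk arrives while producing one file for the whole generation.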

twitchard added 5 commits April 18, 2025 03:45

Improve streaming

b50026b

docs and tests

1bb1952

Allow instant mode with either voice OR continuation

0627955

update test not to require ffplay

e20acc9

format

74e4ded

twitchard merged commit afb2b0c into main Apr 18, 2025
1 check passed

twitchard deleted the twitchard/improve-streaming branch April 18, 2025 18:35
