Skip to content

Commit c339b2d

Browse files
🌿 Fern Regeneration -- April 18, 2025 (#282)
Co-authored-by: fern-api <115122769+fern-api[bot]@users.noreply.github.com>
1 parent 147eaa7 commit c339b2d

File tree

5 files changed

+103
-1
lines changed

5 files changed

+103
-1
lines changed

.mock/definition/tts/__package__.yml

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -344,6 +344,14 @@ types:
344344
When setting to `false`, avoid including utterances with long `text`,
345345
as this can result in distorted output.
346346
default: true
347+
strip_headers:
348+
type: optional<boolean>
349+
docs: >-
350+
If enabled, the audio for all the chunks of a generation, once
351+
concatenated together, will constitute a single audio file. Otherwise,
352+
if disabled, each chunk's audio will be its own audio file, each with
353+
its own headers (if applicable).
354+
default: false
347355
utterances:
348356
docs: >-
349357
A list of **Utterances** to be converted to speech output.
@@ -374,6 +382,10 @@ types:
374382
[/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming),
375383
[/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
376384
385+
- Ensure only a single generation is requested
386+
([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations)
387+
must be `1` or omitted).
388+
377389
- With `instant_mode` enabled, **requests incur a 10% higher cost**
378390
due to increased compute and resource requirements.
379391
default: false

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
[tool.poetry]
22
name = "hume"
3-
version = "0.8.1"
3+
version = "0.8.2"
44
description = "A Python SDK for Hume AI"
55
readme = "README.md"
66
authors = []

reference.md

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -120,11 +120,20 @@ This setting affects how the `snippets` array is structured in the response, whi
120120
<dl>
121121
<dd>
122122

123+
**strip_headers:** `typing.Optional[bool]` — If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk's audio will be its own audio file, each with its own headers (if applicable).
124+
125+
</dd>
126+
</dl>
127+
128+
<dl>
129+
<dd>
130+
123131
**instant_mode:** `typing.Optional[bool]`
124132

125133
Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
126134
- Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
127135
- This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
136+
- Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
128137
- With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.
129138

130139
</dd>
@@ -260,11 +269,20 @@ This setting affects how the `snippets` array is structured in the response, whi
260269
<dl>
261270
<dd>
262271

272+
**strip_headers:** `typing.Optional[bool]` — If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk's audio will be its own audio file, each with its own headers (if applicable).
273+
274+
</dd>
275+
</dl>
276+
277+
<dl>
278+
<dd>
279+
263280
**instant_mode:** `typing.Optional[bool]`
264281

265282
Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
266283
- Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
267284
- This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
285+
- Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
268286
- With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.
269287

270288
</dd>
@@ -398,11 +416,20 @@ This setting affects how the `snippets` array is structured in the response, whi
398416
<dl>
399417
<dd>
400418

419+
**strip_headers:** `typing.Optional[bool]` — If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk's audio will be its own audio file, each with its own headers (if applicable).
420+
421+
</dd>
422+
</dl>
423+
424+
<dl>
425+
<dd>
426+
401427
**instant_mode:** `typing.Optional[bool]`
402428

403429
Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
404430
- Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
405431
- This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
432+
- Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
406433
- With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.
407434

408435
</dd>
@@ -544,11 +571,20 @@ This setting affects how the `snippets` array is structured in the response, whi
544571
<dl>
545572
<dd>
546573

574+
**strip_headers:** `typing.Optional[bool]` — If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk's audio will be its own audio file, each with its own headers (if applicable).
575+
576+
</dd>
577+
</dl>
578+
579+
<dl>
580+
<dd>
581+
547582
**instant_mode:** `typing.Optional[bool]`
548583

549584
Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
550585
- Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
551586
- This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
587+
- Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
552588
- With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.
553589

554590
</dd>

0 commit comments

Comments
 (0)