Feature request: support saving/loading quantized language model

As far as I can tell there is no way to save/load a quantized TTSModel. Being able to do that would allow for a further reduction in file size and avoid redundant re-quantizing, resulting in faster loading times.

Is there a workaround for the existing behavior/logic that exists for 2.1.0 (that doesn't involve loading a model twice -- e.g. first loading the default model and then overwriting things with a quantized state_dict)?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature request: support saving/loading quantized language model #185

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Feature request: support saving/loading quantized language model #185

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions