Skip to content

Feature request: support saving/loading quantized language model #185

@mscdex

Description

@mscdex

As far as I can tell there is no way to save/load a quantized TTSModel. Being able to do that would allow for a further reduction in file size and avoid redundant re-quantizing, resulting in faster loading times.

Is there a workaround for the existing behavior/logic that exists for 2.1.0 (that doesn't involve loading a model twice -- e.g. first loading the default model and then overwriting things with a quantized state_dict)?

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions