As far as I can tell there is no way to save/load a quantized TTSModel. Being able to do that would allow for a further reduction in file size and avoid redundant re-quantizing, resulting in faster loading times.
Is there a workaround for the existing behavior/logic that exists for 2.1.0 (that doesn't involve loading a model twice -- e.g. first loading the default model and then overwriting things with a quantized state_dict)?
As far as I can tell there is no way to save/load a quantized TTSModel. Being able to do that would allow for a further reduction in file size and avoid redundant re-quantizing, resulting in faster loading times.
Is there a workaround for the existing behavior/logic that exists for 2.1.0 (that doesn't involve loading a model twice -- e.g. first loading the default model and then overwriting things with a quantized state_dict)?