Hello
How are you?
I found two kinds of inference for the offline model on your demo page.

I checked the script "run_voice_clone.py", but I am NOT sure how to run both kinds of inference with it.
On your demo page, ONLY a prompt speech is given as input for "Reference Utterance as Prompt".
In the script "run_voice_clone.py", both prompt text and prompt audio are given as input.
Besides, the length limits for the prompt text and prompt audio are NOT mentioned.