This repository was archived by the owner on Dec 8, 2023. It is now read-only.

Description
Hello, thank you for the amazing work! I couldn't work out how to convert English text (on which I want to run inference with your model) into a torch tensor of token IDs. As far as I understand, you first convert the string to a sequence of phonemes and then map those to their indices, is that right? Could you please help me with how to do this?
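
To make the question concrete, here is a minimal sketch of the pipeline I have in mind (text, then phonemes, then indices). Everything in it is an assumption on my side: `TOY_LEXICON`, `SYMBOLS`, `text_to_phonemes` and `phonemes_to_ids` are hypothetical names, and the symbol table is made up for illustration, so it will not match the vocabulary the model was actually trained with.

```python
# Hypothetical toy pronunciation lexicon (ARPAbet-style symbols).
# A real pipeline would use a grapheme-to-phoneme tool instead of a fixed dict.
TOY_LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}

# Hypothetical symbol table: index 0 is padding, index 1 is the word separator,
# followed by the phonemes in sorted order. The model's real table may differ.
SYMBOLS = ["<pad>", " "] + sorted(
    {p for phones in TOY_LEXICON.values() for p in phones}
)
SYMBOL_TO_ID = {symbol: index for index, symbol in enumerate(SYMBOLS)}


def text_to_phonemes(text):
    """Lowercase the text, split on whitespace, and look up each word."""
    phonemes = []
    for i, word in enumerate(text.lower().split()):
        if i > 0:
            phonemes.append(" ")  # separator between words
        phonemes.extend(TOY_LEXICON[word])  # raises KeyError for unknown words
    return phonemes


def phonemes_to_ids(phonemes):
    """Map each phoneme symbol to its integer index."""
    return [SYMBOL_TO_ID[p] for p in phonemes]


phonemes = text_to_phonemes("Hello world")
token_ids = phonemes_to_ids(phonemes)
print(phonemes)
print(token_ids)

# With PyTorch installed, the list could then be wrapped as a batch of one:
#   torch.tensor(token_ids, dtype=torch.long).unsqueeze(0)
```

If this is roughly the right idea, what I am missing is which phonemizer and which symbol-to-index table the model expects.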