- Near real-time text-to-speech
- Supports two-person dialogue
- Multiple voice options available
- Supports text in multiple languages
- Easy integration with ComfyUI workflows
[2025-06-02]⚒️: Supports two-person dialogue.
[2025-03-22]⚒️: Code refactoring, faster generation speed.
[2025-03-05]⚒️: Supports 8 languages, 150 voices.
- American English
- British English
- Japanese
- Chinese
- Spanish
- French
- Hindi
- Italian
- Brazilian Portuguese
- Text-to-Speech:
- English Two-Person Dialogue:
- Chinese Two-Person Dialogue:
cd ComfyUI/custom_nodes
git clone https://github.com/billwuhao/ComfyUI_KokoroTTS_MW.git
cd ComfyUI_KokoroTTS_MW
pip install -r requirements.txt
# For the ComfyUI portable build, use the bundled interpreter (python_embeded) instead:
./python_embeded/python.exe -m pip install -r requirements.txt
- Models and voices must be downloaded manually and placed under `ComfyUI\models\Kokorotts`.
Structure should be as follows:
ComfyUI\models\Kokorotts
├── Kokoro-82M
│   ├── voices
│   ├── config.json
│   └── kokoro-v1_0.pth
└── Kokoro-82M-v1.1-zh
    ├── voices
    ├── config.json
    └── kokoro-v1_1-zh.pth
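
Because the models are placed by hand, a quick check of the folder layout can catch a misplaced file before the nodes are loaded. The following is a minimal sketch, not part of this repository: the function name `missing_model_files` and the `EXPECTED` table are hypothetical, and the script assumes it is run from the directory that contains `ComfyUI`.

```python
from pathlib import Path

# Expected entries per model folder, copied from the layout described above.
EXPECTED = {
    "Kokoro-82M": ["voices", "config.json", "kokoro-v1_0.pth"],
    "Kokoro-82M-v1.1-zh": ["voices", "config.json", "kokoro-v1_1-zh.pth"],
}


def missing_model_files(models_root):
    """Return the expected paths that do not exist under models_root."""
    root = Path(models_root)
    missing = []
    for model_dir, entries in EXPECTED.items():
        for entry in entries:
            path = root / model_dir / entry
            if not path.exists():
                missing.append(path)
    return missing


if __name__ == "__main__":
    # Assumption: the current working directory contains the ComfyUI folder.
    root = Path("ComfyUI") / "models" / "Kokorotts"
    missing = missing_model_files(root)
    if missing:
        print("Missing files or folders:")
        for path in missing:
            print(f"  {path}")
    else:
        print(f"All expected model files found under {root}")
```

If only one of the two models is needed, remove the other entry from `EXPECTED` before running the check.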