NEKO Local Lightweight TTS

This service is the first-phase local TTS bridge for NEKO. It deliberately implements the same WebSocket protocol expected by local_cosyvoice_worker, so NEKO can use it without changing the main TTS pipeline.

Protocol

Endpoint:

ws://127.0.0.1:50000/v1/audio/speech/stream

Client messages:

{"voice":"kokoro:zf_001","speed":1.0}
{"text":"Hello from NEKO."}
{"text":"Local Kokoro TTS test."}
{"event":"end"}

Server response:

binary PCM s16le chunks, mono, 22050 Hz

NEKO's existing local_cosyvoice_worker then resamples this audio to 48 kHz.

Start

From the repository root:

uv run python local_server/local_tts_server/server.py --host 127.0.0.1 --port 50000

In NEKO settings, use the existing local custom TTS path:

ws://127.0.0.1:50000

Keep the existing custom/GPT-SoVITS toggle enabled, because the current router uses that switch to route ws:// custom TTS URLs into local_cosyvoice_worker.

Voice Selector

The service accepts a model prefix in voice:

kokoro:<voice>
melotts:<voice>
melo:<voice>
chattts:<voice>

If the prefix is missing, LOCAL_TTS_DEFAULT_MODEL is used. The default is kokoro.

Kokoro / MeloTTS / ChatTTS

These are exposed through command adapters for now. The command must write a 16-bit WAV file to {out_file}.

The Kokoro launcher defaults to the Chinese-enhanced hexgrad/Kokoro-82M-v1.1-zh model and voice zf_001. If local_server/local_tts_server/kokoro_models/Kokoro-82M-v1.1-zh exists, the launcher uses that local model directory before falling back to Hugging Face cache/download.

set LOCAL_TTS_KOKORO_MODEL_DIR=F:\models\Kokoro-82M-v1.1-zh
set LOCAL_TTS_KOKORO_REPO_ID=hexgrad/Kokoro-82M-v1.1-zh
set LOCAL_TTS_KOKORO_DEFAULT_VOICE=zf_001
set LOCAL_TTS_KOKORO_CMD=python F:\tts_wrappers\kokoro_cli.py "{text_file}" "{out_file}" "{voice}" {speed}
set LOCAL_TTS_MELOTTS_CMD=python F:\tts_wrappers\melotts_cli.py --text-file "{text_file}" --out "{out_file}" --voice "{voice}" --speed {speed}
set LOCAL_TTS_CHATTTS_CMD=python F:\tts_wrappers\chattts_cli.py --text-file "{text_file}" --out "{out_file}" --voice "{voice}" --speed {speed}

ChatTTS is AGPL-3.0. Keep it as an optional external backend unless the product licensing story is settled.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NEKO Local Lightweight TTS

Protocol

Start

Voice Selector

Kokoro / MeloTTS / ChatTTS

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

NEKO Local Lightweight TTS

Protocol

Start

Voice Selector

Kokoro / MeloTTS / ChatTTS