@@ -39,18 +39,16 @@ hardware-optimized one at first run after a system probe.
3939| ` flm ` | ` npu ` | XDNA2 NPU | Cannot be packaging-time bundled on Linux. |
4040| ` ryzenai-llm ` | ` npu ` | XDNA2 NPU | Windows only. Best for the Hybrid model family. |
4141
42- ### Speech-to-text ( ` whispercpp ` )
42+ ### Speech-to-text
4343
44- | Backend | Hardware | OS |
45- | ---| ---| ---|
46- | ` npu ` | XDNA2 NPU | Windows |
47- | ` vulkan ` | x86_64 CPU | Linux |
48- | ` cpu ` | x86_64 CPU | Windows, Linux |
49-
50- ### Speech-to-text on NPU (` flm ` recipe)
44+ Two NPU paths exist. ** Prefer ` flm ` for NPU** .
5145
52- For NPU-accelerated transcription on Ryzen AI, use the ` flm ` (FastFlowLM)
53- recipe with ` whisper-v3-turbo-FLM ` . XDNA2 NPU, Windows only.
46+ | Recipe | Backend | Model | Hardware | OS |
47+ | ---| ---| ---| ---| ---|
48+ | ` flm ` | ` npu ` | ` whisper-v3-turbo-FLM ` | XDNA2 NPU | Windows |
49+ | ` whispercpp ` | ` cpu ` | ` Whisper-Large-v3-Turbo ` | x86_64 CPU | Windows, Linux |
50+ | ` whispercpp ` | ` vulkan ` | ` Whisper-Large-v3-Turbo ` | x86_64 CPU | Linux |
51+ | ` whispercpp ` | ` npu ` | ` .rai ` -cached whisper model | XDNA2 NPU | Windows (avoid) |
5452
5553### Text-to-speech
5654
@@ -199,7 +197,7 @@ hand-editing `config.json`, or at runtime via `POST /internal/set`.
199197| ` llamacpp_backend ` | string | Pin to ` rocm ` / ` vulkan ` / ` cpu ` / ` metal ` ; leave unset for auto |
200198| ` llamacpp_args ` | string | Raw args appended to ` llama-server ` |
201199| ` sdcpp_backend ` | string | ` rocm ` / ` cpu ` |
202- | ` whispercpp_backend ` | string | ` npu ` / ` vulkan ` / ` cpu ` |
200+ | ` whispercpp_backend ` | string | ` npu ` / ` cpu ` (Windows), ` cpu ` / ` vulkan ` (Linux). For NPU prefer the ` flm ` recipe instead |
203201| ` whispercpp_args ` | string | Raw whisper.cpp args |
204202| ` flm_args ` | string | Raw FastFlowLM args |
205203| ` steps ` | int | SD step count |
0 commit comments