|
| 1 | +.. _featured-community-checkpoints: |
| 2 | + |
| 3 | +Featured Community Checkpoints |
| 4 | +============================== |
| 5 | + |
| 6 | +Community fine-tunes built on NVIDIA NeMo ASR checkpoints and published on Hugging Face. |
| 7 | +For NVIDIA-published checkpoints, see :doc:`./asr_checkpoints` and the `NVIDIA Hugging Face organization <https://huggingface.co/nvidia>`__. |
| 8 | + |
| 9 | +.. note:: |
| 10 | + |
| 11 | + Community checkpoints are maintained by their authors, not by the NeMo team. |
| 12 | + Use each model's Hugging Face model card and the framework project linked below for up-to-date setup and inference instructions. |
| 13 | + |
| 14 | +.. list-table:: |
| 15 | + :header-rows: 1 |
| 16 | + :widths: 28 52 20 |
| 17 | + |
| 18 | + * - Checkpoint |
| 19 | + - What's special |
| 20 | + - Framework |
| 21 | + * - `akera/parakeet-tdt-salt <https://huggingface.co/akera/parakeet-tdt-salt>`__ |
| 22 | + - SALT multilingual ASR for 10 East African languages. Hybrid TDT+CTC FastConformer (600M), fine-tuned from `parakeet-tdt-0.6b-v3 <https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3>`__. |
| 23 | + - NeMo |
| 24 | + * - `johannhartmann/parakeet_de_med <https://huggingface.co/johannhartmann/parakeet_de_med>`__ |
| 25 | + - German medical documentation ASR (PEFT). WER 11.73% → 3.28% on a 122-sample medical eval set. |
| 26 | + - NeMo |
| 27 | + * - `qenneth/parakeet-tdt-0.6b-v3-finetuned-for-ATC <https://huggingface.co/qenneth/parakeet-tdt-0.6b-v3-finetuned-for-ATC>`__ |
| 28 | + - ATC English ASR on `jacktol/ATC-ASR-Dataset <https://huggingface.co/datasets/jacktol/ATC-ASR-Dataset>`__. Test WER 5.99%. |
| 29 | + - NeMo |
| 30 | + * - `KasuleTrevor/parakeet-0.6b-cv-sw-5hr_v9 <https://huggingface.co/KasuleTrevor/parakeet-0.6b-cv-sw-5hr_v9>`__ |
| 31 | + - Swahili ASR fine-tune on ~5 hours of Common Voice data. |
| 32 | + - NeMo |
| 33 | + * - `NeurologyAI/neuro-parakeet-mlx <https://huggingface.co/NeurologyAI/neuro-parakeet-mlx>`__ |
| 34 | + - German medical/neurology ASR for Apple Silicon. WER 1.04% on the author's medical validation set. |
| 35 | + - MLX |
| 36 | + * - `cstr/parakeet-tdt-0.6b-v3-GGUF <https://huggingface.co/cstr/parakeet-tdt-0.6b-v3-GGUF>`__ |
| 37 | + - Quantised Parakeet TDT (Q4_K ~467 MB). 25 EU languages, word-level timestamps. |
| 38 | + - GGUF (`CrispASR <https://github.com/CrispStrobe/CrispASR>`__) |
| 39 | + * - `cstr/canary-1b-v2-GGUF <https://huggingface.co/cstr/canary-1b-v2-GGUF>`__ |
| 40 | + - Quantised Canary 1B (Q4_K ~673 MB). Multilingual ASR and speech translation. |
| 41 | + - GGUF (`CrispASR <https://github.com/CrispStrobe/CrispASR>`__) |
| 42 | + |
| 43 | + |
| 44 | +.. _submit-a-community-checkpoint: |
| 45 | + |
| 46 | +Submit a Community Checkpoint |
| 47 | +----------------------------- |
| 48 | + |
| 49 | +To suggest a checkpoint for this page, open a `GitHub issue <https://github.com/NVIDIA-NeMo/NeMo/issues/new>`__ with the Hugging Face model link, NeMo base checkpoint, task, languages, evaluation results, and inference framework. |
0 commit comments