Commit 50ecb05

adding community featured models (#15744)
* adding community featured models

Signed-off-by: Ssofja <sofiakostandian@gmail.com>

* Change the structure of the page based on comment

Signed-off-by: Ssofja <sofiakostandian@gmail.com>

---------

Signed-off-by: Ssofja <sofiakostandian@gmail.com>
1 parent 0d85884 commit 50ecb05

4 files changed

Lines changed: 53 additions & 0 deletions

docs/source/asr/asr_checkpoints.rst

Lines changed: 2 additions & 0 deletions
@@ -7,6 +7,8 @@ ASR Model Checkpoints
 This page lists all supported ASR model checkpoints released by NVIDIA NeMo.
 Benchmark scores for each model can be found on its `HuggingFace model card <https://huggingface.co/nvidia>`__.

+For community fine-tunes built on these checkpoints, see :doc:`Featured Community Checkpoints <./featured_community_checkpoints>`.
+
 Glossary
 --------

Lines changed: 49 additions & 0 deletions
@@ -0,0 +1,49 @@
+.. _featured-community-checkpoints:
+
+Featured Community Checkpoints
+==============================
+
+Community fine-tunes built on NVIDIA NeMo ASR checkpoints and published on Hugging Face.
+For NVIDIA-published checkpoints, see :doc:`./asr_checkpoints` and the `NVIDIA Hugging Face organization <https://huggingface.co/nvidia>`__.
+
+.. note::
+
+   Community checkpoints are maintained by their authors, not by the NeMo team.
+   Use each model's Hugging Face model card and the framework project linked below for up-to-date setup and inference instructions.
+
+.. list-table::
+   :header-rows: 1
+   :widths: 28 52 20
+
+   * - Checkpoint
+     - What's special
+     - Framework
+   * - `akera/parakeet-tdt-salt <https://huggingface.co/akera/parakeet-tdt-salt>`__
+     - SALT multilingual ASR for 10 East African languages. Hybrid TDT+CTC FastConformer (600M), fine-tuned from `parakeet-tdt-0.6b-v3 <https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3>`__.
+     - NeMo
+   * - `johannhartmann/parakeet_de_med <https://huggingface.co/johannhartmann/parakeet_de_med>`__
+     - German medical documentation ASR (PEFT). WER 11.73% → 3.28% on a 122-sample medical eval set.
+     - NeMo
+   * - `qenneth/parakeet-tdt-0.6b-v3-finetuned-for-ATC <https://huggingface.co/qenneth/parakeet-tdt-0.6b-v3-finetuned-for-ATC>`__
+     - ATC English ASR on `jacktol/ATC-ASR-Dataset <https://huggingface.co/datasets/jacktol/ATC-ASR-Dataset>`__. Test WER 5.99%.
+     - NeMo
+   * - `KasuleTrevor/parakeet-0.6b-cv-sw-5hr_v9 <https://huggingface.co/KasuleTrevor/parakeet-0.6b-cv-sw-5hr_v9>`__
+     - Swahili ASR fine-tune on ~5 hours of Common Voice data.
+     - NeMo
+   * - `NeurologyAI/neuro-parakeet-mlx <https://huggingface.co/NeurologyAI/neuro-parakeet-mlx>`__
+     - German medical/neurology ASR for Apple Silicon. WER 1.04% on the author's medical validation set.
+     - MLX
+   * - `cstr/parakeet-tdt-0.6b-v3-GGUF <https://huggingface.co/cstr/parakeet-tdt-0.6b-v3-GGUF>`__
+     - Quantised Parakeet TDT (Q4_K ~467 MB). 25 EU languages, word-level timestamps.
+     - GGUF (`CrispASR <https://github.com/CrispStrobe/CrispASR>`__)
+   * - `cstr/canary-1b-v2-GGUF <https://huggingface.co/cstr/canary-1b-v2-GGUF>`__
+     - Quantised Canary 1B (Q4_K ~673 MB). Multilingual ASR and speech translation.
+     - GGUF (`CrispASR <https://github.com/CrispStrobe/CrispASR>`__)
+
+
+.. _submit-a-community-checkpoint:
+
+Submit a Community Checkpoint
+-----------------------------
+
+To suggest a checkpoint for this page, open a `GitHub issue <https://github.com/NVIDIA-NeMo/NeMo/issues/new>`__ with the Hugging Face model link, NeMo base checkpoint, task, languages, evaluation results, and inference framework.
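
As a rough illustration of the submission fields listed above, the following Python sketch builds a prefilled GitHub issue link. It is not part of this commit. The helper name `build_submission_url`, the issue title, and the placeholder values are all hypothetical; only the issue URL and the list of requested fields come from the page. The sketch relies on GitHub accepting `title` and `body` query parameters on new-issue URLs.

```python
from urllib.parse import urlencode

# Issue URL taken from the page; everything else here is illustrative.
NEW_ISSUE_URL = "https://github.com/NVIDIA-NeMo/NeMo/issues/new"

# Fields the page asks submitters to provide, in the order listed.
REQUIRED_FIELDS = (
    "Hugging Face model link",
    "NeMo base checkpoint",
    "Task",
    "Languages",
    "Evaluation results",
    "Inference framework",
)


def build_submission_url(fields: dict[str, str]) -> str:
    """Return a new-issue URL prefilled with the requested submission fields.

    Hypothetical helper: raises ValueError if any requested field is missing
    or blank, so an incomplete submission is caught before the issue is opened.
    """
    missing = [name for name in REQUIRED_FIELDS if not fields.get(name, "").strip()]
    if missing:
        raise ValueError(f"missing fields: {', '.join(missing)}")

    # One bullet per field, in the same order as the page lists them.
    body = "\n".join(f"- {name}: {fields[name]}" for name in REQUIRED_FIELDS)
    title = "Community checkpoint suggestion"  # assumed title, not prescribed by the page
    return f"{NEW_ISSUE_URL}?{urlencode({'title': title, 'body': body})}"


if __name__ == "__main__":
    # Placeholder values only; replace each with the details of your own model.
    example = {
        "Hugging Face model link": "https://huggingface.co/<user>/<model>",
        "NeMo base checkpoint": "<base checkpoint name>",
        "Task": "<task>",
        "Languages": "<languages>",
        "Evaluation results": "<metric, value, and eval set>",
        "Inference framework": "<framework>",
    }
    print(build_submission_url(example))
```

Validating the fields before building the link keeps the issue aligned with what the page requests; the maintainers may of course ask for a different layout.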

docs/source/asr/intro.rst

Lines changed: 1 addition & 0 deletions
@@ -72,3 +72,4 @@ Further Reading
    asr_language_modeling_and_customization
    configs
    api
+   featured_community_checkpoints

docs/source/starthere/choosing_a_model.rst

Lines changed: 1 addition & 0 deletions
@@ -132,6 +132,7 @@ All pretrained NeMo models are available on:

 - `HuggingFace Hub (nvidia) <https://huggingface.co/nvidia>`_ — search for "nemo" or specific model names
 - `NGC Model Catalog <https://catalog.ngc.nvidia.com/models?query=nemo&orderBy=weightPopularDESC>`_ — NVIDIA's model registry
+- :doc:`Featured Community Checkpoints </asr/featured_community_checkpoints>` — fine-tunes from external users

 See :doc:`../checkpoints/intro` for instructions on loading pretrained models.
