Minimal example on how to use pre-trained ASR models for audio transcriptions on a laptop (without GPUs) #3553
okuchaiev
started this conversation in
Show and tell
Replies: 2 comments
-
This is dope |
Beta Was this translation helpful? Give feedback.
0 replies
-
I had to comment again, to say thank you for this awesome snippet! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
NeMo is a toolkit for training and fine-tuning Conversational AI models. However, sometimes, for quick prototyping purposes many of the pre-trained NeMo models can be used directly from NeMo on CPUs. For real production deployment of NeMo ASR models we recommend NVIDIA Riva.
The below steps will work on (Intel) MacBook without NVIDIA GPU:
In your terminal, first install Anaconda and then perform the following steps to install NeMo and its dependencies:
conda create -n cputest python=3.8 conda activate cputest pip install nemo_toolkit['all']==1.6.2
Get a sample audio file. You can use your own, just make sure it is Mono and sampled at 16Khz.
Start a Python shell and do:
TIP: to see a list of available pre-trained ASR models, in a Python shell do:
Beta Was this translation helpful? Give feedback.
All reactions