Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
asr_paradigm_meta.pdf		asr_paradigm_meta.pdf
asr_tf.ipynb		asr_tf.ipynb
asr_torch.ipynb		asr_torch.ipynb
image-1.png		image-1.png
image.png		image.png
requirements.txt		requirements.txt
wav2vec2.pdf		wav2vec2.pdf

README.md

speech

repo for gripping speech analysis problem

questions

How to define a speech analysis problem?
What are some ML models/ algorithms (SOTA?) for speech analysis?
What are the basic tools for speech analysis?

answers

Speech analysis - the process of extracting meaningful information from speech signals
Basic tools of speech analysis (in Python) are listed in references
ML models - how to find them? ASR @ huggingface link,

problem

resources

data

VOiCES dataset link

tools

SpeechRecognition
Librosa
Deepgram
lots of APIs like Whisper API, Google Cloud Speech API, IBM Watson Speech to Text API, etc.
espnet (kinda cool, seems like E2E toolkit)
wav2vec2

metrics

WER - Word Error Rate

pipeline

wav2vec2 -> ... (?)
(general) waveform -> vectors (?) -> features -> model -> output