Skip to content

Latest commit

 

History

History

README.md

speech

repo for gripping speech analysis problem

questions

  • How to define a speech analysis problem?
  • What are some ML models/ algorithms (SOTA?) for speech analysis?
  • What are the basic tools for speech analysis?

answers

  • Speech analysis - the process of extracting meaningful information from speech signals
  • Basic tools of speech analysis (in Python) are listed in references
  • ML models - how to find them? ASR @ huggingface link,

problem

Alt text

resources

data

  • VOiCES dataset link

tools

  • SpeechRecognition
  • Librosa
  • Deepgram
  • lots of APIs like Whisper API, Google Cloud Speech API, IBM Watson Speech to Text API, etc.
  • espnet (kinda cool, seems like E2E toolkit)
  • wav2vec2

metrics

  • WER - Word Error Rate

Alt text

pipeline

  • wav2vec2 -> ... (?)
  • (general) waveform -> vectors (?) -> features -> model -> output