speech tools

These files consitute a set of files that can be used to run the analyses for cross-linguistic analysis of large speech model embeddings.

We currently support functionality for the following models and corpora

wav2vec 2.0 HuBERT (TODO: WavLM)

Hindi Commonvoice (13) AI for Bharat () Vaani ()

English Librispeech Wall Street Journal Corpus

Korean Seoul Corpus

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
__pycache__		__pycache__
corpora		corpora
testing		testing
README.md		README.md
extract_reps.py		extract_reps.py
extract_reps_finetuned_models.sh		extract_reps_finetuned_models.sh
l2-speech-tools.code-workspace		l2-speech-tools.code-workspace
model_utils.py		model_utils.py
run_extract_reps.sh		run_extract_reps.sh
train_classifiers.py		train_classifiers.py
train_classifiers.sh		train_classifiers.sh