A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

Info

Our work on ACII-AVB 2022 Challenge, winner in the tasks of A-VB Two and A-VB Culture, second in the task of A-VB High.

Files

config/: configurations for data (data.yaml), model (model.yaml), training (train.yaml) and logger logger.yaml.
- train.lite configs training environment such as ddp.
- logger.wandb: W&B logger, initialize your API key at the first time if >0; will save model in <logger.dir>/wandb/latest-runn/files if >1.
- all of the configs of module, optimizer, iterator, callbacks can also be passed/overided through trainer using __ (recursively for callbacks).
filelists/: splitted filelists for train, validation and test
models/: nn modules, such as upstream, downstream, losses, etc.
trainer/: support wrappers of trainer with loggers and callbacks
utils/: data process, callbacks and metrics
cv.py: cross-validation
data_augment.py: data augmentation
dataset.py: data preparation
lite.py: training wrapper
run_exp.sh, nex_exp.sh: run a set of experiments
requirements.txt: auto generated by pipreqs . with no strict version specification
test.py: model evaluation
train.py: main training file with config of data, model, callbacks, etc.

Process

Setup environment (generated by pipreqs, python version is 3.9, recommend our Docker Image)

conda create -n pt python==3.9.12 pytorch==1.11.0 torchaudio==0.11.0 cudatoolkit -c pytorch -y -q # may need cuda version for `cudatoolkit` (nvcc --version)
conda activate pt
pip install -q -r requirements.txt &
echo "export PYTHONPATH=${PYTHONPATH}:$(pwd)" >> ~/.<shell>rc  # add the path of the workspace
source ~/.<shell>rc  # update shell environment

Trim silence in wav files

python3 utils/preprocess.py --src_dir /path/to/wav --tgt_dir /path/to/output/dir

Create filelists

python3 utils/create_splits.py --data_dir=/path/to/data --save_path=./filelists

Training

Run the following cmd to train the model.

python3 train.py
# pkill -f train.py (if stucked)

This will train the model with default setting using the model in models.ssl_trans.MTL. If you want to train other models or modify the parameters, please refer the config files under the config dir.

Cross-validation

python3 cv.py

This will run cross-validation with default setting.

Reference

The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression Code
Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations Code
Environment supported by LightningLite
Trainer modified from skorch
Config supported by hydra_core

Authors

Please give me a 🌟 if this repository helps you 🤗

If you have any questions, please feel free to issue or contact me (Jinchao).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

Info

Files

Process

Training

Cross-validation

Reference

Authors

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
config		config
filelists		filelists
models		models
tmp		tmp
trainer		trainer
utils		utils
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
cv.py		cv.py
data_augment.py		data_augment.py
dataset.py		dataset.py
exp.ipynb		exp.ipynb
lite.py		lite.py
next_exp.sh		next_exp.sh
requirements.txt		requirements.txt
run_exp.sh		run_exp.sh
test.py		test.py
train.py		train.py
wandb_download.py		wandb_download.py

JinchaoLove/AffectiveVocalBurstRecognition

Folders and files

Latest commit

History

Repository files navigation

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

Info

Files

Process

Training

Cross-validation

Reference

Authors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages