Our work on ACII-AVB 2022 Challenge, winner in the tasks of A-VB Two and A-VB Culture, second in the task of A-VB High.
config/: configurations for data (data.yaml), model (model.yaml), training (train.yaml) and loggerlogger.yaml.train.liteconfigs training environment such asddp.logger.wandb: W&B logger, initialize your API key at the first time if>0; will save model in<logger.dir>/wandb/latest-runn/filesif>1.- all of the configs of
module,optimizer,iterator,callbackscan also be passed/overided throughtrainerusing__(recursively forcallbacks).
filelists/: splitted filelists for train, validation and testmodels/: nn modules, such as upstream, downstream, losses, etc.trainer/: support wrappers of trainer with loggers and callbacksutils/: data process, callbacks and metricscv.py: cross-validationdata_augment.py: data augmentationdataset.py: data preparationlite.py: training wrapperrun_exp.sh,nex_exp.sh: run a set of experimentsrequirements.txt: auto generated bypipreqs .with no strict version specificationtest.py: model evaluationtrain.py: main training file with config of data, model, callbacks, etc.
- Setup environment (generated by pipreqs, python version is 3.9, recommend our Docker Image)
conda create -n pt python==3.9.12 pytorch==1.11.0 torchaudio==0.11.0 cudatoolkit -c pytorch -y -q # may need cuda version for `cudatoolkit` (nvcc --version)
conda activate pt
pip install -q -r requirements.txt &
echo "export PYTHONPATH=${PYTHONPATH}:$(pwd)" >> ~/.<shell>rc # add the path of the workspace
source ~/.<shell>rc # update shell environment- Trim silence in wav files
python3 utils/preprocess.py --src_dir /path/to/wav --tgt_dir /path/to/output/dir- Create filelists
python3 utils/create_splits.py --data_dir=/path/to/data --save_path=./filelistsRun the following cmd to train the model.
python3 train.py
# pkill -f train.py (if stucked)This will train the model with default setting using the model in models.ssl_trans.MTL. If you want to train other models or modify the parameters, please refer the config files under the config dir.
python3 cv.pyThis will run cross-validation with default setting.
- The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression Code
- Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations Code
- Environment supported by LightningLite
- Trainer modified from skorch
- Config supported by hydra_core
Please give me a 🌟 if this repository helps you 🤗
If you have any questions, please feel free to issue or contact me (Jinchao).