New York Institute of Technology, SIAT CAS, Shanghai University, Central South University, Xi'an Jiaotong-Liverpool University, Huizhou Univeristy
In IEEE International Conference on Acoustics, Speech, and Signal Processing 2026 (ICASSP 2026)
The benchmark datasets are available at Kaggle.
You may download the dataset first, and then specify TRAIN_DIR, VAL_DIR and SAVE_DIR in the section TRAINING in config.yml.
For single GPU training:
python train.py
For multiple GPUs training:
accelerate config
accelerate launch train.py
If you have difficulties with the usage of accelerate, please refer to Accelerate.
Please first specify TRAIN_DIR, VAL_DIR and SAVE_DIR in section TESTING in config.yml.
python test.py