EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning (Neurocomputing)
Ao Li*, Longwei Xu*, Chen Ling, Jinghui Zhang, Pengwei Wang†
- *Equal contribution
- †Corresponding author
we construct the Affective Multitask (AMT) dataset, which includes multimodal sentiment analysis (MSA), multimodal emotion recognition (MER), facial expression recognition (FER), emotion reasoning inference (ERI), and emotion cause-pair extraction (ECPE) tasks.
We provide the training data for both stages of EmoVerse, which can be found in the data folder.
- Multitask Pretraining: The files are based on randomly sampled tasks from the MOSEI dataset.
- Multitask Reason Fine-tuning: The files are based on mixed sampling from three different datasets and various tasks.
Please donwload raw video:
- For the MOSEI dataset, download the video from here.
- For the MELD dataset, download the video from here.
- For the ECF2.0 dataset, download the dataset from here.
We develop the EmoVerse model. EmoVerse unifies tasks in the sentiment and emotion domains by leveraging the M2SE strategy.
The fine-tuning framework for EmoVerse based on ms-swift: ms-swift.
conda create -n emoverse python=3.9
conda activate emoverse
pip install 'ms-swift[all]' -U
EmoVerse is fine-tuned based on Internvl2, download Internvl2 weights at here. Once the checkpoint is downloaded, place it in your own directory and make sure to update the model path in the corresponding .sh file accordingly.
bash first_stage.sh
bash second_stage.sh
Once inference is completed using ms-swift, the corresponding inference files will be generated. You can then calculate the accuracy by using the functions in compute_result.py.
bash test.sh
python compute_result.py
@article{li2025emoverse,
title={EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning},
author={Li, Ao and Xu, Longwei and Ling, Chen and Zhang, Jinghui and Wang, Pengwei},
journal={Neurocomputing},
volume={650},
pages={130810},
year={2025},
publisher={Elsevier}
}

