EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning (Neurocomputing)

Ao Li*, Longwei Xu*, Chen Ling, Jinghui Zhang, Pengwei Wang

  • *Equal contribution
  • †Corresponding author

Affective Multitask Dataset (AMT Dataset)

We construct the Affective Multitask (AMT) dataset, which covers five tasks: multimodal sentiment analysis (MSA), multimodal emotion recognition (MER), facial expression recognition (FER), emotion reasoning inference (ERI), and emotion cause-pair extraction (ECPE).

We provide the training data for both stages of EmoVerse, which can be found in the data folder.

  • Multitask Pretraining: the files are built by randomly sampling tasks from the MOSEI dataset.
  • Multitask Reason Fine-tuning: the files are built by mixed sampling across three datasets and multiple tasks.
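
The mixed-sampling step described above can be sketched as a small routine that draws from several per-dataset files. This is an illustrative sketch only, not the authors' actual data pipeline; the JSONL layout, function name, and uniform sampling ratio are all assumptions.

```python
import json
import random

def mix_samples(dataset_files, n_total, seed=0):
    """Draw a task-mixed training set from several per-dataset JSONL files.

    dataset_files: paths to JSONL files, one training sample per line.
    n_total: total number of samples in the mixed set.
    """
    rng = random.Random(seed)
    # Load each dataset into its own pool.
    pools = []
    for path in dataset_files:
        with open(path) as f:
            pools.append([json.loads(line) for line in f])
    # Pick a dataset uniformly at random, then a sample within it.
    mixed = []
    for _ in range(n_total):
        pool = rng.choice(pools)
        mixed.append(rng.choice(pool))
    rng.shuffle(mixed)
    return mixed
```

In practice the per-dataset sampling weights could be tuned rather than uniform; the point is only that each mixed batch interleaves datasets and tasks.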

Please download the raw videos:

  • For the MOSEI dataset, download the video from here.
  • For the MELD dataset, download the video from here.
  • For the ECF2.0 dataset, download the dataset from here.

EmoVerse

We develop the EmoVerse model. EmoVerse unifies tasks in the sentiment and emotion domains by leveraging the M2SE strategy.

Environment

The fine-tuning framework for EmoVerse is based on ms-swift.

conda create -n emoverse python=3.9
conda activate emoverse
pip install 'ms-swift[all]' -U

Checkpoint

EmoVerse is fine-tuned from InternVL2; download the InternVL2 weights here. Once the checkpoint is downloaded, place it in a directory of your choice and update the model path in the corresponding .sh file accordingly.

Multitask Pretraining

bash first_stage.sh

Multitask Reason Fine-tuning

bash second_stage.sh

Evaluation

Once inference has completed via ms-swift, the corresponding inference files are generated. You can then compute accuracy with the functions in compute_result.py.

bash test.sh
python compute_result.py
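
As a rough illustration of what such a scoring step looks like, here is a minimal accuracy computation over a JSONL inference file. The field names (`response`, `label`) and file layout are assumptions; the actual logic lives in compute_result.py and the real ms-swift output format may differ.

```python
import json

def accuracy_from_jsonl(path, pred_key="response", gold_key="label"):
    """Fraction of inference records whose prediction matches the gold label.

    Assumes one JSON object per line with prediction and gold-answer fields
    (hypothetical keys; adjust to the actual inference output schema).
    """
    correct = total = 0
    with open(path) as f:
        for line in f:
            rec = json.loads(line)
            pred = str(rec[pred_key]).strip().lower()
            gold = str(rec[gold_key]).strip().lower()
            correct += pred == gold
            total += 1
    return correct / total if total else 0.0
```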

Cite us

@article{li2025emoverse,
  title={EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning},
  author={Li, Ao and Xu, Longwei and Ling, Chen and Zhang, Jinghui and Wang, Pengwei},
  journal={Neurocomputing},
  volume={650},
  pages={130810},
  year={2025},
  publisher={Elsevier}
}
