Jingyi Zhang1,2*, Tianyi Lin3*, Huanjin Yao4, Xiang Lan5, Shunyu Liu2, Jiaxing Huang1✉️
1Hong Kong Polytechnic University, 2Nanyang Technological Univeristy, 3Tsinghua University, 4ByteDance, 5National University of Singapore
*Equal Contribution, ✉️Corresponding Author
-
April 30, 2026.R1-SyntheticVL is accepted by ICML 2026! 🎉 -
Feb 03, 2026.We release our paper on arxiv.
Code, model and data will be released soon ...
We conduct experiments with base model Qwen2.5-VL-7B.
Our work is built on the following codebases, and we are deeply grateful for their contributions.
We appreciate your citations if you find our paper related and useful to your research!
@article{zhang2026r1,
title={R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model?},
author={Zhang, Jingyi and Lin, Tianyi and Yao, Huanjin and Lan, Xiang and Liu, Shunyu and Huang, Jiaxing},
journal={arXiv preprint arXiv:2602.03300},
year={2026}
}

