Skip to content

Latest commit

 

History

History
60 lines (34 loc) · 2.43 KB

README.md

File metadata and controls

60 lines (34 loc) · 2.43 KB

🔥 Updates

  • 2025/04/14:🔥🔥🔥 We release our VideoChat-R1 and VideoChat-R1-thinking at Huggingface.
  • 2025/04/10:🔥🔥🔥 We release our paper and code.

🦜 Introduction

alt text

Demo & Inference

Refer to hf README to inference our model.

Evaluation

See eval_scripts.

Training

See training_scripts.

📄 Citation

If you find this project useful in your research, please consider cite:

@article{li2025videochatr1,
  title={VideoChat-R1: Enhancing Spatio-Temporal
Perception via Reinforcement Fine-Tuning},
  author={Li, Xinhao and Yan, Ziang and Meng, Desen and Dong, Lu and Zeng, Xiangyu and He, Yinan and Wang, Yali and Qiao, Yu and Wang, Yi and Wang, Limin},
  journal={arXiv preprint arXiv:2504.06958},
  year={2025}
}