VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

OpenGVLab/VideoChat-R1

🔥 Updates

  • 2025/04/14: 🔥🔥🔥 We released VideoChat-R1 and VideoChat-R1-thinking on Hugging Face.
  • 2025/04/10: 🔥🔥🔥 We released our paper and code.

🦜 Introduction

Demo & Inference

Refer to the Hugging Face (hf) README for instructions on running inference with our models.

Evaluation

See eval_scripts.

Training

See training_scripts.

📄 Citation

If you find this project useful in your research, please consider citing:

@article{li2025videochatr1,
  title={VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning},
  author={Li, Xinhao and Yan, Ziang and Meng, Desen and Dong, Lu and Zeng, Xiangyu and He, Yinan and Wang, Yali and Qiao, Yu and Wang, Yi and Wang, Limin},
  journal={arXiv preprint arXiv:2504.06958},
  year={2025}
}
