Skip to content

UCF-CRCV/VRR-QA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Official repository for VRR-QA, CVPR 2026 Paper

Install

conda create -n implicit python==3.9 -y
conda activate implicit
pip install torch torchvision torchaudio
mamba install ffmpeg -c conda-forge
ffmpeg -decoders | grep -i nvidia
pip install torchcodec --index-url=https://download.pytorch.org/whl/cu124
pip install vllm
pip install pandas

Evaluation

python eval.py --input-dir <path to data> --model-type <model_type> --num-frames 16

Cite

If you use this code in your research, please cite:

@InProceedings{Swetha_2026_CVPR,
    author    = {Swetha, Sirnam and Gupta, Rohit and Kulkarni, Parth Parag and Shatwell, David G and A Chan Santiago, Jeffrey and Siddiqui, Nyle and Fioresi, Joseph and Shah, Mubarak},
    title     = {VRR-QA: Visual Relational Reasoning in Videos Beyond Explicit Cues},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2026},
    pages     = {32840-32849}
}

Contact

If you have any problems with the code or have a question, please open an issue or email swetha(dot)sirnam at ucf.edu. I'll try to answer as soon as possible.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages