NVlabs/Wolf

Wolf 🐺: Dense Captioning with a World Summarization Framework

This repo contains the official implementation and dataset (including annotations) of Wolf 🐺: Dense Captioning with a World Summarization Framework.

Wolf Benchmark Overview

| Dataset | Number | Download | Annotations |
| --- | --- | --- | --- |
| Normal Driving Scenes | 4785 | wolf_driving_index_file.json | Normal Nuscenes_annotations.json |
| Challenging Driving Scenes | 500 | wolf_driving_index_file.json | Interactive Nuscenes_annotations.json |
| General Daily Scenes | 473 | link | Pexels_annotations.json |
| Robot Manipulation | 100 | link | robotics_annotations.json |

Note: please download the driving dataset from the official nuScenes webpage, using our provided index file wolf_driving_index_file.json to select the scenes.

CapScore: the Wolf metric for evaluating caption quality

CapScore is a quantitative metric that uses an LLM (GPT-4) to evaluate the similarity between predicted and human-annotated (ground-truth) captions. Edit the script to match your setup, then run the command below to obtain your results.

python get_capscore.py
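
For readers who want a feel for how an LLM-based similarity score can be computed before opening the script, here is a minimal sketch. It is not the code in get_capscore.py: the prompt wording, the 0 to 1 scale, and the names `build_prompt`, `parse_score`, `caption_similarity`, `mean_similarity`, and `ask_llm` are all assumptions made for illustration. `ask_llm` is any function you supply that sends a prompt to your LLM (for example GPT-4) and returns its text reply.

```python
import re
from typing import Callable, Iterable, Tuple

# Hypothetical prompt; the real script may word this differently.
PROMPT_TEMPLATE = (
    "Compare the predicted caption with the ground-truth caption.\n"
    "Reply with a single similarity score between 0 and 1.\n"
    "Ground truth: {reference}\n"
    "Prediction: {prediction}\n"
)


def build_prompt(prediction: str, reference: str) -> str:
    """Fill the (assumed) prompt template with one caption pair."""
    return PROMPT_TEMPLATE.format(reference=reference, prediction=prediction)


def parse_score(reply: str) -> float:
    """Extract the first number in the LLM reply and clamp it to [0, 1]."""
    match = re.search(r"\d+(?:\.\d+)?", reply)
    if match is None:
        raise ValueError(f"no numeric score in reply: {reply!r}")
    return min(max(float(match.group()), 0.0), 1.0)


def caption_similarity(
    prediction: str, reference: str, ask_llm: Callable[[str], str]
) -> float:
    """Score one predicted caption against its ground-truth caption."""
    return parse_score(ask_llm(build_prompt(prediction, reference)))


def mean_similarity(
    pairs: Iterable[Tuple[str, str]], ask_llm: Callable[[str], str]
) -> float:
    """Average the score over (prediction, reference) pairs."""
    scores = [caption_similarity(p, r, ask_llm) for p, r in pairs]
    if not scores:
        raise ValueError("no caption pairs given")
    return sum(scores) / len(scores)
```

To try the sketch without an API key, pass a stub such as `lambda prompt: "Score: 0.8"` as `ask_llm`; in real use, replace it with a call to your own LLM client.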

Running

Edit the script to match your setup, then run the command below to obtain your results.

python get_wolfcaps.py

If you find this repo useful, please cite:

@article{li2024wolf,
  title={Wolf: Captioning everything with a world summarization framework},
  author={Li, Boyi and Zhu, Ligeng and Tian, Ran and Tan, Shuhan and Chen, Yuxiao and Lu, Yao and Cui, Yin and Veer, Sushant and Ehrlich, Max and Philion, Jonah and others},
  journal={arXiv preprint arXiv:2407.18908},
  year={2024}
}
