Video Search with V-JEPA 2 and LanceDB

V-JEPA 2 is a self-supervised video model designed to improve AI's understanding, prediction, and planning capabilities in real-world environments. It is first pre-trained on over one million hours of internet video using a mask-denoising objective in representation space, and it demonstrates state-of-the-art performance in video understanding and human action anticipation.

Subsequently, an action-conditioned variant, V-JEPA 2-AC, is fine-tuned with a limited amount of robot interaction data, enabling zero-shot robotic planning for tasks like object manipulation. The research also highlights V-JEPA 2's effectiveness when integrated with a large language model for video question-answering tasks, achieving strong results.
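
To make the "video search" idea in the title concrete, here is a minimal sketch of embedding-based clip retrieval. It rests on several assumptions: the three-dimensional vectors are made-up stand-ins for real V-JEPA 2 clip embeddings, the brute-force cosine ranking stands in for the vector search that LanceDB would perform, and the names `cosine_similarity`, `search_clips`, and `clip_index` are hypothetical and not taken from this repository.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat zero vectors as matching nothing
    return dot / (norm_a * norm_b)


def search_clips(query_embedding, clip_index, top_k=2):
    """Rank clips by similarity to the query and return the top_k (id, score) pairs."""
    scored = [
        (clip_id, cosine_similarity(query_embedding, embedding))
        for clip_id, embedding in clip_index.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Toy index: clip id -> embedding (assumed values, not real model output).
clip_index = {
    "clip_000": [0.9, 0.1, 0.0],
    "clip_001": [0.0, 1.0, 0.2],
    "clip_002": [0.7, 0.3, 0.1],
}

query = [1.0, 0.0, 0.0]  # embedding of the query clip (also assumed)
results = search_clips(query, clip_index, top_k=2)
for clip_id, score in results:
    print(clip_id, round(score, 3))
```

In a real pipeline the embeddings would come from the video encoder and the ranking would be delegated to a vector database rather than computed in a Python loop; the sketch only illustrates the nearest-neighbor step.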
