Skip to content

Commit 128b7cd

Browse files
add demo instruction
1 parent e195785 commit 128b7cd

File tree

1 file changed

+12
-1
lines changed

1 file changed

+12
-1
lines changed

README.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ Homepage: https://bytedance.github.io/vidi-website/
55
> We introduce Vidi, a family of Large Multimodal Models (LMMs) for a wide range of video understanding and editing (VUE) scenarios. The first release focuses on temporal retrieval (TR), i.e., identifying the time ranges in input videos corresponding to a given text query.
66
77
## Release
8-
- [06/06/2025] 🔥 Vidi-7B demo released at https://vidi.byteintl.com/.
8+
- [06/06/2025] 🔥 Vidi-7B demo released at https://vidi.byteintl.com/. Follow the instructions in the [demo](#demo) section to run the demo.
99
- [04/21/2025] 🔥 The first release of Vidi consists of tech report and the VUE-TR evaluation benchmark. The 7B model demo and weights are coming soon.
1010

1111
## Content
@@ -14,6 +14,17 @@ Homepage: https://bytedance.github.io/vidi-website/
1414
- [Demo](https://vidi.byteintl.com/)
1515
- [ ] Vidi-7B Weight
1616

17+
## Demo
18+
1. Click "Choose File" button and find a video local file (better in mp4 format). Click the "Upload" button.
19+
20+
Video files could contain corrupted frames which causes errors for video loading, it is recommended to use the following command to transcode the video file before uploading if the demo raises an error:
21+
```
22+
ffmpeg -i {vpath_in} -vf scale=480:-2 -c:v libx264 -c:a copy -preset ultrafast {vpath_out} -y
23+
```
24+
2. After the video is uploaded, wait till the video is ready to play in the "Input Video" box.
25+
3. Enter the text query in the "Input Query". Click the "Run Time Retrieval" button.
26+
4. Wait till the result clips show in the "Output Clips" box. This could take several minutes for long video.
27+
1728
## Installation
1829
Run the [install.sh](install.sh).
1930

0 commit comments

Comments
 (0)