#
video-large-language-models
Here are
11 public repositories
matching this topic...
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
Updated
Mar 10, 2025
Jupyter Notebook
Awesome papers & datasets specifically focused on long-term videos.
✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Updated
Sep 24, 2025
Python
[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
Updated
Aug 22, 2025
Python
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Updated
Dec 10, 2024
Python
This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"
Updated
Apr 28, 2025
Python
[NeurIPS'25] HoliTom: Holistic Token Merging for Fast Video Large Language Models
Updated
Oct 10, 2025
Python
🚀 Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
Updated
Oct 12, 2025
Python
[CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"
Updated
Oct 13, 2025
Python
This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3
Updated
Oct 8, 2025
Python
[ICCV 2025] Streaming VideoLLMs for Real-time Procedural Video Understanding
Updated
Oct 13, 2025
Python
Improve this page
Add a description, image, and links to the
video-large-language-models
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
video-large-language-models
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.