Change the repository type filter
All
Repositories list
93 repositories
VisionDirector
PublicMGM-Omni
PublicMGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech- VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning
Scaf-GRPO
PublicUnityVideo
PublicSearchGym
PublicTraveLLaMA
Public- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
DreamOmni3
PublicRePlan
PublicRePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image EditingSmartSwitch
PublicDreamOmni2
PublicThis project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation''VisionThink
PublicLSDBench
PublicA benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs. (ICC…Jenga
PublicVisionZip
PublicOfficial repository for VisionZip (CVPR 2025)- Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
- [ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
Video-P2P
PublicVideo-P2P: Video Editing with Cross-attention ControlRL-GPT
PublicMagicMirror
Public- This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral
ARPO
PublicMoTCoder
PublicThis is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.Open-Code-Zero
PublicLISA
PublicProject Page for "LISA: Reasoning Segmentation via Large Language Model"Step-DPO
PublicLyra
Public[ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"LBGAT
PublicLearnable Boundary Guided Adversarial Training (ICCV2021)