Repositories list
21 repositories
- Alive (Public)
- InfinityStar (Public): [NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
- .github (Public)
- UniTok (Public): [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Gen…
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
- Infinity (Public): [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
- Official training and inference code of bitwise tokenizer
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
- FlashVideo (Public): [AAAI-2026] FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
- UniRef (Public): [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
- flashvideo-page (Public)
- infinity.project (Public)
- GLEE (Public): [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
- LlamaGen (Public): Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
- OmniTokenizer (Public): [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
- vaex (Public)
- ByteTrack (Public): [ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
- Groma (Public): [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
- VNext (Public archive): Next-generation video instance recognition framework on top of Detectron2, which supports InstMove (CVPR 2023), SeqFormer (ECCV Oral), and IDOL (ECCV Oral)