Pinned Loading
-
FoundationVision/VAR
FoundationVision/VAR Public[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
-
FoundationVision/Waver
FoundationVision/Waver PublicIndustry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
-
FoundationVision/Infinity
FoundationVision/Infinity Public[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
-
FoundationVision/Liquid
FoundationVision/Liquid Public(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
-
FoundationVision/Groma
FoundationVision/Groma Public[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
-
FoundationVision/UniTok
FoundationVision/UniTok Public[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
If the problem persists, check the GitHub status page or contact support.


