ARC Lab, Tencent PCG

All

82 repositories

GenCompositor
Public
[ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer
video-editing diffusion-models diffusion-transformer
video-editing diffusion-models diffusion-transformer
Python
•
Other
•6•151•3•0•Updated Mar 16, 2026Mar 16, 2026
MotionCrafter
Public
[CVPR 2026] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
video geometry dynamic
video geometry dynamic motion vae 3d 3d-reconstruction diffusion motion-estimation 4d
Python
•
Other
•4•139•1•0•Updated Mar 13, 2026Mar 13, 2026
TimeLens
Public
[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Python
•
Other
•9•117•8•0•Updated Mar 12, 2026Mar 12, 2026
Track4World
Public
Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels
3d pointcloud pi3
3d pointcloud pi3 3dreconstruction depthanything vggt 3dtracking moge-2
Python
•
Other
•18•188•2•0•Updated Mar 11, 2026Mar 11, 2026
CubeComposer
Public
[CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
Python
•
Other
•10•95•1•0•Updated Mar 5, 2026Mar 5, 2026
VerseCrafter
Public
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
world-model
world-model
Python
•
Other
•26•338•8•0•Updated Feb 26, 2026Feb 26, 2026
DSR_Suite
Public
Jupyter Notebook
•
Apache License 2.0
•7•68•1•0•Updated Feb 23, 2026Feb 23, 2026
ColorFlow
Public
The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization". ColorFlow：基于检索增强的图像序列上色
computer-vision image-colorization colorization
computer-vision image-colorization colorization automatic-colorization
Python
•
Other
•39•458•15•0•Updated Dec 10, 2025Dec 10, 2025
SEED-Voken
Public
SEED-Voken: A Series of Powerful Visual Tokenizers
Python
•
Apache License 2.0
•42•999•2•1•Updated Nov 25, 2025Nov 25, 2025
ARC-Chapter
Public
Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Apache License 2.0
•2•35•3•0•Updated Nov 19, 2025Nov 19, 2025
BlobCtrl
Public
[SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing
image-editing aigc
image-editing aigc
Python
•
Other
•3•25•1•0•Updated Nov 14, 2025Nov 14, 2025
RollingForcing
Public
[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
real-time long-context long-video-generation
real-time long-context long-video-generation video-diffusion-model efficient-tuning
Python
•
Other
•16•347•12•1•Updated Oct 31, 2025Oct 31, 2025
MindOmni
Public
Python
•
Other
•2•141•2•0•Updated Oct 15, 2025Oct 15, 2025
vllm
Public
vllm for ARC-Hunyuan-Video-7B
Python
•
Apache License 2.0
•0•3•0•5•Updated Oct 6, 2025Oct 6, 2025
GeometryCrafter
Public
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
depth-estimation video-to-4d iccv2025
depth-estimation video-to-4d iccv2025
Python
•
Other
•19•436•3•0•Updated Oct 2, 2025Oct 2, 2025
Moto
Public
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
Python
•
Other
•5•167•6•0•Updated Oct 1, 2025Oct 1, 2025
ARC-Hunyuan-Video-7B
Public
Structured Video Comprehension of Real-World Shorts
Python
•
Other
•7•233•15•0•Updated Sep 21, 2025Sep 21, 2025
AudioStory
Public
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
video-to-audio diffusion-models text-to-audio
video-to-audio diffusion-models text-to-audio audio-generation multimodal-large-language-models video-dubbing
Jupyter Notebook
•22•299•3•1•Updated Sep 21, 2025Sep 21, 2025
IC-Custom
Public
[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning
flux application image
flux application image image-editing image-inpainting image-customization aigc
Python
•
Other
•3•162•1•0•Updated Sep 15, 2025Sep 15, 2025
BrushEdit
Public
[under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
image-editing image-inpainting diffusion-models
image-editing image-inpainting diffusion-models
Python
•
Other
•28•589•11•0•Updated Sep 3, 2025Sep 3, 2025
ToonComposer
Public
[ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing
Python
•
Other
•53•558•9•0•Updated Aug 20, 2025Aug 20, 2025
TokLIP
Public
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
Python
•
Other
•5•236•8•0•Updated Aug 18, 2025Aug 18, 2025
FreeSplatter
Public
[ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
JavaScript
•
Other
•16•235•10•2•Updated Aug 4, 2025Aug 4, 2025
TencentARC.github.io
Public
HTML
•0•1•0•0•Updated Aug 1, 2025Aug 1, 2025
Video-Holmes
Public
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
Python
•
Apache License 2.0
•2•90•2•0•Updated Jul 13, 2025Jul 13, 2025
SEED-Bench-R1
Public
Python
•
Apache License 2.0
•2•99•2•0•Updated Jun 23, 2025Jun 23, 2025
GRPO-CARE
Public
Python
•
Apache License 2.0
•2•83•5•0•Updated Jun 23, 2025Jun 23, 2025
AnimeGamer
Public
[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
Python
•
Other
•29•345•5•1•Updated Apr 9, 2025Apr 9, 2025
VideoPainter
Public
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
video video-editing video-inpainting
video video-editing video-inpainting video-dataset
Python
•
Other
•41•586•15•0•Updated Apr 8, 2025Apr 8, 2025
DiTCtrl
Public
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
Python
•
Other
•9•322•8•0•Updated Mar 30, 2025Mar 30, 2025