All

121 repositories

MoDA
Public
An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".
Python
•
MIT License
•9•261•1•0•Updated May 6, 2026May 6, 2026
InfiniteVL
Public
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
Python
•
Apache License 2.0
•5•108•3•0•Updated Apr 20, 2026Apr 20, 2026
RAD
Public
[NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
Python
•
MIT License
•9•242•7•0•Updated Apr 17, 2026Apr 17, 2026
VGT
Public
Visual Generation Tuning
Python
•
MIT License
•1•100•0•0•Updated Apr 16, 2026Apr 16, 2026
4DLangVGGT
Public
Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”
Python
•
MIT License
•2•87•3•0•Updated Mar 25, 2026Mar 25, 2026
WeakTr
Public
[TIP, IEEE Transactions on Image Processing] WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation
Python
•
MIT License
•3•140•10•0•Updated Mar 25, 2026Mar 25, 2026
Spa3R
Public
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
Python
•
MIT License
•1•49•0•0•Updated Mar 25, 2026Mar 25, 2026
Senna
Public
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
end-to-end autonomous-driving vision-language-model
end-to-end autonomous-driving vision-language-model
Python
•
Apache License 2.0
•48•550•29•0•Updated Mar 15, 2026Mar 15, 2026
MobileI2V
Public
[ArXiv 2025] MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
video-generation image-to-video diffusion-models
video-generation image-to-video diffusion-models video-diffusion-transformers
Python
•
Apache License 2.0
•3•82•1•0•Updated Mar 12, 2026Mar 12, 2026
VAD
Public
[ICCV 2023 & ICLR 2026] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
end-to-end autonomous-driving
end-to-end autonomous-driving
Python
•
Apache License 2.0
•159•1.3k•77•1•Updated Jan 31, 2026Jan 31, 2026
GaussTR
Public
[CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Python
•
MIT License
•13•213•1•0•Updated Jan 5, 2026Jan 5, 2026
DiffusionDriveV2
Public
DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
Python
•
MIT License
•26•309•16•2•Updated Dec 29, 2025Dec 29, 2025
SuperCLIP
Public
Python
•
Apache License 2.0
•9•134•3•0•Updated Dec 26, 2025Dec 26, 2025
DiffusionVL
Public
[ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
Python
•
Apache License 2.0
•9•142•0•0•Updated Dec 25, 2025Dec 25, 2025
TBCM
Public
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs
Python
•0•21•1•0•Updated Dec 16, 2025Dec 16, 2025
LightningDiT
Public
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Python
•
MIT License
•57•1.5k•18•1•Updated Dec 16, 2025Dec 16, 2025
DiffusionDrive
Public
[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
Python
•
MIT License
•130•1.4k•28•1•Updated Dec 8, 2025Dec 8, 2025
EVA-X
Public
[Nature Portfolio, npj DigitalMed] EVA-X: A foundation model for general chest X-ray analysis with self-supervised learning
Python
•15•97•6•0•Updated Dec 6, 2025Dec 6, 2025
MolSight
Public
[AAAI 2026] MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning
Python
•
Apache License 2.0
•3•22•0•0•Updated Dec 5, 2025Dec 5, 2025
LENS
Public
[AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning
Python
•
Apache License 2.0
•9•128•15•0•Updated Dec 3, 2025Dec 3, 2025
Turbo-VAED
Public
[AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
Python
•1•116•12•0•Updated Nov 30, 2025Nov 30, 2025
MaskAdapter
Public
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
segmentation clip zero-shot
segmentation clip zero-shot open-vocabulary open-vocabulary-semantic-segmentation vision-language-model segment-anything open-vocabulary-segmentation zero-shot-segmentation
Python
•
Apache License 2.0
•3•132•2•0•Updated Oct 23, 2025Oct 23, 2025
hustvl.github.io
Public
HTML
•
BSD 3-Clause "New" or "Revised" License
•3•1•0•0•Updated Oct 11, 2025Oct 11, 2025
TOGS
Public
[IEEE JBHI] The official code of "TOGS: Gaussian Splatting with Temporal Opacity Offset for Real-Time 4D DSA Rendering"
Python
•1•33•1•0•Updated Sep 10, 2025Sep 10, 2025
simpleseg
Public
Python
•0•8•3•0•Updated Sep 9, 2025Sep 9, 2025
Snap-Snap
Public
The repository of "Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds"
Python
•1•39•6•0•Updated Sep 1, 2025Sep 1, 2025
recogdrive
Public
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Python
•
Apache License 2.0
•66•8•0•0•Updated Aug 21, 2025Aug 21, 2025
ViTMatte
Public
[Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers
Python
•
MIT License
•49•533•19•3•Updated Aug 13, 2025Aug 13, 2025
Dynamic-2DGS
Public
[ACM MM 2025] Dynamic 2D Gaussians: Geometrically Accurate Radiance Fields for Dynamic Objects
3d-vision 4d-reconstruction dynamic-reconstruction
3d-vision 4d-reconstruction dynamic-reconstruction gaussian-splatting 3d-gaussian-splatting 4d-gaussian-splatting
Python
•
Apache License 2.0
•5•176•3•0•Updated Aug 6, 2025Aug 6, 2025
.github
Public
0•0•0•0•Updated Jul 4, 2025Jul 4, 2025

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HUST Vision Lab

All

All

121 repositories

MoDA

InfiniteVL

RAD

VGT

4DLangVGGT

WeakTr

Spa3R

Senna

MobileI2V

VAD

GaussTR

DiffusionDriveV2

SuperCLIP

DiffusionVL

TBCM

LightningDiT

DiffusionDrive

EVA-X

MolSight

LENS

Turbo-VAED

MaskAdapter

hustvl.github.io

TOGS

simpleseg

Snap-Snap

recogdrive

ViTMatte

Dynamic-2DGS

.github

All

All

Repositories list

121 repositories