Change the repository type filter
All
Repositories list
62 repositories
fastrl
Publicradial-attention
Public[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation- A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
streaming-vlm
Publicefficientvit
PublicEfficient vision foundation models for high-resolution generation and perception.llm-awq
Publiclpd
PublicLocality-aware Parallel Decoding for Efficient Autoregressive Image GenerationQuest
Publicomniserve
Publictorchsparse
PublicVisCompare
Publicduo-attention
Publicpatch_conv
Publicfastcomposer
Publicsparserefine
Public- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
tinyengine
Public[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memorytinychat-tutorial
Publichart
Publicdata-efficient-gans
Public[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Trainingproxylessnas
Public[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardwarespatten
Public[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruningbevfusion
Public archive[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
- [ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution