File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -24,14 +24,27 @@ The playback video and text summary will be uploaded to <a href="https://space.b
2424- 💡 [ CVPR'26] [ AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation] ( ./slides/260413-Adacluster.pdf )
2525- 🙎♂️ Haoyue Tan
2626- [ 📕slides] ( ./slides/260414adahunyuan.pdf )
27+ - [ 📃 Q&A summary] ( https://zhuanlan.zhihu.com/p/2031793965167547374 ) , [ 📺 video] ( https://www.bilibili.com/video/BV1R6ZFBMEms/ )
2728#### Topic II
2829- 💡 [ arXiv] [ IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse] ( https://arxiv.org/abs/2603.12201 )
2930- 🙎♂️ Ruibo Liu, Ouxiang Zhou
3031- [ 📕slides] ( ./slides/260415IndexCache.pdf )
32+ - [ 📺 video] ( https://www.bilibili.com/video/BV1R6ZFBMEms/ )
33+
3134<br ><br >
3235### April 21
3336#### Topic I
3437- 💡 [ arXiv] [ Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection] ( https://arxiv.org/abs/2602.03216 )
3538- 🙎♂️ Chengjie Tang, Shen Fu
36- - 📕 ...
39+ - [ 📕slides] ( ./slides/260421-TokenSparseAttention.pdf )
40+ <br ><br >
41+ ### April 28
42+ #### Topic I
43+ - 💡 [ arXiv] [ PROBE: Co-Balancing Computation and Communication in MoE Inference via Real-Time Predictive Prefetching] ( https://arxiv.org/abs/2602.00509 )
44+ - 🙎♂️ Qinghe Wang
45+ - [ 📕slides] ( ./slides/260428-PROBE.pdf )
46+ #### Topic II
47+ - 💡 [ arXiv] [ FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving] ( https://arxiv.org/abs/2604.02715 )
48+ - 🙎♂️ Long Zhao
49+ - [ 📕slides] ( ./slides/260428-FluxMoE.pdf )
3750<br ><br >
You can’t perform that action at this time.
0 commit comments