Skip to content

Commit fd50480

Browse files
authored
Update 2026_spring.md
1 parent 20cb01d commit fd50480

1 file changed

Lines changed: 14 additions & 1 deletion

File tree

src/2026_spring.md

Lines changed: 14 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,14 +24,27 @@ The playback video and text summary will be uploaded to <a href="https://space.b
2424
- 💡 [CVPR'26] [AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation](./slides/260413-Adacluster.pdf)
2525
- 🙎‍♂️ Haoyue Tan
2626
- [📕slides](./slides/260414adahunyuan.pdf)
27+
- [📃 Q&A summary](https://zhuanlan.zhihu.com/p/2031793965167547374), [📺 video](https://www.bilibili.com/video/BV1R6ZFBMEms/)
2728
#### Topic II
2829
- 💡 [arXiv] [IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse](https://arxiv.org/abs/2603.12201)
2930
- 🙎‍♂️ Ruibo Liu, Ouxiang Zhou
3031
- [📕slides](./slides/260415IndexCache.pdf)
32+
- [📺 video](https://www.bilibili.com/video/BV1R6ZFBMEms/)
33+
3134
<br><br>
3235
### April 21
3336
#### Topic I
3437
- 💡 [arXiv] [Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection](https://arxiv.org/abs/2602.03216)
3538
- 🙎‍♂️ Chengjie Tang, Shen Fu
36-
- 📕 ...
39+
- [📕slides](./slides/260421-TokenSparseAttention.pdf)
40+
<br><br>
41+
### April 28
42+
#### Topic I
43+
- 💡 [arXiv] [PROBE: Co-Balancing Computation and Communication in MoE Inference via Real-Time Predictive Prefetching](https://arxiv.org/abs/2602.00509)
44+
- 🙎‍♂️ Qinghe Wang
45+
- [📕slides](./slides/260428-PROBE.pdf)
46+
#### Topic II
47+
- 💡 [arXiv] [FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving](https://arxiv.org/abs/2604.02715)
48+
- 🙎‍♂️ Long Zhao
49+
- [📕slides](./slides/260428-FluxMoE.pdf)
3750
<br><br>

0 commit comments

Comments
 (0)