Commit b809821
[Fmha] update trtllm-gen FMHA cubins and sync headers for context SWA fix (#3089)
## 📌 Description
The branch has two commits:
1. Update the trtllm-gen FMHA cubins to fix context SWA (sliding-window attention) page-skip: updates the artifact path and checksum in `artifacts.py`.
2. Sync the trtllm FMHA headers with the latest trtllm-gen (from PR #2711): cherry-picks header changes to match the new cubin `MetaInfo` struct.
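
Because the first commit changes an artifact path together with its checksum, a reviewer may want to confirm that a downloaded cubin bundle matches the recorded digest. The following is a minimal sketch only: it assumes the checksum is a SHA-256 hex digest, and `sha256_of_file` and `verify_artifact` are hypothetical helper names, not functions from `artifacts.py`.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks to bound memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_artifact(path: Path, expected_sha256: str) -> bool:
    """Compare a file's digest to the expected value (assumed to be SHA-256 hex)."""
    return sha256_of_file(path) == expected_sha256.strip().lower()
```

A stale checksum paired with a new path (or the reverse) would make `verify_artifact` return `False`, which is why the two values are updated in the same commit.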
## 🚀 Pull Request Checklist
Thank you for contributing to FlashInfer! Before we review your pull
request, please make sure the following items are complete.
### ✅ Pre-commit Checks
- [x] I have installed `pre-commit` by running `pip install pre-commit`
(or used your preferred method).
- [x] I have installed the hooks with `pre-commit install`.
- [x] I have run the hooks manually with `pre-commit run --all-files`
and fixed any reported issues.
> If you are unsure about how to set up `pre-commit`, see [the
pre-commit documentation](https://pre-commit.com/).
## 🧪 Tests
- [x] Tests have been added or updated as needed.
- [x] All tests are passing (`unittest`, etc.).
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
* **Bug Fixes**
  * Fixed sparse-attention truncation so sequence-length top-K is applied correctly when sparse-attention is enabled.
* **Improvements**
  * Standardized sparse-attention parameter naming and selection logic to make behavior more consistent across launches and kernel choices.
  * Skip incompatible kernel variants during runtime kernel loading to avoid incorrect selections.
* **Chores**
  * Updated FMHA runtime artifact paths and their checksums for validation and downloads.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
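
The two behavioral items in the summary above (top-K truncation applied only when sparse attention is enabled, and skipping incompatible kernel variants at load time) can be illustrated with a small sketch. Everything here is an assumption made for illustration: `KernelMeta`, `supports_sparse`, `effective_kv_len`, and `compatible_kernels` are hypothetical names, and the real cubin `MetaInfo` struct and selection logic in the headers will differ.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class KernelMeta:
    # Hypothetical metadata record, not the real cubin MetaInfo layout.
    name: str
    supports_sparse: bool


def effective_kv_len(seq_len: int, top_k: int, sparse_enabled: bool) -> int:
    """Clamp the sequence length to top_k only when sparse attention is on.

    Assumed semantics: with sparse attention disabled, top_k is ignored.
    """
    if sparse_enabled and top_k > 0:
        return min(seq_len, top_k)
    return seq_len


def compatible_kernels(kernels: List[KernelMeta], sparse_enabled: bool) -> List[KernelMeta]:
    """Drop variants that cannot serve the request instead of loading them."""
    return [k for k in kernels if k.supports_sparse or not sparse_enabled]
```

Filtering before selection means an incompatible variant can never be picked by a later heuristic, which is one plausible reading of "avoid incorrect selections".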
---------
Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

1 parent c9eb3cd commit b809821
2 files changed
Lines changed: 19 additions & 8 deletions