Commit ffa04bb
Update trtllm-gen FMHA cubins to fix context SWA page-skip
Update TRTLLM_GEN_FMHA artifact path and checksum to pick up cubins
built from trtllm-gen with the context SWA page-skip fix. The new cubins
skip loading out-of-window KV pages in context (prefill) kernels,
preventing NaN corruption from null blocks in the KV cache.
Fixes: https://nvbugspro.nvidia.com/bug/5922676
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent a99ee72 commit ffa04bb
1 file changed
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
135 | 135 | | |
136 | 136 | | |
137 | 137 | | |
138 | | - | |
| 138 | + | |
139 | 139 | | |
140 | 140 | | |
141 | 141 | | |
| |||
155 | 155 | | |
156 | 156 | | |
157 | 157 | | |
158 | | - | |
| 158 | + | |
159 | 159 | | |
160 | 160 | | |
161 | 161 | | |
| |||
0 commit comments