Commit 20b3e76
Revert "fix(jit): GEMM kernels produce NaN under concurrency — missing GDC flags cause PDL synchronization barriers to compile as no-ops" (flashinfer-ai#2737)
Proposing to revert flashinfer-ai#2716 in order to unblock the 0.6.6 release.
flashinfer-ai#2716 seems to have broken AOT packages:
https://github.com/flashinfer-ai/flashinfer/actions/runs/22870567870/job/66353637447?pr=2730
## Summary by CodeRabbit
* **Bug Fixes**
  * Removed legacy GPU compilation flags related to GDC enablement for certain GPU tiers during JIT GEMM generation, reducing extra compile flags and build noise; GDC-related flags for the latest GPU tier remain enabled where still applicable.
---------
Co-authored-by: yzh119 <zihaoy@nvidia.com>
Signed-off-by: Amey Naik <212485788+ameynaik-hub@users.noreply.github.com>

1 parent 3d07465 commit 20b3e76
1 file changed
Lines changed: 1 addition & 15 deletions
| Original file lines | New file lines | Change |
|---|---|---|
| 94-95 | | 2 lines deleted |
| 163-164 | | 2 lines deleted |
| 213-214 | | 2 lines deleted |
| 265-266 | | 2 lines deleted |
| 360-361 | | 2 lines deleted |
| 529-533 | 519 | 5 lines deleted, 1 line added |