Commit 8b4f25c
Validate incompatible cache_modifier/eviction_policy combinations in NVIDIA backend
When tl.load/tl.store is called with a PTX-illegal combination of
cache_modifier and eviction_policy, Triton previously emitted PTX
containing both modifiers and let ptxas fail with an opaque assembler
error:
ptxas error: Modifier '.evict_first' cannot be combined with modifier '.cs'
Users saw a low-level message with no indication of which Python
arguments caused it.
Add validation in LoadStoreOpToLLVM.cpp (NVIDIA-specific PTX lowering)
that emits a clear compilation error before any PTX is generated.
Placing the check in the NVIDIA backend, not in backend-agnostic
semantic.py, keeps the frontend neutral to PTX ISA constraints.
PTX-illegal combinations covered:
| op | cache_modifier | eviction_policy |
|-------|----------------|------------------------------|
| store | .cs | evict_first, evict_last |
| store | .cg | evict_first |
| load | .ca | evict_first, evict_last |
| load | .cg | evict_first |

1 parent f7c1d69 commit 8b4f25c
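
As an illustration only, the table of covered combinations can be modeled as a small lookup. This is a Python sketch, not the C++ check added to LoadStoreOpToLLVM.cpp; the names `ILLEGAL_COMBINATIONS` and `check_cache_eviction` and the wording of the error message are assumptions made for this example.

```python
# Hypothetical model of the table above. The real check is written in C++
# inside the NVIDIA backend; every name here is illustrative only.
ILLEGAL_COMBINATIONS = {
    ("store", ".cs"): {"evict_first", "evict_last"},
    ("store", ".cg"): {"evict_first"},
    ("load", ".ca"): {"evict_first", "evict_last"},
    ("load", ".cg"): {"evict_first"},
}


def check_cache_eviction(op, cache_modifier, eviction_policy):
    """Raise ValueError if (op, cache_modifier, eviction_policy) is in the table."""
    rejected = ILLEGAL_COMBINATIONS.get((op, cache_modifier), set())
    if eviction_policy in rejected:
        # Name the Python-level arguments so the user can see what to change.
        raise ValueError(
            f"tl.{op}: cache_modifier='{cache_modifier}' cannot be combined "
            f"with eviction_policy='{eviction_policy}' on the NVIDIA backend"
        )


if __name__ == "__main__":
    check_cache_eviction("load", ".cg", "evict_last")  # not in the table, passes
    try:
        check_cache_eviction("store", ".cs", "evict_first")
    except ValueError as err:
        print(err)
```

Keying the lookup on the (op, cache_modifier) pair mirrors the table row by row, so extending coverage means adding a row, not another branch.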
File tree: 2 files changed (+109, -0)
- test/Conversion
- third_party/nvidia/lib/TritonNVIDIAGPUToLLVM
Lines changed: 34 additions & 0 deletions