Commit ff8edef
Fix SDPA decode watcher errors with half-tile (16x32) CBs
`generate_reduce_scaler` hardcoded 2048 bytes and 4 faces, assuming
full 32x32 bf16 tiles. When circular buffers use half tiles (1024B,
2 faces), this overwrites adjacent L1 memory causing watcher-detected
corruption.
Restore the `half_tile` template parameter (previously removed in
cleanup) so the zero-fill size and face iteration adapt to the actual
tile dimensions. Also fix idle core runtime args count mismatch in
sdpa_decode_program_factory.
Fixes: #37631
Fixes: #29225
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 89175b6 commit ff8edef
File tree
4 files changed
+8
-9
lines changed- models/demos/deepseek_v3_b1/tests/unit_tests
- ttnn/cpp/ttnn
- kernel/dataflow
- operations/transformer/sdpa_decode/device
- kernels/dataflow
4 files changed
+8
-9
lines changedLines changed: 1 addition & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
| 15 | + | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
25 | | - | |
26 | | - | |
27 | 25 | | |
28 | 26 | | |
29 | 27 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
| 11 | + | |
11 | 12 | | |
12 | 13 | | |
13 | 14 | | |
14 | | - | |
| 15 | + | |
15 | 16 | | |
16 | 17 | | |
17 | 18 | | |
| |||
27 | 28 | | |
28 | 29 | | |
29 | 30 | | |
30 | | - | |
| 31 | + | |
31 | 32 | | |
32 | 33 | | |
33 | 34 | | |
| |||
Lines changed: 3 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
134 | 134 | | |
135 | 135 | | |
136 | 136 | | |
137 | | - | |
138 | | - | |
139 | | - | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
140 | 140 | | |
141 | 141 | | |
142 | 142 | | |
| |||
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1011 | 1011 | | |
1012 | 1012 | | |
1013 | 1013 | | |
1014 | | - | |
| 1014 | + | |
1015 | 1015 | | |
1016 | 1016 | | |
1017 | 1017 | | |
| |||
0 commit comments