Commit e3fb72a
Aegis-AI
Fix GPU Hang: flush Metal graph with eval() after MTPTokenIterator verification pass
On hybrid SSM/attention models (Qwen35), the recurrent GatedDeltaNet layers
accumulate un-evaluated MLX graph nodes across each speculateRound(). Without
an explicit eval() after callMTP(), the Metal command buffer grows across
multiple speculation rounds until it triggers the GPU Watchdog (kIOGPUCommandBufferCallbackErrorHang).
Adding eval(mtpResult) immediately after the verification forward pass flushes
the accumulated graph, preventing the Metal timeout.1 parent b42d9a0 commit e3fb72a
1 file changed
Lines changed: 7 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1227 | 1227 | | |
1228 | 1228 | | |
1229 | 1229 | | |
1230 | | - | |
| 1230 | + | |
| 1231 | + | |
| 1232 | + | |
| 1233 | + | |
| 1234 | + | |
| 1235 | + | |
| 1236 | + | |
1231 | 1237 | | |
1232 | 1238 | | |
1233 | 1239 | | |
| |||
0 commit comments