Commit 86b63d7
Reduce DeepSeek long-seq decoder override to 12288 (#37834)
## Background
The Galaxy long-seq decoder-block job in [run
#21973954244/job/63481804018](https://github.com/tenstorrent/tt-metal/actions/runs/21973954244/job/63481804018)
was OOM-killed on host at `DEEPSEEK_MAX_SEQ_LEN_OVERRIDE=16384`.
## Change
This PR reduces the decoder-block override only:
- `DEEPSEEK_MAX_SEQ_LEN_OVERRIDE=12288` for
`models/demos/deepseek_v3/tests/test_decoder_block.py`
The `test_mla.py` invocation remains at `16384`, and `test_model.py`
stays at `1024`.
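
As a rough illustration of the settings above, the sketch below records the per-test override values and shows one way a test could resolve the environment variable. Only the variable name `DEEPSEEK_MAX_SEQ_LEN_OVERRIDE`, the test file names, and the three values come from this PR. The helper `resolve_max_seq_len`, the `OVERRIDES` mapping, and the fallback default are hypothetical and are not taken from the repository.

```python
import os

# Hypothetical fallback used when the override is unset (an assumption).
DEFAULT_MAX_SEQ_LEN = 1024

# Override values per test invocation, as stated in this PR.
OVERRIDES = {
    "models/demos/deepseek_v3/tests/test_decoder_block.py": 12288,  # was 16384
    "test_mla.py": 16384,
    "test_model.py": 1024,
}


def resolve_max_seq_len(env=None, default=DEFAULT_MAX_SEQ_LEN):
    """Return the max sequence length from the override variable, or the default.

    Hypothetical helper: the real tests may read the variable differently.
    """
    env = os.environ if env is None else env
    raw = env.get("DEEPSEEK_MAX_SEQ_LEN_OVERRIDE")
    if raw is None or raw.strip() == "":
        return default
    value = int(raw)
    if value <= 0:
        raise ValueError(f"override must be positive, got {value}")
    return value
```

Passing a plain dict as `env` keeps the helper testable without touching the process environment.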
## Why
`12288` lowers peak host memory for the long-seq decoder-block path
while preserving extended long-seq coverage in CI. It targets the
specific failure mode seen in the run above without weakening coverage
elsewhere in the test matrix.
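
For a sense of scale, the arithmetic below compares the two sequence lengths. The PR does not say how host memory scales with sequence length, so the linear and quadratic exponents are assumptions chosen purely for illustration, and `relative_footprint` is a hypothetical helper.

```python
OLD_SEQ_LEN = 16384  # previous override, OOM-killed on host
NEW_SEQ_LEN = 12288  # override introduced by this PR

# Ratio of new to old sequence length.
ratio = NEW_SEQ_LEN / OLD_SEQ_LEN


def relative_footprint(seq_ratio, exponent):
    """Relative size of a buffer that grows as seq_len ** exponent (assumed model)."""
    return seq_ratio ** exponent


linear = relative_footprint(ratio, 1)     # buffers proportional to seq_len
quadratic = relative_footprint(ratio, 2)  # buffers proportional to seq_len squared

print(f"sequence length ratio: {ratio:.4f}")
print(f"linear-scaling buffers: {linear:.4f} of previous size")
print(f"quadratic-scaling buffers: {quadratic:.4f} of previous size")
```

Under these assumptions the sequence length drops by a quarter, and any buffer that grows quadratically with it shrinks by a little under half.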
## Verification
A rerun was kicked off on this branch:
- Run URL: https://github.com/tenstorrent/tt-metal/actions/runs/21983835196

1 parent c987548, commit 86b63d7
1 file changed, +1 -1 (line 125 replaced)