Increased core count for paged SDPA for Qwen by atupe-tt · Pull Request #37872 · tenstorrent/tt-metal

atupe-tt · 2026-02-13T18:20:31Z

Problem description

Improve the decode perf for Qwen on TG

What's changed

Increased the core count for paged SDPA (decode)

Checklist

New/Existing tests provide coverage for changes

Model tests

If your changes cover model-related code, you should run tests corresponding to affected models and platforms (Single card, T3K, Galaxy). "Choose your pipeline" workflows facilitate running multiple kinds of tests in a single run. Each offers models-mandatory and models-extended presets.
The former includes a minimal set of tests, to be run always. The latter extends that with additional ones - use your best judgement in deciding which is the most appropriate for your PR.

Copilot

Pull request overview

This PR improves decode performance for Qwen on TG by increasing the core count for paged SDPA (Scaled Dot-Product Attention) decode operations from 32 to 48 cores. The change aligns the Qwen-specific model configuration with the base Llama model configuration in model_config.py, which already uses these optimized settings.

Changes:

Increased compute grid size from (8, 4) to (8, 6) for PAGED_SDPA_DECODE_PROGCFG
Updated core count from 32 to 48 to match the new grid size (8 × 6 = 48)

yalrawwashTT

Approving to unblock, pending CI run of galaxy demo

increased core count for paged SDPA for Qwen

16befc1

atupe-tt requested a review from mbahnasTT February 13, 2026 18:20

atupe-tt requested review from djordje-tt, johanna-rock-tt, kpaigwar, mtairum and sraizada-tt as code owners February 13, 2026 18:20

Copilot AI review requested due to automatic review settings February 13, 2026 18:20

atupe-tt requested review from a team, alingTT and yalrawwashTT as code owners February 13, 2026 18:20

Copilot started reviewing on behalf of atupe-tt February 13, 2026 18:21 View session

Copilot AI reviewed Feb 13, 2026

View reviewed changes

mbahnasTT approved these changes Feb 13, 2026

View reviewed changes

kpaigwar approved these changes Feb 13, 2026

View reviewed changes

yalrawwashTT approved these changes Feb 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increased core count for paged SDPA for Qwen#37872

Increased core count for paged SDPA for Qwen#37872
atupe-tt wants to merge 1 commit intomainfrom
atupe/qwen-tg-core-optimization

atupe-tt commented Feb 13, 2026 •

edited by github-actions bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

yalrawwashTT left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

atupe-tt commented Feb 13, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem description

What's changed

Checklist

Model tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

yalrawwashTT left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

atupe-tt commented Feb 13, 2026 •

edited by github-actions bot

Loading