Skip to content

A balanced traffic pattern for AG minimal.#37878

Draft
llongTT wants to merge 24 commits intomainfrom
llong/dit_ag_min
Draft

A balanced traffic pattern for AG minimal.#37878
llongTT wants to merge 24 commits intomainfrom
llong/dit_ag_min

Conversation

@llongTT
Copy link
Contributor

@llongTT llongTT commented Feb 13, 2026

Ticket

Link to Github Issue

Problem description

AG minimal has low fabric utilization, especially for device ring = 4 cases.
WAN 2.2 model perf requires AG to reach 92% fabric utilization to achieve the target.

What's changed

  1. Balance the packet traffic between forward worker and backward worker such that they split the last slice. It saves 25% of fabric latency by going from "2 slice forward, 1 slice backward" to "1 and half slice both forward and backward" in device ring = 4.
  2. If balanced traffic feature enabled, also split the local writes half/half, to reduce the noc traffic on backward worker.
  3. Extend the feature to all device rings >2 and ring size even cases.

Checklist

  • All post-commit tests
  • Blackhole Post commit
  • cpp-unit-tests
  • New/Existing tests provide coverage for changes

Model tests

If your changes cover model-related code, you should run tests corresponding to affected models and platforms (Single card, T3K, Galaxy). "Choose your pipeline" workflows facilitate running multiple kinds of tests in a single run. Each offers models-mandatory and models-extended presets.
The former includes a minimal set of tests, to be run always. The latter extends that with additional ones - use your best judgement in deciding which is the most appropriate for your PR.

@llongTT llongTT changed the title Llong/dit ag min A balanced traffic pattern for AG minimal. Feb 13, 2026
@llongTT llongTT self-assigned this Feb 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant