[tritonbench][AutoWS] Enable AutoWS matmul kernels on Hopper by njriasan · Pull Request #1025 · meta-pytorch/tritonbench

njriasan · 2026-04-21T19:41:57Z

Summary:
Extends the Blackwell-only TMA / persistent / descriptor matmul benchmarks
in tritonbench to also run on Hopper.

Changes:

1. Widen `enabled=IS_BLACKWELL` to `enabled=IS_BLACKWELL or IS_HOPPER` for
   the 6 `triton_blackwell_*` benchmarks in `operator.py`. The underlying
   kernels (`matmul_kernel_tma`, `matmul_kernel_tma_persistent`, and
   `matmul_kernel_descriptor_persistent`) are arch-agnostic, so they run on
   Hopper once the gates allow it.
2. Update `_prune_tma_persistent_configs` in `warp_spec_persistent_matmul.py`
   so the `FLATTEN` requirement is computed correctly across {Hopper,
   Blackwell} x {meta WS, OAI Triton WS, no WS}. `tl.range(flatten=True)` is
   only supported on Blackwell with the OAI Triton WS path; on Hopper we now
   require `FLATTEN=False` for both AutoWS (meta WS) and OAI Triton paths.
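
For readers without access to the diff, the two changes can be sketched roughly as follows. This is an illustrative sketch only, not the code in this PR: `Arch`, `WSMode`, `flatten_supported`, `prune_flatten`, and `benchmark_enabled` are hypothetical names, and configs are modeled as plain dicts rather than Triton config objects. It encodes only the rule stated above (flatten is usable solely on Blackwell with the OAI Triton WS path) and assumes that unsupported `FLATTEN=True` configs are simply dropped; whether `FLATTEN=True` is mandatory on that path is not stated here, so the sketch does not enforce it.

```python
from enum import Enum


class Arch(Enum):
    # Hypothetical stand-ins for the IS_HOPPER / IS_BLACKWELL flags.
    HOPPER = "hopper"
    BLACKWELL = "blackwell"


class WSMode(Enum):
    # Hypothetical labels for the three warp-specialization paths.
    META_WS = "meta_ws"   # AutoWS (meta WS)
    OAI_WS = "oai_ws"     # OAI Triton WS
    NO_WS = "no_ws"       # no warp specialization


def benchmark_enabled(is_blackwell: bool, is_hopper: bool) -> bool:
    """Mirror of the widened gate: enabled=IS_BLACKWELL or IS_HOPPER."""
    return is_blackwell or is_hopper


def flatten_supported(arch: Arch, ws_mode: WSMode) -> bool:
    """Per the description, flatten works only on Blackwell + OAI Triton WS."""
    return arch is Arch.BLACKWELL and ws_mode is WSMode.OAI_WS


def prune_flatten(configs, arch: Arch, ws_mode: WSMode):
    """Drop configs that request FLATTEN=True where it is unsupported.

    Assumption: each config is a dict with an optional boolean "FLATTEN" key.
    """
    if flatten_supported(arch, ws_mode):
        return list(configs)
    return [c for c in configs if not c.get("FLATTEN", False)]


configs = [
    {"BLOCK_M": 128, "FLATTEN": True},
    {"BLOCK_M": 128, "FLATTEN": False},
]

# On Hopper, both WS paths keep only the FLATTEN=False config.
hopper_meta = prune_flatten(configs, Arch.HOPPER, WSMode.META_WS)
hopper_oai = prune_flatten(configs, Arch.HOPPER, WSMode.OAI_WS)

# On Blackwell with OAI Triton WS, FLATTEN=True configs survive.
blackwell_oai = prune_flatten(configs, Arch.BLACKWELL, WSMode.OAI_WS)
```

Keeping the support check in one predicate means each cell of the {Hopper, Blackwell} x {meta WS, OAI Triton WS, no WS} matrix is decided in a single place, which is easier to audit than per-branch conditions.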

Differential Revision: D101840063

meta-codesync · 2026-04-21T19:42:05Z

@njriasan has exported this pull request. If you are a Meta employee, you can view the originating Diff in D101840063.

njriasan had a problem deploying to docker-s3-upload April 21, 2026 19:42 — with GitHub Actions Failure

meta-cla Bot added the cla signed label Apr 21, 2026

meta-codesync Bot added fb-exported meta-exported labels Apr 21, 2026

njriasan changed the title ~~Enable Blackwell matmul tests on Hopper~~ [tritonbench][AutoWS] Enable AutoWS matmul kernels on Hopper Apr 21, 2026

njriasan wants to merge 1 commit into meta-pytorch:main from njriasan:export-D101840063
