Adding flex_attention benchmark for eager and compile mode #174
Summary:
Since we are actively adding TritonKernel optimizations to flex_attention through inductor, it is useful to track the perf improvements through tritonbench.

NOTE: The initial version uses fixed sizes for batch_size, seq_len, head_dim and num_heads, with follow-ups to make them configurable.

Differential Revision: D71137239