Conversation

@sryap sryap commented Dec 3, 2025

Summary:
Explicitly zero the gradients, because simply setting grad to None does not
eliminate the gradient-accumulation step.

Reviewed By: y-sq

Differential Revision: D85325066
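The fix can be illustrated with a minimal pure-Python model of gradient accumulation (the class and function names below are illustrative assumptions, not fbgemm or PyTorch APIs). Frameworks accumulate into an existing grad buffer, so clearing the reference only forces a re-allocation before the same accumulate runs again; zeroing in place keeps every measured iteration on an identical code path.

```python
# Illustrative model of gradient accumulation; Param/backward are
# hypothetical names, not fbgemm or PyTorch APIs.

class Param:
    def __init__(self, value):
        self.value = value
        self.grad = None  # like a framework parameter's .grad buffer

def backward(param, new_grad):
    # Frameworks accumulate into .grad rather than overwrite it.
    if param.grad is None:
        param.grad = [0.0] * len(new_grad)   # re-allocate the buffer
    param.grad = [g + n for g, n in zip(param.grad, new_grad)]  # accumulate

p = Param([1.0, 2.0])
backward(p, [0.5, 0.5])

p.grad = None          # does NOT skip the accumulate: the next backward
                       # re-allocates a zero buffer and still runs the add
backward(p, [0.5, 0.5])
assert p.grad == [0.5, 0.5]

# Explicit zeroing keeps the buffer, so all measured iterations run the
# same accumulate step and timings stay comparable:
for i in range(len(p.grad)):
    p.grad[i] = 0.0
backward(p, [1.0, 1.0])
assert p.grad == [1.0, 1.0]
```

Under this model, setting grad to None only changes where the zero buffer comes from; the accumulation itself still executes, which is the behavior the commit message describes.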

@sryap sryap temporarily deployed to docker-s3-upload December 3, 2025 23:06 — with GitHub Actions Inactive
@meta-cla meta-cla bot added the cla signed label Dec 3, 2025
meta-codesync bot commented Dec 3, 2025

@sryap has exported this pull request. If you are a Meta employee, you can view the originating Diff in D85325066.

sryap added 2 commits December 3, 2025 16:34
Summary:

Add an option to generate inputs as large as the GPU L2 cache size to
avoid an explicit cache clearing in every iteration

Reviewed By: henrylhtsang

Differential Revision: D85318459
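The L2-sized-input idea can be sketched as follows; this is a hedged sketch in which the cache-size constant and the helper name are assumptions for illustration, not the PR's actual code. The benchmark allocates enough distinct input copies to cover the L2 cache and rotates through them, so consecutive iterations never reuse cached data and no explicit cache flush is needed per iteration.

```python
# Hypothetical constant: L2 cache size of the target GPU (assumed ~50 MB).
L2_CACHE_BYTES = 50 * 1024 * 1024

def num_input_copies(tensor_bytes, l2_bytes=L2_CACHE_BYTES):
    """Number of distinct input copies needed so that rotating through
    them covers at least the full L2 cache (illustrative helper)."""
    # Ceiling division: enough copies that total bytes >= l2_bytes.
    return max(1, -(-l2_bytes // tensor_bytes))

# Rotate through the copies during benchmarking: iteration i uses
# copy i % num_input_copies(...), so back-to-back iterations touch
# different memory and the L2 cache never serves a stale hit.
copies = num_input_copies(8 * 1024 * 1024)   # an 8 MiB input
assert copies == 7                           # ceil(50 / 8) = 7
```

The alternative, clearing the L2 cache explicitly every iteration, adds a flush to each measured step; oversizing or rotating the inputs moves that cost out of the timed region entirely.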
Summary:

Explicitly zero the gradients, because simply setting grad to None does not
eliminate the gradient-accumulation step.

Reviewed By: y-sq

Differential Revision: D85325066