Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fa3] Upgrade fa3 on B200 #171

Closed
wants to merge 2 commits into from
Closed

[fa3] Upgrade fa3 on B200 #171

wants to merge 2 commits into from

Conversation

xuzhao9
Copy link
Contributor

@xuzhao9 xuzhao9 commented Mar 4, 2025

To run flash_attention on B200, we need to upgrade the submodule.

flash_attn_v3 does not support B200, but flash_attn_v2 support is done on B200.

Test plan:

python run.py --op flash_attention --metrics tflops --only flash_v2 --batch 8 --n-heads 16 --d-head 128 

  (Batch, Heads, SeqLen, Dhead)    flash_v2-tflops
-------------------------------  -----------------
              (8, 16, 128, 128)            60.5676
              (8, 16, 256, 128)           147.492
              (8, 16, 512, 128)           245.259
             (8, 16, 1024, 128)           315.111
             (8, 16, 2048, 128)           352.104
             (8, 16, 4096, 128)           353.912
             (8, 16, 8192, 128)           361.591
            (8, 16, 16384, 128)           330.587
                        average           270.828

@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload March 4, 2025 18:10 — with GitHub Actions Inactive
@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload March 4, 2025 18:10 — with GitHub Actions Inactive
@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload March 4, 2025 18:10 — with GitHub Actions Inactive
@xuzhao9 xuzhao9 requested a review from adamomainz March 7, 2025 19:40
@facebook-github-bot
Copy link
Contributor

@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@xuzhao9 merged this pull request in 79c5302.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants