New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Integrate ragged paged attn to pytorch/xla #8692

Merged

vanbasten23 merged 8 commits into pytorch:master from bythew3i:integrate-ragged-paged-attn

Feb 12, 2025

+433 −3

Contributor

bythew3i commented Feb 9, 2025 •

edited

Loading

Note: the jax checkify in ragged paged attention kernel will insert several scalar refs to both inputs (end of prefetch) and outputs (begining of the original output). This affects how we re-construct xla.tpu_custom_call in with payload as we need take those scalars into account.

Tested:

python test/test_pallas.py -v -k PallasTest.test_ragged_paged_attention_wrapper

bythew3i added 6 commits

February 8, 2025 10:28


          Integrate ragged paged attn to pytorch/xla

19bd1d7


          Integrate ragged paged attn to pytorch/xla

4c75ff0


          Merge branch 'integrate-ragged-paged-attn' of https://github.com/byth…

367f6bf

…ew3i/xla into integrate-ragged-paged-attn


          Fix unintended change

2bc8cf7


          Fix unintended changes 2

d903b91


          Remove temp file

f3762e1

vanbasten23 reviewed

View reviewed changes

test/test_pallas.py Outdated Show resolved Hide resolved

vanbasten23 reviewed

View reviewed changes

torch_xla/experimental/custom_kernel.py Show resolved Hide resolved

vanbasten23 reviewed

View reviewed changes

torch_xla/experimental/custom_kernel.py Show resolved Hide resolved

vanbasten23 reviewed

View reviewed changes

torch_xla/experimental/custom_kernel.py Outdated Show resolved Hide resolved

vanbasten23 reviewed

View reviewed changes

torch_xla/experimental/custom_kernel.py Show resolved Hide resolved

vanbasten23 reviewed

View reviewed changes

test/test_pallas.py Outdated Show resolved Hide resolved

miladm assigned bythew3i

miladm added the pallas label

bythew3i added 2 commits

February 12, 2025 00:47


          Resolve the comments.

789be3f


          Merge branch 'master' into integrate-ragged-paged-attn

Collaborator

vanbasten23 commented Feb 12, 2025

LGTM. This is great. Thanks @bythew3i

vanbasten23 approved these changes

View reviewed changes

vanbasten23 merged commit 06e1b59 into pytorch:master

12 checks passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels