Skip to content

Pass the updated embeddings to EmbeddingKVDB #4210

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

chouxi
Copy link
Contributor

@chouxi chouxi commented May 29, 2025

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/1285

Added the functions stream_cuda/stream.

Added profile to the callback functions.

Added the function stream_sync_cuda

  • To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097

Copy link

netlify bot commented May 29, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 7f1d0ca
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68408e8969ca260008dea23d
😎 Deploy Preview https://deploy-preview-4210--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D73819097

chouxi added a commit to chouxi/FBGEMM that referenced this pull request May 29, 2025
Summary:

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
@chouxi chouxi force-pushed the export-D73819097 branch from a422630 to 95d3b88 Compare June 4, 2025 18:09
chouxi added a commit to chouxi/FBGEMM that referenced this pull request Jun 4, 2025
Summary:

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
chouxi added a commit to chouxi/FBGEMM that referenced this pull request Jun 4, 2025
Summary:

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
@chouxi chouxi force-pushed the export-D73819097 branch from 95d3b88 to 7e280ac Compare June 4, 2025 18:11
chouxi added a commit to chouxi/FBGEMM that referenced this pull request Jun 4, 2025
Summary:

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
chouxi added a commit to chouxi/FBGEMM that referenced this pull request Jun 4, 2025
Summary:

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D73819097

@chouxi chouxi force-pushed the export-D73819097 branch from 7e280ac to 5699c45 Compare June 4, 2025 18:13
chouxi added a commit to chouxi/FBGEMM that referenced this pull request Jun 4, 2025
Summary:
Pull Request resolved: pytorch#4210

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
Summary:
Pull Request resolved: pytorch#4210

X-link: facebookresearch/FBGEMM#1285

Added the functions `stream_cuda/stream`.
- When `blocking_tensor_copy=false` Create a new thread to execute the callback registered via the `stream_cuda` the reason as in comment. this is based on [the findings that callbacks are executed in serialized manner]( https://fb.workplace.com/permalink.php?story_fbid=pfbid02s7RWvRZ4g2nS5i42kkyApvLsCbiRpGrdAEPs2p5qr2MDnq5YbgfThQ6PXSB6y13Al&id=100026528794331)
- When `blocking_tensor_copy=true` just copy and enqueue in the callback thread.

Added profile to the callback functions.

Added the function  `stream_sync_cuda`
- To explicitly join the async copy thread to make sure the copy is happened before the buffer got overwritten again.

Reviewed By: q10

Differential Revision: D73819097
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D73819097

@chouxi chouxi force-pushed the export-D73819097 branch from 5699c45 to 7f1d0ca Compare June 4, 2025 18:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants