Adding a mutex lock to set_range function #4207

Raahul46 · 2025-05-29T00:35:06Z

Summary:
Context:
Because we expose KVTensor to external surfaces (i.e., checkpointing), those callers are free to invoke the KVTensor functions concurrently.

For example,

https://www.internalfb.com/code/fbsource/[5b7b1eef7d69]/fbcode/aiplatform/modelstore/checkpointing/pyper/TensorLoaderCallback.h?lines=85-86

This function calls set_range on the same KVTensor multiple times, because we split a large chunk of data into smaller chunks and try to write them concurrently. This is bad practice because the SSD I/O path also uses multithreading to write data into the KVTensor.

Currently, we use 32 threads (one thread per shard) to write data. As a result, when set_range is called multiple times concurrently, it can lead to thread contention and increased synchronization overhead.
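
To make the fan-out described above concrete, here is a minimal sketch of the arithmetic. It is not FBGEMM code: `split_into_chunks`, `pending_shard_tasks`, and `kNumShards` are hypothetical names, and the sketch assumes that every overlapping set_range call hands work to each of the 32 shard writers.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <utility>
#include <vector>

// Assumption taken from the description: 32 writer threads, one per shard.
constexpr int64_t kNumShards = 32;

// Split [0, total_rows) into consecutive (start, length) chunks, the way a
// caller might before issuing one set_range call per chunk.
std::vector<std::pair<int64_t, int64_t>> split_into_chunks(
    int64_t total_rows, int64_t chunk_rows) {
  assert(total_rows >= 0 && chunk_rows > 0);
  std::vector<std::pair<int64_t, int64_t>> chunks;
  for (int64_t start = 0; start < total_rows; start += chunk_rows) {
    chunks.emplace_back(start, std::min(chunk_rows, total_rows - start));
  }
  return chunks;
}

// Number of shard-level write tasks competing for the shard writers when
// `concurrent_calls` set_range calls overlap (assuming each call touches
// every shard).
int64_t pending_shard_tasks(int64_t concurrent_calls) {
  return concurrent_calls * kNumShards;
}
```

Under these assumptions, a buffer split into three chunks and written concurrently produces 96 shard-level tasks competing for the same 32 writers, whereas a single call produces 32.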

In this Diff:

We introduce a mutex lock on the set_range function. Each call holds the lock for the duration of its execution, so multiple calls are processed serially, leading to more efficient use of the threads.
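
The idea can be sketched as follows. This is an illustrative toy, not the actual change in this diff: `ToyKVTensor`, its row-modulo sharding rule, the `max_in_flight` counter, and `write_chunks_concurrently` are all hypothetical, and the real KVTensor writes to SSD rather than to an in-memory vector.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical stand-in for a KVTensor-like wrapper; not the FBGEMM class.
class ToyKVTensor {
 public:
  ToyKVTensor(int64_t rows, int num_shards)
      : data_(static_cast<size_t>(rows), 0.0f), num_shards_(num_shards) {}

  // Writes `values` into rows [start, start + values.size()).
  // The lock is held for the whole call, so overlapping callers run one at a
  // time and only one group of shard writer threads is active at any moment.
  void set_range(int64_t start, const std::vector<float>& values) {
    std::lock_guard<std::mutex> guard(mtx_);
    assert(start >= 0 &&
           start + static_cast<int64_t>(values.size()) <=
               static_cast<int64_t>(data_.size()));
    ++in_flight_;
    max_in_flight_ = std::max(max_in_flight_, in_flight_);

    std::vector<std::thread> writers;
    for (int shard = 0; shard < num_shards_; ++shard) {
      writers.emplace_back([this, shard, start, &values] {
        // Assumed sharding rule: row r belongs to shard r % num_shards_.
        // Each row is written by exactly one thread, so writes do not overlap.
        for (size_t i = 0; i < values.size(); ++i) {
          const int64_t row = start + static_cast<int64_t>(i);
          if (row % num_shards_ == shard) {
            data_[static_cast<size_t>(row)] = values[i];
          }
        }
      });
    }
    for (auto& w : writers) {
      w.join();
    }
    --in_flight_;
  }

  float get(int64_t row) const {
    std::lock_guard<std::mutex> guard(mtx_);
    return data_[static_cast<size_t>(row)];
  }

  // Highest number of set_range calls ever observed inside the critical section.
  int max_in_flight() const {
    std::lock_guard<std::mutex> guard(mtx_);
    return max_in_flight_;
  }

 private:
  std::vector<float> data_;
  int num_shards_;
  mutable std::mutex mtx_;
  int in_flight_ = 0;
  int max_in_flight_ = 0;
};

// Mimics a caller that splits one large write into chunks and issues the
// set_range calls from separate threads. Row r receives the value r.
void write_chunks_concurrently(ToyKVTensor& tensor, int64_t total_rows,
                               int64_t chunk_rows) {
  std::vector<std::thread> callers;
  for (int64_t start = 0; start < total_rows; start += chunk_rows) {
    const int64_t len = std::min(chunk_rows, total_rows - start);
    callers.emplace_back([&tensor, start, len] {
      std::vector<float> values(static_cast<size_t>(len));
      for (int64_t i = 0; i < len; ++i) {
        values[static_cast<size_t>(i)] = static_cast<float>(start + i);
      }
      tensor.set_range(start, values);
    });
  }
  for (auto& c : callers) {
    c.join();
  }
}
```

In this sketch the callers still issue their chunks from separate threads, but the lock ensures that at most one set_range call is inside the critical section at a time, so the shard writers of different calls never compete with each other.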

Differential Revision: D75555658

netlify · 2025-05-29T00:35:11Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | `98cc2b6` |
| 🔍 Latest deploy log | https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/6838930881056100081de1dc |
| 😎 Deploy Preview | https://deploy-preview-4207--pytorch-fbgemm-docs.netlify.app |


facebook-github-bot · 2025-05-29T00:35:14Z

This pull request was exported from Phabricator. Differential Revision: D75555658

facebook-github-bot · 2025-05-29T16:49:04Z

This pull request was exported from Phabricator. Differential Revision: D75555658

Summary:
Pull Request resolved: pytorch#4207

X-link: facebookresearch/FBGEMM#1281

**Context:**
Because we expose KVTensor to external surfaces (i.e., checkpointing), those callers are free to invoke the KVTensor functions concurrently.

For example,

https://www.internalfb.com/code/fbsource/[5b7b1eef7d69]/fbcode/aiplatform/modelstore/checkpointing/pyper/TensorLoaderCallback.h?lines=85-86

This function calls set_range on the same KVTensor multiple times, because we split a large chunk of data into smaller chunks and try to write them concurrently. This is bad practice because the SSD I/O path also uses multithreading to write data into the KVTensor.

Currently, we use 32 threads (one thread per shard) to write data. As a result, when set_range is called multiple times concurrently, it can lead to thread contention and increased synchronization overhead.

**In this Diff:**
We introduce a mutex lock on the set_range function. Each call holds the lock for the duration of its execution, so multiple calls are processed serially, leading to more efficient use of the threads.

Differential Revision: D75555658

facebook-github-bot · 2025-05-29T17:01:54Z

This pull request was exported from Phabricator. Differential Revision: D75555658

facebook-github-bot · 2025-05-30T00:15:01Z

This pull request has been merged in c845cc9.

facebook-github-bot added the cla signed label May 29, 2025

facebook-github-bot added the fb-exported label May 29, 2025

Raahul46 force-pushed the export-D75555658 branch 2 times, most recently from c967848 to cdc6c13 Compare May 29, 2025 16:46

Raahul46 force-pushed the export-D75555658 branch from cdc6c13 to 2071f80 Compare May 29, 2025 16:49

Raahul46 force-pushed the export-D75555658 branch from 2071f80 to 98cc2b6 Compare May 29, 2025 17:01

facebook-github-bot closed this in c845cc9 May 30, 2025

facebook-github-bot added the Merged label May 30, 2025
