Skip to content

Commit a24cdd6

Browse files
[XPU] [DeepNVMe] use same cpu_op_desc_t with cuda (#6645)
We have found that #6592 uses `_pinned_tensor_mgr` to create cpu bounce buffer, which is same with what our xpu accelerator currently doing. So no need to use xpu device specific cpu_op_desc_t. In this PR: 1. remove custom csrc/xpu/aio/deepspeed_cpu_op.cpp 2. modify xpu async_io opbuilder. This issue cannot be easily done with revert #6532 , for we added some source file as last time GDS feature going in DS. So file this new PR :)
1 parent 11bbf45 commit a24cdd6

File tree

2 files changed

+1
-52
lines changed

2 files changed

+1
-52
lines changed

csrc/xpu/aio/deepspeed_cpu_op.cpp

-51
This file was deleted.

op_builder/xpu/async_io.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ def sources(self):
3131
'csrc/aio/common/deepspeed_aio_types.cpp',
3232
'csrc/aio/py_lib/deepspeed_pin_tensor.cpp',
3333
'csrc/aio/py_lib/deepspeed_py_io_handle.cpp',
34-
'csrc/xpu/aio/deepspeed_cpu_op.cpp',
34+
'csrc/aio/py_lib/deepspeed_cpu_op.cpp',
3535
'csrc/aio/py_lib/deepspeed_aio_op_desc.cpp',
3636
]
3737

0 commit comments

Comments
 (0)