Commit 81ecf9a
[release/2.9] Fix int4mm device memcpy error on Windows (pytorch#175410) (#3166)
On Windows with HIP/ROCm, std::memcpy is a __host__ function and cannot
be called from __device__ code. Use raw memcpy (which the HIP compiler
provides as a device builtin) when building on Windows.
This will allow builds for of pytorch for gfx942 on Windows. gfx950 is
yet to be tested but it should likely build as well.
Pull Request resolved: pytorch#175410
Approved by: https://github.com/jeffdaily
Co-authored-by: Aaryaman Vasishta <aaryaman.vasishta@amd.com>1 parent 813feb6 commit 81ecf9a
1 file changed
+7
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
581 | 581 | | |
582 | 582 | | |
583 | 583 | | |
| 584 | + | |
| 585 | + | |
| 586 | + | |
| 587 | + | |
| 588 | + | |
| 589 | + | |
584 | 590 | | |
| 591 | + | |
585 | 592 | | |
586 | 593 | | |
587 | 594 | | |
| |||
0 commit comments