[rocisa] Use comgr instead of calling amdclang++ #1952

KKyang · 2025-04-18T05:30:38Z

Use comgr C API instead of calling amdclang++ directly.

hcman2

OK if CI passes.

stellaraccident

Let's land #1951 first since that introduces a structural improvement.

This needs more work so that it does not leak and is exception-safe, and it should be implemented in the helper.cpp introduced in #1951.

Also, please don't land those CMake changes to rocm_path. I couldn't follow why they were needed, so we can discuss the intent further.

tensilelite/rocisa/CMakeLists.txt

tensilelite/rocisa/rocisa/include/helper.hpp

stellaraccident · 2025-04-18T09:34:17Z

tensilelite/rocisa/rocisa/include/helper.hpp

+    // Create data set and add the input data
+    CHECK_COMGR(amd_comgr_create_data_set(&dataSet));
+    CHECK_COMGR(amd_comgr_create_data_set(&outputDataSet));
+    CHECK_COMGR(amd_comgr_data_set_add(dataSet, data));


What happens if any of these get an error and throw? As I read it, every one of them leaks memory.

You need to rewrite this so that your allocated entities are owned by an RAII instance that deallocates whatever was allocated on destruction. See #1951 for one (of many) ways you can do this to bridge to a C-based API safely.

KKyang · 2025-04-18

I'll change to unique_ptr instead.

stellaraccident · 2025-04-18

I'm not sure that is a good way: these have explicit apis for create/release and do not have the same semantics as delete with respect to nullptr afaict. But I'll have a look at what you come up with.
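The concern raised in this thread can be illustrated with a minimal sketch of a move-only owner for a handle that has explicit create/destroy calls. This is not the code from this PR or from #1951. The `fake_*` functions, `check`, `ScopedHandle` and `run_and_fail` are hypothetical names invented here so the example compiles without the comgr headers; in the real code the create call would be something like `amd_comgr_create_data_set` from the diff above, paired with its matching release call.

```cpp
#include <cassert>
#include <stdexcept>
#include <utility>

// Stand-in for a C API with explicit create/destroy calls (hypothetical).
// A fake is used so that this sketch builds without the comgr headers.
struct fake_handle_t { int id; };
enum fake_status_t { FAKE_OK = 0, FAKE_ERROR = 1 };

static int g_live_handles = 0;  // handles created but not yet destroyed
static int g_next_id = 1;

inline fake_status_t fake_create(fake_handle_t* out) {
    out->id = g_next_id++;
    ++g_live_handles;
    return FAKE_OK;
}
inline fake_status_t fake_destroy(fake_handle_t) {
    --g_live_handles;
    return FAKE_OK;
}
inline fake_status_t fake_failing_op(fake_handle_t) { return FAKE_ERROR; }

// Throws on any non-success status, in the spirit of CHECK_COMGR in the diff.
inline void check(fake_status_t s) {
    if (s != FAKE_OK) throw std::runtime_error("fake API call failed");
}

// Move-only owner: creates in the constructor, destroys in the destructor.
// Ownership is tracked with a flag because the handle is a plain struct,
// not a pointer, so there is no nullptr state to rely on.
class ScopedHandle {
public:
    ScopedHandle() { check(fake_create(&handle_)); owned_ = true; }
    ~ScopedHandle() { reset(); }
    ScopedHandle(const ScopedHandle&) = delete;
    ScopedHandle& operator=(const ScopedHandle&) = delete;
    ScopedHandle(ScopedHandle&& other) noexcept
        : handle_(other.handle_), owned_(std::exchange(other.owned_, false)) {}
    ScopedHandle& operator=(ScopedHandle&& other) noexcept {
        if (this != &other) {
            reset();
            handle_ = other.handle_;
            owned_ = std::exchange(other.owned_, false);
        }
        return *this;
    }
    fake_handle_t get() const { return handle_; }

private:
    void reset() noexcept {
        if (owned_) { fake_destroy(handle_); owned_ = false; }
    }
    fake_handle_t handle_{};
    bool owned_ = false;
};

// Same shape as the reviewed code: two handles are created, then a call fails.
// Returns true if the exception propagated; stack unwinding destroys both handles.
inline bool run_and_fail() {
    try {
        ScopedHandle input;
        ScopedHandle output;
        check(fake_failing_op(input.get()));
    } catch (const std::runtime_error&) {
        return true;
    }
    return false;
}
```

After `run_and_fail()` returns, `g_live_handles` is back to zero even though the failing call threw. The explicit `owned_` flag is one way to address the point that a struct handle has no nullptr-like empty state; a `std::unique_ptr` with a custom deleter would need a pointer-like type to express the same thing.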

KKyang · 2025-04-18T11:36:44Z

Merge after #1951 is merged

stellaraccident · 2025-04-21T23:47:43Z

I tested this on Windows, and indeed, we are going to need more work there. The issue is that you can't just have a cross-filesystem dep on a DLL like this (which is what a Python extension is) without making arrangements for how to find it at runtime (i.e. there is no RPATH on Windows).

ImportError: DLL load failed while importing rocisa: The specified module could not be found.

The Python frameworks solve this in one of two ways:

1. Bundle ROCm as a sibling to the Python binary distribution in a way that it can be found. PyTorch does this in the current version and it is as invasive as it sounds.
2. Use a Python level pre-loading mechanism to load the necessary DLLs prior to importing the Python extension (which links to DLLs by name). This is what CUDA based dependencies do and what ROCm based PyTorch will do in the next version.

Neither option is particularly satisfying for this kind of situation. In this case, since it is a build-only dep, we would most likely want to emulate the Linux RPATH mechanic to ensure the DLL is loaded properly. However, doing that properly will require a bit more pre-work.

Could we put this change on ice for a little while? I know roughly how to enable it but have other priorities right this minute, and I would like to come up with a mechanism that works for any ROCm project rather than just a one-off. If this becomes urgent, I could do something project-specific in a few hours.

stellaraccident

Marking as "request changes" per the above discussion. We can apply this once a bit more work is done on the Windows side.

stellaraccident · 2025-05-06T02:07:34Z

Thanks for rebasing. I think I know how to fix this on the Windows build, but I won't be able to get to it for some days. Should be an improvement, though; I just need to sequence it with some other work.

jayhawk-commits · 2025-06-20T19:23:31Z

Closing the pull request in this repo. Please refer to the migrated pull request for updates.

KKyang requested review from jichangjichang, vin-huang, imcarsonliao, hcman2, Serge45, Jinp800125, TonyYHsieh, solaslin and aazz44ss as code owners April 18, 2025 05:30

KKyang requested review from stellaraccident and ellosel April 18, 2025 05:31

KKyang changed the title ~~[rocisa] Use comgr instead of subprocess~~ [rocisa] Use comgr instead Apr 18, 2025

KKyang changed the title ~~[rocisa] Use comgr instead~~ [rocisa] Use comgr instead of calling amdclang++ Apr 18, 2025

hcman2 previously approved these changes Apr 18, 2025

stellaraccident requested changes Apr 18, 2025

KKyang dismissed hcman2’s stale review via fc4513d April 18, 2025 11:35

KKyang force-pushed the comgr branch from b956c9c to fc4513d Compare April 18, 2025 11:35

KKyang requested a review from a team as a code owner April 18, 2025 11:35

KKyang force-pushed the comgr branch from fc4513d to 20eb68a Compare April 18, 2025 11:35

KKyang requested a review from stellaraccident April 18, 2025 11:36

KKyang marked this pull request as draft April 18, 2025 11:36

KKyang force-pushed the comgr branch 3 times, most recently from 4d47f27 to 70046a7 Compare April 21, 2025 02:40

KKyang marked this pull request as ready for review April 21, 2025 02:40

KKyang force-pushed the comgr branch 3 times, most recently from e1c0a12 to 1bd651c Compare April 21, 2025 04:03

stellaraccident requested changes Apr 22, 2025

[rocisa] Use comgr instead of calling amdclang++

712f8c8

KKyang force-pushed the comgr branch from 1bd651c to 712f8c8 Compare May 6, 2025 01:03

Merge branch 'develop' into comgr

3e27d94

KKyang requested a review from stellaraccident June 9, 2025 00:27

assistant-librarian bot mentioned this pull request Jun 20, 2025

[rocisa] Use comgr instead of calling amdclang++ ROCm/rocm-libraries#269

Open

jayhawk-commits closed this Jun 20, 2025
