Add safe API for CUDA constant memory operations by wizenink · Pull Request #478 · chelsea0x3b/cudarc

wizenink · 2025-11-01T09:37:04Z

Adds support for accessing and copying data to __constant__ memory in CUDA modules.

Changes

Result layer: Added module::get_global() wrapper for cuModuleGetGlobal_v2
Safe API: New CudaSymbol type to represent global/constant symbols
Memory operations: Added CudaStream::memcpy_htos() and memcpy_dtos() for copying to symbols
Example: Added 09-constant-memory.rs demonstrating polynomial evaluation using constant memory

Usage

let symbol = module.get_global("my_constant")?;
stream.memcpy_htos(&data, &symbol)?;

Closes the gap in constant memory support by providing a safe, ergonomic API for symbol access and data transfer.
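
For readers who want something to check GPU results against, here is a minimal CPU-side sketch of polynomial evaluation, the computation the new example is described as demonstrating. It does not use cudarc and is not taken from 09-constant-memory.rs. The function names `eval_poly` and `eval_poly_many`, the use of Horner's rule, and the lowest-degree-first coefficient order are all assumptions made for illustration.

```rust
/// Evaluates c[0] + c[1]*x + ... + c[n-1]*x^(n-1) using Horner's rule.
/// Coefficient order (lowest degree first) is an assumption of this sketch.
fn eval_poly(coeffs: &[f32], x: f32) -> f32 {
    // Walk from the highest-degree coefficient down to the constant term.
    coeffs.iter().rev().fold(0.0, |acc, &c| acc * x + c)
}

/// Evaluates the same polynomial at every input, mirroring what a kernel
/// with one thread per input would compute. On the GPU the coefficients
/// would be the small, read-only data placed in __constant__ memory.
fn eval_poly_many(coeffs: &[f32], xs: &[f32]) -> Vec<f32> {
    xs.iter().map(|&x| eval_poly(coeffs, x)).collect()
}
```

The coefficients are a natural fit for constant memory because every thread reads the same few values and none of them writes to them.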

gemini-code-assist

Code Review

This pull request introduces a safe API for CUDA constant memory, which is a valuable addition. The implementation is well-structured, with a new CudaSymbol type, associated memory copy functions, and a clear example demonstrating its usage. My review focuses on improving error handling in the new public APIs. I've identified a few places where the code could panic on invalid user input. By replacing these panics with Result::Err returns, the library will become more robust and user-friendly. These are high-severity issues for a library.

gemini-code-assist · 2025-11-01T09:38:35Z

+        assert!(
+            symbol.bytes >= src_bytes,
+            "Symbol size ({} bytes) is smaller than source data ({} bytes)",
+            symbol.bytes,
+            src_bytes
+        );


Similar to memcpy_htos, using assert! here can cause a panic if the source data is larger than the symbol's capacity. It's better to return a Result::Err to allow for graceful error handling by the caller.

Suggested change

-        assert!(
-            symbol.bytes >= src_bytes,
-            "Symbol size ({} bytes) is smaller than source data ({} bytes)",
-            symbol.bytes,
-            src_bytes
-        );
+        if symbol.bytes < src_bytes {
+            return Err(DriverError(sys::cudaError_enum::CUDA_ERROR_INVALID_VALUE));
+        }

wizenink · 2025-11-01

Maybe we should do this for all of the memcpy assertions; it seems reasonable.

chelsea0x3b · 2025-11-01

Potentially. Either way, that should be a separate PR.

wizenink · 2025-11-01

I'll keep this as is for now; if it's merged, I'll open another PR to change the asserts to results.
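
The trade-off discussed in this thread can be sketched without any GPU code. The snippet below is a self-contained illustration, not the library's implementation: `Symbol`, `CopyError`, and `check_fits` are hypothetical stand-ins, and the real code would return a `DriverError` as in the suggested change.

```rust
use std::fmt;

/// Hypothetical stand-in for the PR's symbol type; only its size matters here.
struct Symbol {
    bytes: usize,
}

/// Hypothetical error type for this sketch (the PR suggestion uses `DriverError`).
#[derive(Debug, PartialEq)]
enum CopyError {
    SourceTooLarge { symbol_bytes: usize, src_bytes: usize },
}

impl fmt::Display for CopyError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            CopyError::SourceTooLarge { symbol_bytes, src_bytes } => write!(
                f,
                "symbol size ({} bytes) is smaller than source data ({} bytes)",
                symbol_bytes, src_bytes
            ),
        }
    }
}

/// Returns the number of bytes to copy, or an error if `src` does not fit.
/// Unlike `assert!`, an oversized source is reported to the caller
/// instead of aborting the thread with a panic.
fn check_fits<T>(src: &[T], symbol: &Symbol) -> Result<usize, CopyError> {
    let src_bytes = std::mem::size_of_val(src);
    if symbol.bytes < src_bytes {
        return Err(CopyError::SourceTooLarge {
            symbol_bytes: symbol.bytes,
            src_bytes,
        });
    }
    Ok(src_bytes)
}
```

A caller can then propagate the failure with `?` or match on it, which is the graceful handling the review comment asks for.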

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

chelsea0x3b · 2025-11-01T14:13:37Z

+    /// let symbol = module.get_global("my_const")?;
+    /// stream.memcpy_htos(&data, &symbol)?;
+    /// ```
+    pub fn get_global(self: &Arc<Self>, name: &str) -> Result<CudaSymbol, DriverError> {


I'm thinking it might be better to just return a CudaSlice<u8> from this function. This has the following benefits:

We don't need the CudaSymbol struct at all, and we can reuse the existing memcpy functions.

CudaSlice can already be transmuted to different types.

CudaSlice has built-in support for event tracking, which CudaSymbol currently lacks.

We don't need to add multi-stream synchronization support to CudaSymbol.

Thoughts? Open to discussion, but definitely leaning towards CudaSlice<u8>

wizenink · 2025-11-01

CudaSlice seems right. I'll check the implementation details and push another commit.

wizenink · 2025-11-01

get_global should now return a CudaSlice :)
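
Returning a byte-typed slice means callers reason about a symbol's size in bytes and reinterpret it as a typed view when needed. The host-only sketch below models that bookkeeping. It does not call cudarc, and `typed_len`, `f32s_to_bytes`, and `bytes_to_f32s` are hypothetical helper names introduced purely for illustration.

```rust
use std::mem::size_of;

/// Number of `T` elements that exactly fill `byte_len` bytes, or `None` if
/// the length is not a whole multiple of `size_of::<T>()` or `T` is zero-sized.
fn typed_len<T>(byte_len: usize) -> Option<usize> {
    let elem = size_of::<T>();
    if elem == 0 || byte_len % elem != 0 {
        return None;
    }
    Some(byte_len / elem)
}

/// Serialises `f32` values into native-endian bytes, the layout a raw
/// byte copy of the slice would produce.
fn f32s_to_bytes(values: &[f32]) -> Vec<u8> {
    values.iter().flat_map(|v| v.to_ne_bytes()).collect()
}

/// Reinterprets a byte buffer as `f32` values, rejecting buffers whose
/// length is not a multiple of four bytes.
fn bytes_to_f32s(bytes: &[u8]) -> Option<Vec<f32>> {
    typed_len::<f32>(bytes.len())?;
    Some(
        bytes
            .chunks_exact(size_of::<f32>())
            .map(|c| f32::from_ne_bytes([c[0], c[1], c[2], c[3]]))
            .collect(),
    )
}
```

The length check is the important part: a typed view of a byte buffer is only well defined when the byte count divides evenly by the element size.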

chelsea0x3b · 2025-11-01T14:14:13Z

Love this addition - just need to discuss the return from get_global. Thank you for this work

Add safe API for CUDA constant memory operations

29fab7a

wizenink requested a review from chelsea0x3b as a code owner November 1, 2025 09:37

gemini-code-assist Bot reviewed Nov 1, 2025

Don't panic on invalid name

9a8045f

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

chelsea0x3b reviewed Nov 1, 2025

Moved get_global to return CudaSlice<u8>

0b7fb97

chelsea0x3b merged commit 12cbbab into chelsea0x3b:main Nov 1, 2025
33 checks passed

chelsea0x3b mentioned this pull request Nov 1, 2025

Constant memory #370

Closed

wizenink deleted the copy_to_global branch November 4, 2025 16:52

Add safe API for CUDA constant memory operations #478: chelsea0x3b merged 3 commits into chelsea0x3b:main from wizenink:copy_to_global

2 participants
