
Conversation

@ldematte
Contributor

While profiling cuvs-java, we found that allocating a PinnedMemoryBuffer for each host->device or device->host memory copy was unnecessary and wasteful.
This PR moves the allocation of a PinnedMemoryBuffer to CuVSResourcesImpl, so that the buffer can be cached and reused. Since CuVSResources instances are already meant to be per-thread, this is safe: the PinnedMemoryBuffer will never be used concurrently.
To do this cleanly, we introduced two named ScopedAccess classes and a helper method that resolves the internal MemorySegment used by native functions to access the buffer, without exposing it through the public interface.
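For illustration, the access pattern could look roughly like the sketch below. The names (ScopedAccessSketch, HostScopedAccess, DeviceScopedAccess, segmentOf) are invented for this example and are not the actual classes introduced by the PR.

```java
import java.lang.foreign.MemorySegment;

// Illustrative sketch only: these names are invented for the example and are not
// the actual classes added by the PR. The type is package-private, so the raw
// MemorySegment never leaks into the public API.
interface ScopedAccessSketch {
    // Two named access types, one per copy direction.
    record HostScopedAccess(MemorySegment segment) implements ScopedAccessSketch {}
    record DeviceScopedAccess(MemorySegment segment) implements ScopedAccessSketch {}

    MemorySegment segment();

    // Helper used by the internal native call sites to reach the buffer's segment.
    static MemorySegment segmentOf(ScopedAccessSketch access) {
        return access.segment();
    }
}
```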

@copy-pr-bot

copy-pr-bot bot commented Oct 21, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ldematte ldematte changed the base branch from main to branch-25.12 October 21, 2025 12:25
@ldematte ldematte changed the base branch from branch-25.12 to main October 21, 2025 14:36
@cjnolet cjnolet added improvement Improves an existing functionality non-breaking Introduces a non-breaking change labels Oct 22, 2025
try (var localArena = Arena.ofConfined()) {
    MemorySegment pointer = localArena.allocate(C_POINTER);
-   checkCudaError(cudaMallocHost(pointer, bufferBytes), "cudaMallocHost");
+   checkCudaError(cudaMallocHost(pointer, PinnedMemoryBuffer.CHUNK_BYTES), "cudaMallocHost");
Contributor

Please correct me if I'm wrong.

There were a couple of advantages to the previous approach that now seem to be missing:

  1. If the total number of bytes did not exceed CHUNK_BYTES, we would make a smaller allocation. By removing this check, we're guaranteeing an 8MB allocation per thread just for pinned memory. Is that wise?
  2. There was protection against a very large row size (exceeding the 8MB CHUNK_BYTES). Now we ignore the possibility. Wouldn't that mean a buffer overrun when a single row is copied? Or is that not deemed possible now?

Contributor Author

> If the total number of bytes did not exceed CHUNK_BYTES, we would make a smaller allocation. By removing this check, we're guaranteeing an 8MB allocation per thread just for pinned memory. Is that wise?

We discussed this with @achirkin and think the trade-off is worth it. It's true we are allocating 8MB per resource, but we only do so on demand, and avoiding a re-allocation on every copy (which is costly, due to the lock on the CUDA context) outweighs the extra memory.
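A minimal sketch of the on-demand part, assuming hypothetical names and with a plain Arena allocation standing in for the cudaMallocHost call:

```java
import java.lang.foreign.Arena;
import java.lang.foreign.MemorySegment;

// Sketch only: lazy, per-resource allocation of the 8MB staging buffer.
// The real code allocates pinned memory via cudaMallocHost; a confined Arena
// allocation stands in for it here.
final class LazyPinnedBufferSketch implements AutoCloseable {
    static final long CHUNK_BYTES = 8L << 20; // 8 MB

    private final Arena arena = Arena.ofConfined();
    private MemorySegment buffer; // stays null until the first copy needs it

    MemorySegment get() {
        if (buffer == null) {
            // Paid once per resource instead of once per copy; this is where the
            // costly cudaMallocHost call (CUDA context lock) would happen.
            buffer = arena.allocate(CHUNK_BYTES);
        }
        return buffer;
    }

    @Override
    public void close() {
        arena.close(); // the real code would also call cudaFreeHost
    }
}
```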

> There was protection against a very large row size (exceeding the 8MB CHUNK_BYTES). Now we ignore the possibility. Wouldn't that mean a buffer overrun when a single row is copied? Or is that not deemed possible now?

That's true. It's very unlikely, I would say impossible in our scenarios, but the protection is missing. Let me add it back (although it will need to go in a different place).
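One possible shape for that protection is to stage the copy through the pinned buffer in chunk-sized pieces; the sketch below is only an assumption about how it could look (hypothetical names, not the change that eventually landed):

```java
import java.lang.foreign.MemorySegment;

// Sketch only: copying `src` through a fixed-size pinned staging buffer in
// chunks, so a source larger than the buffer (e.g. a very large row) cannot
// overrun it.
final class ChunkedCopySketch {
    interface DeviceSink {
        // Hypothetical callback standing in for the device-side copy
        // (e.g. a cudaMemcpyAsync on the staged bytes).
        void copyToDevice(MemorySegment staged, long bytes);
    }

    static void copyThroughPinned(MemorySegment src, MemorySegment pinned, DeviceSink sink) {
        long offset = 0;
        while (offset < src.byteSize()) {
            long len = Math.min(pinned.byteSize(), src.byteSize() - offset);
            MemorySegment.copy(src, offset, pinned, 0, len);
            sink.copyToDevice(pinned, len);
            offset += len;
        }
    }
}
```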

Contributor

I'm 👍 on the change so far.

> It's very unlikely, I would say impossible in our scenarios, but the protection is missing. Let me add it back...

I'll take another look when this changes.

Contributor Author

Yes please :) I'll ping you when I push the change!

@mythrocks
Contributor

/ok to test aa5e469

Contributor

@mythrocks mythrocks left a comment

LGTM.

@mythrocks
Contributor

I've rebased this PR and resolved a conflict.
But it looks like we're going to need to fix the copyright headers on the changed files.

@mythrocks mythrocks self-requested a review October 28, 2025 22:14
@benfred
Member

benfred commented Oct 28, 2025

/ok to test d7fcdb3

@benfred
Member

benfred commented Oct 28, 2025

/ok to test c4f0570

@benfred
Member

benfred commented Oct 29, 2025

/ok to test d792fce

@mythrocks
Contributor

/ok to test d792fce

@benfred: This PR will likely continue to fail CI until the copyright headers are fixed for this change. Plus, there's another change pending (for handling cases where a single row exceeds the pinned buffer size).

@benfred
Member

benfred commented Oct 29, 2025

@mythrocks the copyright issue should be fixed with c4f0570, but CI is still failing here with CagraMultiThreadStabilityIT.testQueryingUsingMultipleThreadsWithPrivateResources:62->testQueryingUsingMultipleThreads:162 MultiThreaded stablity test failed: null errors in the Java unit tests =(

