Add environment `DeviceMerge` by gonidelis · Pull Request #7969 · NVIDIA/cccl

gonidelis · 2026-03-10T03:57:29Z

fixes #7543

copy-pr-bot · 2026-03-10T03:57:32Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

github-actions · 2026-03-10T09:32:54Z

😬 CI Workflow Results

🟥 Finished in 2h 04m: Pass: 15%/249 | Total: 4d 12h | Max: 2h 03m | Hits: 88%/55533

See results here.

bernhardmgruber · 2026-03-10T09:39:25Z

cub/cub/device/device_merge.cuh

+            ::cuda::std::enable_if_t<!::cuda::std::is_same_v<KeyIteratorIn1, void*>
+                                       && !::cuda::std::is_same_v<KeyIteratorIn1, ::cuda::std::nullptr_t>,
+                                     int> = 0>


Remark: that's an interesting constraint since it guards against using the old overload with nullptr. It's fine. You need it here I guess because you cannot constraint a second template argument since it's just int64 here and a size_t from the other overload would convert to int64 well.

bernhardmgruber · 2026-03-10T09:54:30Z

cub/cub/device/device_merge.cuh

+    using requested_determinism_t =
+      ::cuda::std::execution::__query_result_or_t<requirements_t,
+                                                  ::cuda::execution::determinism::__get_determinism_t,
+                                                  ::cuda::execution::determinism::run_to_run_t>;
+    static_assert(!::cuda::std::is_same_v<requested_determinism_t, ::cuda::execution::determinism::gpu_to_gpu_t>,
+                  "gpu_to_gpu determinism is not supported for unstable device merge");


Important: I need to think about this, since an unstable algorithm could still be deterministic (even across multiple GPUs). How did you conclude that the merge path implementation is run_to_run deterministic?

gonidelis added 2 commits March 9, 2026 20:53

Add env DeviceMerge

e4e153b

Add env DeviceMerge

f07ec91

github-project-automation bot added this to CCCL Mar 10, 2026

github-project-automation bot moved this to Todo in CCCL Mar 10, 2026

cccl-authenticator-app bot moved this from Todo to In Progress in CCCL Mar 10, 2026

gonidelis changed the title ~~Merge env~~ Add environment DeviceMerge Mar 10, 2026

gonidelis marked this pull request as ready for review March 10, 2026 07:26

gonidelis requested a review from a team as a code owner March 10, 2026 07:26

gonidelis requested a review from elstehle March 10, 2026 07:26

gonidelis enabled auto-merge (squash) March 10, 2026 07:26

cccl-authenticator-app bot moved this from In Progress to In Review in CCCL Mar 10, 2026

gonidelis requested a review from bernhardmgruber March 10, 2026 07:27

bernhardmgruber reviewed Mar 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add environment `DeviceMerge` #7969

Add environment `DeviceMerge` #7969
gonidelis wants to merge 2 commits intoNVIDIA:mainfrom
gonidelis:merge_env

gonidelis commented Mar 10, 2026 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

bernhardmgruber Mar 10, 2026

Uh oh!

bernhardmgruber Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gonidelis commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

copy-pr-bot bot commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 10, 2026

😬 CI Workflow Results

🟥 Finished in 2h 04m: Pass: 15%/249 | Total: 4d 12h | Max: 2h 03m | Hits: 88%/55533

Uh oh!

bernhardmgruber Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

bernhardmgruber Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gonidelis commented Mar 10, 2026 •

edited

Loading