Enable precision 4 for HLLPP by res-life · Pull Request #14430 · NVIDIA/spark-rapids

res-life · 2026-03-18T06:24:12Z

Fixes #12452.

Description

Enable precision 4 for HLLPP(Hyper Log Log Plus Plus) since cuDF fixed the bug NVIDIA/cuCollections#696

Depends on

bump cuco version: Update cuco version to fetch the cuda fancy iterators rapidsai/rapids-cmake#987

Checklists

This PR has added documentation for new or modified features or behaviors.
This PR has added new tests or modified existing tests to cover new code paths.
(Please explain in the PR description how the new code paths are tested, such as names of the new/existing tests that cover them.)
Performance testing has been performed and its results are added in the PR description. Or, an issue has been filed with a link in the PR description.

Signed-off-by: Chong Gao chongg@nvidia.com

Signed-off-by: Chong Gao <res_life@163.com>

greptile-apps · 2026-03-18T06:27:02Z

Greptile Summary

This PR enables GPU support for HLLPP (HyperLogLog++) at precision 4 by fixing the lower-bound guard in GpuOverrides.scala and removing the corresponding xfail markers from the integration tests. The change is directly tied to a cuCollections upstream bug fix (NVIDIA/cuCollections#696) and is a minimal, targeted fix.

Key changes:

GpuOverrides.scala: The precision guard is updated from precision <= 4 (blocking precision 4) to precision < 4 (allowing precision 4). The new supported range is [4, 14], matching the comment update. The error message is also improved to clearly state "out of range" and reference [4, 14].
hyper_log_log_plus_plus_test.py: xfail markers for relativeSD=0.3 (precision 4) are removed from both _relativeSD and the test_hllpp_precisions_groupby parametrize decorator, so precision 4 is now tested as a normal (expected-to-pass) case. The old conditional list-comprehension wrapping the mark is eliminated, simplifying the code.

No logic regressions are introduced; all previously supported precisions [5, 14] remain supported, and precision 4 is now added.

Confidence Score: 5/5

Safe to merge — the change is a one-line predicate fix with clean test coverage and no regressions to existing supported precisions.

Both changes are minimal and correct: the Scala guard predicate is fixed to include precision 4, the error message and comments are updated consistently, and the Python tests properly promote precision 4 from xfail to a first-class passing test. No new logic is added that could introduce regressions. All previously supported precisions [5, 14] are unaffected.

No files require special attention.

Important Files Changed

Filename	Overview
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuOverrides.scala	Lower bound of precision check changed from `<= 4` (exclusive of 4) to `< 4` (inclusive of 4), expanding GPU-supported precision range from [5, 14] to [4, 14]; error message and comments updated accordingly.
integration_tests/src/main/python/hyper_log_log_plus_plus_test.py	Removed `xfail` marks for precision 4 (`relativeSD=0.3`) from both `_relativeSD` list and `test_hllpp_precisions_groupby` parametrize decorator; simplifies test scaffolding and confirms the bug fix is testable end-to-end.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["HyperLogLogPlusPlus expr\n(tagExprForGpu)"] --> B["Compute precision\nfrom relativeSD"]
    B --> C{"precision < 4\nOR\nprecision > 14?"}
    C -- "Yes (out of range)" --> D["willNotWorkOnGpu\n(fallback to CPU)"]
    C -- "No (in range [4,14])" --> E["GPU supported\n(previously [5,14])"]
    E --> F["convertToGpu\nGpuHyperLogLogPlusPlus"]

    style E fill:#22c55e,color:#fff
    style D fill:#ef4444,color:#fff

_{Reviews (5): Last reviewed commit: "Format" | Re-trigger Greptile}

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuOverrides.scala

….scala Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

res-life · 2026-03-30T02:33:38Z

build

res-life · 2026-03-30T06:02:03Z

build

nvauto · 2026-03-30T06:29:10Z

NOTE: release/26.04 has been created from main. Please retarget your PR to release/26.04 if it should be included in the release.

Signed-off-by: Chong Gao <res_life@163.com>

res-life · 2026-03-31T02:35:15Z

build

Fix HLLPP bug when precision is 4

c3469ee

Signed-off-by: Chong Gao <res_life@163.com>

greptile-apps bot reviewed Mar 18, 2026

View reviewed changes

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuOverrides.scala Outdated Show resolved Hide resolved

Update sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuOverrides…

a1ed823

….scala Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

res-life self-assigned this Mar 18, 2026

res-life closed this Mar 30, 2026

res-life reopened this Mar 30, 2026

Format

7ac5cca

Signed-off-by: Chong Gao <res_life@163.com>

res-life marked this pull request as draft March 31, 2026 02:31

res-life marked this pull request as ready for review March 31, 2026 08:09

res-life changed the title ~~Fix HLLPP bug when precision is 4~~ Enable precision 4 for HLLPP Mar 31, 2026

res-life mentioned this pull request Mar 31, 2026

Remove deprecated GpuTimeZoneDB methods NVIDIA/spark-rapids-jni#4422

Merged

res-life requested review from firestarman, revans2 and thirtiseven March 31, 2026 08:37

thirtiseven approved these changes Mar 31, 2026

View reviewed changes

res-life merged commit 0cd5dae into NVIDIA:main Mar 31, 2026
50 of 51 checks passed

sameerz added the bug Something isn't working label Mar 31, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable precision 4 for HLLPP#14430

Enable precision 4 for HLLPP#14430
res-life merged 3 commits intoNVIDIA:mainfrom
res-life:fix-hllpp-precision-4

res-life commented Mar 18, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Mar 18, 2026 •

edited

Loading

Uh oh!

Uh oh!

res-life commented Mar 30, 2026

Uh oh!

res-life commented Mar 30, 2026

Uh oh!

nvauto commented Mar 30, 2026

Uh oh!

res-life commented Mar 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

res-life commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklists

Uh oh!

greptile-apps bot commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

res-life commented Mar 30, 2026

Uh oh!

res-life commented Mar 30, 2026

Uh oh!

nvauto commented Mar 30, 2026

Uh oh!

res-life commented Mar 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

res-life commented Mar 18, 2026 •

edited

Loading

greptile-apps bot commented Mar 18, 2026 •

edited

Loading