Remove deprecated GpuTimeZoneDB cache overload usage by res-life · Pull Request #14500 · NVIDIA/spark-rapids

res-life · 2026-03-31T07:57:19Z

Description

Previously, we use hybrid mode:

before 2200 year, generate a transition cache for DST timezone, and run on GPU for timezone rebasing.
This mehod wastes GPU memories.
after 2200 year, use CPU to do timezone rebasing.
This is slow.

Currently, we do not use hybrid mode any more, so remove the deprecated methods.

Replace deprecated GpuTimeZoneDB.cacheDatabaseAsync(int) and cacheDatabase(int) usage with the no-arg APIs.
Remove the now-unused spark.rapids.timezone.transitionCache.maxYear config and obsolete docs entry.
Update timezone perf and cast tests to call the replacement API.

Related change

Remove deprecated GpuTimeZoneDB methods spark-rapids-jni#4422

Checklists

This PR has added documentation for new or modified features or behaviors.
This PR has added new tests or modified existing tests to cover new code paths.
(Updated existing timezone perf and cast tests to exercise the replacement API entry points.)
Performance testing has been performed and its results are added in the PR description. Or, an issue has been filed with a link in the PR description.

Signed-off-by: Chong Gao chongg@nvidia.com

greptile-apps · 2026-03-31T07:58:47Z

Greptile Summary

This PR completes the removal of the hybrid timestamp-processing mode by replacing all usages of the deprecated GpuTimeZoneDB.cacheDatabaseAsync(int) / cacheDatabase(int) overloads with their no-arg equivalents, and deletes the now-meaningless spark.rapids.timezone.transitionCache.maxYear configuration key alongside its documentation entry.

Key changes:

RapidsConf.scala: TIMESTAMP_RULES_END_YEAR config entry and timestampRulesEndYear lazy accessor are fully removed; no remaining references exist anywhere in the codebase.
Plugin.scala: Executor startup now calls GpuTimeZoneDB.cacheDatabaseAsync() unconditionally rather than passing a year ceiling.
cast_test.py: Six integration tests updated to call the no-arg cacheDatabase().
TimeZonePerfSuite.scala: Eight perf tests updated; stale timestampRulesEndYear local field removed; copyright year updated to 2026.
advanced_configs.md: Stale config table row removed.

Confidence Score: 5/5

Safe to merge — mechanical deprecation removal with complete and consistent updates across all call sites and documentation.

All usages of the deprecated int-overload APIs are replaced, no remaining references to the removed config exist anywhere in the codebase, and the documentation is kept in sync. No logic changes beyond removing the year ceiling, which is the intended behaviour.

No files require special attention.

Important Files Changed

Filename	Overview
sql-plugin/src/main/scala/com/nvidia/spark/rapids/RapidsConf.scala	Removes TIMESTAMP_RULES_END_YEAR config entry and its lazy accessor timestampRulesEndYear — clean deletion with no remaining references.
sql-plugin/src/main/scala/com/nvidia/spark/rapids/Plugin.scala	Executor plugin init updated to call cacheDatabaseAsync() with no args instead of passing conf.timestampRulesEndYear.
integration_tests/src/main/python/cast_test.py	Six cast-to-timestamp tests updated to call no-arg cacheDatabase() instead of cacheDatabase(2200).
tests/src/test/scala/com/nvidia/spark/rapids/timezone/TimeZonePerfSuite.scala	Eight perf tests updated to call cacheDatabase() with no args; removes the now-unused timestampRulesEndYear private field; copyright year bumped to 2026.
docs/additional-functionality/advanced_configs.md	Documentation entry for spark.rapids.timezone.transitionCache.maxYear removed to match the config deletion.

Sequence Diagram

sequenceDiagram
    participant EP as RapidsExecutorPlugin
    participant TZDB as GpuTimeZoneDB (JNI)
    participant GPU as GPU Memory

    Note over EP: Before: cacheDatabaseAsync(conf.timestampRulesEndYear)<br/>After: cacheDatabaseAsync()
    EP->>TZDB: cacheDatabaseAsync()
    TZDB-->>GPU: Load full timezone transition table asynchronously
    Note over TZDB,GPU: No year ceiling — full table always cached

    Note over EP: Tests (cast_test.py / TimeZonePerfSuite)
    EP->>TZDB: cacheDatabase()
    TZDB-->>GPU: Synchronous cache load
    EP->>EP: Thread.sleep(5ms)
    EP->>GPU: Execute timezone-dependent operations

_{Reviews (2): Last reviewed commit: "Remove deprecated GpuTimeZoneDB cache ov..." | Re-trigger Greptile}

res-life · 2026-03-31T08:09:01Z

build

Switch the plugin and timezone tests to the no-arg cache APIs so they stop relying on deprecated max-year overloads. This also removes the unused transition cache config and docs entry that no longer affect behavior. Made-with: Cursor Signed-off-by: Chong Gao <res_life@163.com>

firestarman

LGTM

res-life · 2026-04-01T02:11:21Z

build

## Summary - remove the deprecated `GpuTimeZoneDB.cacheDatabaseAsync(int)`, `cacheDatabase(int)`, and `getTransitions()` methods - update the remaining timezone DB test to use the supported no-arg cache API ## Related change * NVIDIA/spark-rapids#14500 Signed-off-by: Chong Gao <res_life@163.com> Co-authored-by: Chong Gao <res_life@163.com>

res-life force-pushed the chongg/remove-deprecated-gputimezonedb-cache-api branch from 89ada20 to bc69ad8 Compare March 31, 2026 08:21

res-life mentioned this pull request Mar 31, 2026

Remove deprecated GpuTimeZoneDB methods NVIDIA/spark-rapids-jni#4422

Merged

thirtiseven approved these changes Mar 31, 2026

View reviewed changes

res-life requested a review from firestarman March 31, 2026 08:38

firestarman approved these changes Mar 31, 2026

View reviewed changes

res-life merged commit eb47571 into NVIDIA:main Apr 1, 2026
50 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove deprecated GpuTimeZoneDB cache overload usage#14500

Remove deprecated GpuTimeZoneDB cache overload usage#14500
res-life merged 1 commit intoNVIDIA:mainfrom
res-life:chongg/remove-deprecated-gputimezonedb-cache-api

res-life commented Mar 31, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Mar 31, 2026 •

edited

Loading

Uh oh!

res-life commented Mar 31, 2026

Uh oh!

firestarman left a comment

Uh oh!

res-life commented Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

res-life commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related change

Checklists

Uh oh!

greptile-apps bot commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

res-life commented Mar 31, 2026

Uh oh!

firestarman left a comment

Choose a reason for hiding this comment

Uh oh!

res-life commented Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

res-life commented Mar 31, 2026 •

edited

Loading

greptile-apps bot commented Mar 31, 2026 •

edited

Loading