Skip to content

Add Python tests for different compression implementations #18710

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 9 commits into from
May 13, 2025

Conversation

vuule
Copy link
Contributor

@vuule vuule commented May 7, 2025

Description

Add tests that are parameterized on the compression codec and the implementation (nvCOMP, internal kernel, host) to ensure all available implementations are tested for all supported codecs.

These tests cover codecs that cannot be covered by C++ tests because we don't have writer support and thus need to use another library to generate input.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Copy link

copy-pr-bot bot commented May 7, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions bot added the Python Affects Python cuDF API. label May 7, 2025
@vuule vuule added tests Unit testing for project non-breaking Non-breaking change improvement Improvement / enhancement to an existing function and removed Python Affects Python cuDF API. labels May 7, 2025
@github-actions github-actions bot added the Python Affects Python cuDF API. label May 7, 2025
@@ -1995,3 +2042,30 @@ def test_orc_reader_desynced_timestamp(datadir, inputfile):
got = cudf.read_orc(path)

assert_frame_equal(cudf.from_pandas(expect), got)


@pytest.mark.parametrize("compression", ["LZ4", "SNAPPY", "ZLIB", "ZSTD"])
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could not include brotli, seems like pandas does not support it on the writer side.

@vuule vuule marked this pull request as ready for review May 7, 2025 22:43
@vuule vuule requested a review from a team as a code owner May 7, 2025 22:43
@vuule vuule requested review from vyasr and mroeschke May 7, 2025 22:43
@vuule vuule added this to libcudf May 8, 2025
@vuule vuule moved this to Burndown in libcudf May 8, 2025
@GPUtester GPUtester moved this to In Progress in cuDF Python May 9, 2025
@vuule vuule requested a review from mroeschke May 9, 2025 18:58
@vuule
Copy link
Contributor Author

vuule commented May 13, 2025

/merge

@rapids-bot rapids-bot bot merged commit c44b185 into rapidsai:branch-25.06 May 13, 2025
123 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in cuDF Python May 13, 2025
@vuule vuule deleted the test-python-comp branch May 13, 2025 16:32
@vuule vuule moved this from Burndown to Landed in libcudf May 13, 2025
@vuule
Copy link
Contributor Author

vuule commented May 13, 2025

issue #18445

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improvement / enhancement to an existing function non-breaking Non-breaking change Python Affects Python cuDF API. tests Unit testing for project
Projects
Status: Done
Status: Landed
Development

Successfully merging this pull request may close these issues.

2 participants