Expose cuDF Parquet writer row group size configs by thirtiseven · Pull Request #14783 · NVIDIA/spark-rapids

thirtiseven · 2026-05-12T09:27:57Z

Related to #9126.

Description

This PR exposes two internal Spark RAPIDS configs for tuning cuDF Parquet writer row group limits:

spark.rapids.sql.format.parquet.writer.rowGroupSizeRows
spark.rapids.sql.format.parquet.writer.rowGroupSizeBytes

These configs are cuDF-specific pass-through knobs and are documented as best-effort limits. They are not mapped from Spark parquet.block.size, because cuDF row group sizing is based on uncompressed estimates and page-fragment boundaries rather than Spark exact parquet.block.size behavior.

The implementation wires the options into both the standard GPU Parquet writer and the Hive GPU Parquet writer. When the byte limit is set, the standard Parquet writer factory also uses it for partition flush sizing so concurrent output writer buffering is consistent with the configured row group byte target.

This PR also adds a warning when parquet.block.size is set to a non-default value for GPU Parquet writes. The warning explains that RAPIDS GPU Parquet writer does not apply Spark CPU writer row group sizing semantics for parquet.block.size, points users to spark.rapids.sql.format.parquet.write.enabled=false if they require CPU writer behavior, and lists the two internal RAPIDS-specific row group tuning configs for experimentation.

Tests were added to ParquetWriterSuite to verify that the row-based and byte-based configs affect the written Parquet row groups. The rows test uses an observable row count because cuDF does not split row groups below its page fragment granularity. The suite also covers the parquet.block.size warning helper for unset/default/non-default values.

Checklists

Documentation

Updated for new or modified user-facing features or behaviors
No user-facing change

Testing

Added or modified tests to cover new code paths
Covered by existing tests
(Please provide the names of the existing tests in the PR description.)
Not required

Performance

Tests ran and results are added in the PR description
Issue filed with a link in the PR description
Not required

Signed-off-by: Haoyang Li <haoyangl@nvidia.com>

thirtiseven · 2026-05-12T09:29:40Z

@greptile full review

greptile-apps · 2026-05-12T09:32:56Z

Greptile Summary

This PR exposes two internal cuDF pass-through knobs — spark.rapids.sql.format.parquet.writer.rowGroupSizeRows and spark.rapids.sql.format.parquet.writer.rowGroupSizeBytes — so power users can tune GPU Parquet row-group sizing without falling back to the CPU writer. Both configs are wired into GpuParquetWriter and GpuHiveParquetWriter, partitionFlushSize is updated to honour the byte limit on both write paths, and a driver-side warning is emitted when the unrelated Spark parquet.block.size setting is detected.

New RapidsConf keys (PARQUET_WRITER_ROW_GROUP_SIZE_ROWS, PARQUET_WRITER_ROW_GROUP_SIZE_BYTES) are .internal(), createOptional, with input validation, and are passed through to both the standard and Hive GPU Parquet writer paths.
partitionFlushSize is now overridden in both GpuParquetFileFormat and GpuHiveParquetFileFormat factory classes to keep concurrent-write partition buffer flushing consistent with the configured byte target.
New unit tests verify row-group splitting, flush-size propagation, and the block-size warning logic; the bytes-based splitting test uses hardcoded arithmetic that ties assertions to cuDF's internal per-row estimation, which may be fragile across cuDF versions.

Confidence Score: 5/5

Safe to merge; the new code paths add optional pass-through knobs with no effect when unset, both write paths are updated symmetrically, and the warning logic is covered by unit tests.

The functional changes are additive and gated behind optional configs that default to None, so existing Parquet write behaviour is entirely unchanged. Both the standard and Hive GPU writer paths are treated consistently. The only concern is that the bytes-splitting test ties assertions to cuDF's internal per-row estimation math, which could become a flaky failure if cuDF changes how it accounts for page overhead — but this does not affect production correctness.

tests/src/test/scala/com/nvidia/spark/rapids/ParquetWriterSuite.scala — the byte-budget test assertions are fragile; the production files are clean.

Important Files Changed

Filename	Overview
sql-plugin/src/main/scala/com/nvidia/spark/rapids/RapidsConf.scala	Adds two new internal configs PARQUET_WRITER_ROW_GROUP_SIZE_ROWS (integerConf, positive) and PARQUET_WRITER_ROW_GROUP_SIZE_BYTES (bytesConf, ≥1024), both createOptional and correctly documented.
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetFileFormat.scala	Adds parquetBlockSizeWarning utility, wires the two new row-group configs into GpuParquetWriter, and updates partitionFlushSize to honour the byte config; logic and resource handling are correct.
sql-plugin/src/main/scala/org/apache/spark/sql/hive/rapids/GpuHiveFileFormat.scala	Mirrors the standard writer: adds Logging mixin, emits the block-size warning, wires row-group configs into GpuHiveParquetWriter, and overrides partitionFlushSize; parity with GpuParquetFileFormat is correct.
tests/src/test/scala/com/nvidia/spark/rapids/ParquetWriterSuite.scala	Adds four new tests; the bytes test hard-codes cuDF per-row estimation math (16 bytes/row, 512-byte overhead constant) that may become fragile if cuDF internals change.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[prepareWrite called] --> B{non-default BLOCK_SIZE?}
    B -- yes --> C[logWarning on driver]
    B -- no --> D[continue]
    C --> D
    D --> E[Read rowGroupSizeRows and rowGroupSizeBytes from RapidsConf]
    E --> F[ColumnarOutputWriterFactory]
    F --> G[partitionFlushSize uses rowGroupSizeBytes if set]
    F --> H{writer type}
    H -- standard --> I[GpuParquetWriter]
    H -- hive --> J[GpuHiveParquetWriter]
    I --> K[builder.withRowGroupSizeRows / withRowGroupSizeBytes]
    J --> K
    K --> L[Table.writeParquetChunked]

_{Reviews (6): Last reviewed commit: "block size warning" | Re-trigger Greptile}

Signed-off-by: Haoyang Li <haoyangl@nvidia.com>

pmattione-nvidia · 2026-05-13T15:18:56Z

+        val rowGroupCounts = getSingleParquetFileRowGroupCounts(spark, writePath)
+        assert(rowGroupCounts.length > 1, s"Expected multiple row groups, got $rowGroupCounts")
+        assertResult(10000L) {
+          rowGroupCounts.sum


check that each row group is less than #rows and #bytes.

Do any of these tests actually check the byte size of the row group that is written? It doesn't look like it?

Good catch. Yesterday I avoided a direct totalByteSize <= rowGroupSizeBytes assertion because BlockMetaData.getTotalByteSize includes Parquet page/encoding overhead; with a 1024-byte limit I saw footer sizes like 1064 bytes even though cuDF was honoring the limit based on its uncompressed data-size estimate.

I updated the test to check both now: the estimated data bytes used for cuDF row-group splitting, and the actual footer totalByteSize with a small 512-byte overhead allowance.

Signed-off-by: Haoyang Li <haoyangl@nvidia.com>

thirtiseven · 2026-05-19T01:53:14Z

build

Expose Expose cuDF Parquet writer row group size configs

bf2e73c

Signed-off-by: Haoyang Li <haoyangl@nvidia.com>

thirtiseven changed the title ~~[FEA] Expose cuDF Parquet writer row group size configs~~ Expose cuDF Parquet writer row group size configs May 12, 2026

thirtiseven self-assigned this May 12, 2026

greptile-apps Bot reviewed May 12, 2026

View reviewed changes

Comment thread sql-plugin/src/main/scala/org/apache/spark/sql/hive/rapids/GpuHiveFileFormat.scala

Comment thread tests/src/test/scala/com/nvidia/spark/rapids/ParquetWriterSuite.scala Outdated

thirtiseven added 2 commits May 13, 2026 12:13

Address parquet row group review comments

2fc1f85

add new line

f405cee

Signed-off-by: Haoyang Li <haoyangl@nvidia.com>

thirtiseven marked this pull request as ready for review May 13, 2026 06:30

thirtiseven requested a review from pmattione-nvidia May 13, 2026 06:31

pmattione-nvidia reviewed May 13, 2026

View reviewed changes

thirtiseven added 4 commits May 14, 2026 09:53

Strengthen parquet row group size test

bf20260

Fix parquet writer test line length

aefefb9

Check parquet row group footer byte size

a85f9ae

block size warning

00cdff8

Signed-off-by: Haoyang Li <haoyangl@nvidia.com>

pmattione-nvidia approved these changes May 18, 2026

View reviewed changes

thirtiseven merged commit bdf9542 into NVIDIA:main May 19, 2026
49 checks passed

thirtiseven deleted the parquet-row-group-size-config-main branch May 19, 2026 05:22

This was referenced May 19, 2026

[FEA] Expose max_dictionary_size and dictionary_policy in cuDF Java/JNI Parquet writer bindings rapidsai/cudf#22569

Closed

Expose cuDF Parquet writer dictionary configs #14878

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose cuDF Parquet writer row group size configs#14783

Expose cuDF Parquet writer row group size configs#14783
thirtiseven merged 7 commits into
NVIDIA:mainfrom
thirtiseven:parquet-row-group-size-config-main

thirtiseven commented May 12, 2026 •

edited

Loading

Uh oh!

thirtiseven commented May 12, 2026

Uh oh!

greptile-apps Bot commented May 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

pmattione-nvidia May 13, 2026

Uh oh!

thirtiseven May 14, 2026

Uh oh!

pmattione-nvidia May 14, 2026

Uh oh!

thirtiseven May 15, 2026

Uh oh!

thirtiseven commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

thirtiseven commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklists

Uh oh!

thirtiseven commented May 12, 2026

Uh oh!

greptile-apps Bot commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

pmattione-nvidia May 13, 2026

Choose a reason for hiding this comment

Uh oh!

thirtiseven May 14, 2026

Choose a reason for hiding this comment

Uh oh!

pmattione-nvidia May 14, 2026

Choose a reason for hiding this comment

Uh oh!

thirtiseven May 15, 2026

Choose a reason for hiding this comment

Uh oh!

thirtiseven commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

thirtiseven commented May 12, 2026 •

edited

Loading

greptile-apps Bot commented May 12, 2026 •

edited

Loading