feat: Add PartitioningSerializer for Presto serialization #1800

Open
yingsu00 wants to merge 1 commit into IBM:optimized_partitionedoutput from yingsu00:PartitionedOutput3.0
@yingsu00
Collaborator

This commit introduces PartitioningSerializer, which buffers RowVectors across
multiple append() calls, partitions rows in place using PartitionedVector,
and on flush() serializes each non-empty partition into a Presto
wire-format IOBuf. The serializer has no dependency on velox_exec:
it returns raw folly::IOBuf objects and leaves SerializedPage creation
to the caller.
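
For readers unfamiliar with the append()/flush() contract described above, here is a minimal standalone sketch of the idea. It is not the code from this PR. `ToyPartitioningSerializer`, `Row`, `Bytes`, and the modulo partition function are hypothetical stand-ins. It copies rows into per-partition buckets rather than partitioning in place with PartitionedVector, and it emits raw bytes rather than a Presto wire-format folly::IOBuf. It also assumes `numPartitions` is greater than zero.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <map>
#include <utility>
#include <vector>

// Hypothetical stand-ins: the real serializer consumes RowVectors and
// produces folly::IOBuf objects in Presto wire format.
using Row = int64_t;
using Bytes = std::vector<uint8_t>;

class ToyPartitioningSerializer {
 public:
  // Assumes numPartitions > 0.
  explicit ToyPartitioningSerializer(uint32_t numPartitions)
      : numPartitions_(numPartitions) {}

  // Buffers a batch of rows. Nothing is partitioned or serialized yet.
  void append(const std::vector<Row>& batch) {
    buffered_.insert(buffered_.end(), batch.begin(), batch.end());
  }

  // Partitions everything buffered so far, serializes each non-empty
  // partition into its own byte buffer, and resets the internal buffer.
  // Empty partitions produce no entry in the result.
  std::map<uint32_t, Bytes> flush() {
    std::vector<std::vector<Row>> parts(numPartitions_);
    for (Row row : buffered_) {
      parts[partitionOf(row)].push_back(row);
    }
    buffered_.clear();

    std::map<uint32_t, Bytes> out;
    for (uint32_t p = 0; p < numPartitions_; ++p) {
      if (parts[p].empty()) {
        continue;
      }
      // Toy "wire format": the raw bytes of the rows, back to back.
      Bytes bytes(parts[p].size() * sizeof(Row));
      std::memcpy(bytes.data(), parts[p].data(), bytes.size());
      out.emplace(p, std::move(bytes));
    }
    return out;
  }

 private:
  // Hypothetical partition function: row value modulo partition count.
  uint32_t partitionOf(Row row) const {
    return static_cast<uint32_t>(static_cast<uint64_t>(row) % numPartitions_);
  }

  uint32_t numPartitions_;
  std::vector<Row> buffered_;
};
```

In this sketch the caller owns what happens to each returned buffer, which mirrors the description's point that wrapping the output (SerializedPage creation in the real code) is left to the caller.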

@yingsu00 yingsu00 requested a review from majetideepak as a code owner March 11, 2026 15:04
@yingsu00 yingsu00 force-pushed the PartitionedOutput3.0 branch from 30b7fb6 to 38a63dd Compare March 11, 2026 15:10
@yingsu00 yingsu00 requested review from xin-zhang2 and removed request for majetideepak March 11, 2026 15:10