Unnest operator should honor kPreferredOutputBatchRows strictly

### Description

This PR honors the kPreferredOutputBatchRows config:
https://github.com/facebookincubator/velox/pull/7051

Currently there is a constraint that the output of a single input row must go into a single batch.
But when an input row has a very large nested array+struct, the output batch is also very large.
So we need to respect kPreferredOutputBatchRows strictly.
There are several strategies:
1. Split a row only if that row alone produces more output rows than maxOutputBatchSize.
2. Always split the last row so that the batch matches the output batch size.

I would prefer the second approach, since it yields an exact batch size.
We could add a benchmark to measure the performance cost of always splitting the last row.
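
To make the second strategy concrete, here is a minimal standalone sketch. It is not the Velox Unnest implementation: `RowSlice`, `Batch`, `splitStrictly`, `batchRows`, and the `maxOutputBatchRows` parameter are hypothetical names used only for illustration, and the sketch assumes each input row contributes a known number of unnested output rows. Empty rows produce no output here, which ignores any outer-unnest semantics.

```cpp
#include <algorithm>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical: one contiguous slice of a single input row's unnested output.
struct RowSlice {
  int64_t inputRow;     // index of the input row
  int64_t firstElement; // first unnested element covered by this slice
  int64_t numElements;  // number of unnested elements in this slice
};

using Batch = std::vector<RowSlice>;

// Strategy 2: always split the last row of a batch, so every batch except
// possibly the final one holds exactly maxOutputBatchRows output rows.
// Assumes maxOutputBatchRows > 0.
std::vector<Batch> splitStrictly(
    const std::vector<int64_t>& rowSizes,
    int64_t maxOutputBatchRows) {
  std::vector<Batch> batches;
  Batch current;
  int64_t currentRows = 0;
  for (int64_t row = 0; row < static_cast<int64_t>(rowSizes.size()); ++row) {
    int64_t offset = 0;
    int64_t remaining = rowSizes[row];
    while (remaining > 0) {
      // Take as much of this row as still fits in the current batch.
      const int64_t take =
          std::min(remaining, maxOutputBatchRows - currentRows);
      current.push_back({row, offset, take});
      currentRows += take;
      offset += take;
      remaining -= take;
      if (currentRows == maxOutputBatchRows) {
        batches.push_back(std::move(current));
        current.clear();
        currentRows = 0;
      }
    }
  }
  if (!current.empty()) {
    batches.push_back(std::move(current));
  }
  return batches;
}

// Total number of output rows in a batch.
int64_t batchRows(const Batch& batch) {
  int64_t total = 0;
  for (const RowSlice& slice : batch) {
    total += slice.numElements;
  }
  return total;
}
```

For example, with row sizes `{3, 10, 2}` and a limit of 4, the 10-element row is spread across all four batches, and the batch sizes come out as 4, 4, 4, 3. The first strategy would instead keep the first row in its own batch and split only the oversized second row, so batch sizes would be less uniform.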

https://github.com/facebookincubator/velox/pull/7051#issuecomment-2264790839






Unnest operator honor kPreferredOutputBatchRows strictly #10655
