Compatibility enhancements on AggregationNode

## Background

Velox's `AggregationNode` appears to follow the Presto design, in which [step](https://github.com/facebookincubator/velox/blob/e45fc7383576e42d68ee624b587c52bf53686678/velox/core/PlanNode.h#L596-L605) is a property of the aggregation operator. By contrast, Spark's [AggregateMode](https://github.com/apache/spark/blob/cbcc298970fe1b5221bb1eef69446e7b26e2b934/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/interfaces.scala#L101) is bound to each individual `AggregateFunction`, so a single Spark aggregate operator can contain aggregate functions with different modes (steps). There was also an earlier [discussion](https://github.com/facebookincubator/velox/issues/4412) about this difference.
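To make the difference concrete, here is a minimal sketch of the two layouts. The types below (`NodeLevelAggregation`, `FunctionLevelAggregation`, `hasUniformStep`) are hypothetical and simplified; they are not the actual Velox or Spark classes, and only illustrate where the step lives in each design.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical, simplified types for illustration only.
enum class Step { kPartial, kIntermediate, kFinal, kSingle };

// Presto-like layout: one step applies to every function in the operator.
struct NodeLevelAggregation {
  Step step;
  std::vector<std::string> functions;
};

// Spark-like layout: every function carries its own mode (step).
struct FunctionWithStep {
  std::string name;
  Step step;
};

struct FunctionLevelAggregation {
  std::vector<FunctionWithStep> functions;
};

// Returns true when all functions share one step, i.e. when the
// function-level plan could be expressed with a single node-level step.
bool hasUniformStep(const FunctionLevelAggregation& agg) {
  for (const auto& f : agg.functions) {
    if (f.step != agg.functions.front().step) {
      return false;
    }
  }
  return true;
}

// Builds an operator whose functions use different steps and checks that
// it cannot be described by a single node-level step.
void example() {
  FunctionLevelAggregation mixed;
  mixed.functions.push_back({"sum", Step::kIntermediate});
  mixed.functions.push_back({"count", Step::kPartial});
  assert(!hasUniformStep(mixed));
}
```

An operator like `mixed` has no direct equivalent in the node-level layout, which is the mismatch this issue is about.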

Apache Gluten has been relying on some [tricks](https://github.com/apache/incubator-gluten/pull/4130) to align Spark with Velox here: the planner translates Spark's aggregation modes into the corresponding types of Velox companion functions and always assigns the `single` step to Velox's `AggregationNode`. This avoids many issues caused by the mismatch. For example, it naturally gives us spillable partial aggregation, which Velox does not support. It also avoids unwanted flushing, which causes result mismatches for some query plans generated by Spark.

The solution worked as expected until we found that, for some functions, the **intermediate** / **final** companion functions cannot be resolved from the intermediate input types. This happens when a function accepts several different input types but uses the same intermediate data type for all of them, as in [this example](https://github.com/facebookincubator/velox/blob/e45fc7383576e42d68ee624b587c52bf53686678/velox/functions/sparksql/aggregates/AverageAggregate.cpp#L102-L108).

The issue is hard to fix completely. Velox treats companion functions as ordinary aggregate functions, so even when we have the input types of the original aggregate function from Spark, we cannot always use them to find the right **intermediate** / **final** companion function, because Velox resolves those functions by their intermediate data types. Velox does provide suffixed versions of the final companion functions to tell them apart, but in principle these are not reliable, since in SQL overloaded functions can be distinguished only by function name and input data types.
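The ambiguity can be sketched with a toy registry. Everything here is an assumption made for illustration: `my_agg` is a made-up function with two overloads that share one intermediate type, and `Signature`, `resolveByRawInput`, and `resolveByIntermediate` are hypothetical helpers, not Velox's function registry.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical signature record; types are plain strings for simplicity.
struct Signature {
  std::string name;
  std::string rawInputType;
  std::string intermediateType;
  std::string resultType;
};

// Two made-up overloads that share the same intermediate type.
std::vector<Signature> exampleRegistry() {
  return {
      {"my_agg", "bigint", "row(double,bigint)", "bigint"},
      {"my_agg", "double", "row(double,bigint)", "double"},
  };
}

// Lookup by function name and raw input type.
std::vector<Signature> resolveByRawInput(
    const std::vector<Signature>& registry,
    const std::string& name,
    const std::string& rawInputType) {
  std::vector<Signature> matches;
  for (const auto& s : registry) {
    if (s.name == name && s.rawInputType == rawInputType) {
      matches.push_back(s);
    }
  }
  return matches;
}

// Lookup by function name and intermediate type, which is all an
// intermediate or final step sees as its input.
std::vector<Signature> resolveByIntermediate(
    const std::vector<Signature>& registry,
    const std::string& name,
    const std::string& intermediateType) {
  std::vector<Signature> matches;
  for (const auto& s : registry) {
    if (s.name == name && s.intermediateType == intermediateType) {
      matches.push_back(s);
    }
  }
  return matches;
}

// The raw-input lookup is unique; the intermediate-type lookup is not.
void example() {
  const auto registry = exampleRegistry();
  assert(resolveByRawInput(registry, "my_agg", "bigint").size() == 1);
  assert(
      resolveByIntermediate(registry, "my_agg", "row(double,bigint)").size() ==
      2);
}
```

With only the intermediate type available, both overloads match and their result types differ, so the lookup alone cannot pick the right one.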

## Proposed Changes

1. Move `AggregationNode::Step` to `AggregationNode::Aggregate::Step` and make the per-aggregate step take effect.
2. Add a flag to `AggregationNode` that lets users explicitly disable flushing.

These changes would make Velox's aggregation API compatible with both Spark and Presto, and possibly with other databases, because the API becomes finer grained. For Presto, we can pass the same aggregation step to all aggregate functions in the operator. For Spark, we can translate each aggregate function's `AggregateMode` into a Velox `AggregationNode::Aggregate::Step`. Spark doesn't do flushing, so for Spark we would normally just disable it. We can also stop relying on companion functions.
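A rough sketch of how the proposed shape might look from the planner's side follows. The names `AggregationNodeSketch`, `flushingEnabled`, `toStep`, `makeSparkNode`, and `makePrestoNode` are hypothetical, and the mode-to-step mapping is an assumption based on what each mode consumes and produces (raw or intermediate data); the issue itself does not spell it out.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Mirrors the four Spark aggregate modes; simplified for illustration.
enum class SparkAggregateMode { Partial, PartialMerge, Final, Complete };

// Mirrors the four Velox aggregation steps; simplified for illustration.
enum class Step { kPartial, kIntermediate, kFinal, kSingle };

// Hypothetical per-aggregate record: the step lives on each aggregate.
struct Aggregate {
  std::string functionName;
  Step step;
};

// Hypothetical node: per-aggregate steps plus a flag to disable flushing.
struct AggregationNodeSketch {
  std::vector<Aggregate> aggregates;
  bool flushingEnabled = true;
};

// Assumed mapping from Spark modes to steps.
Step toStep(SparkAggregateMode mode) {
  switch (mode) {
    case SparkAggregateMode::Partial:
      return Step::kPartial; // raw input -> intermediate
    case SparkAggregateMode::PartialMerge:
      return Step::kIntermediate; // intermediate -> intermediate
    case SparkAggregateMode::Final:
      return Step::kFinal; // intermediate -> final result
    case SparkAggregateMode::Complete:
      return Step::kSingle; // raw input -> final result
  }
  assert(false && "unknown SparkAggregateMode");
  return Step::kSingle;
}

// Spark: each function keeps its own step, and flushing is turned off.
AggregationNodeSketch makeSparkNode(
    const std::vector<std::pair<std::string, SparkAggregateMode>>& functions) {
  AggregationNodeSketch node;
  for (const auto& f : functions) {
    node.aggregates.push_back({f.first, toStep(f.second)});
  }
  node.flushingEnabled = false;
  return node;
}

// Presto: the same step is passed to every function in the operator.
AggregationNodeSketch makePrestoNode(
    const std::vector<std::string>& functionNames,
    Step step) {
  AggregationNodeSketch node;
  for (const auto& name : functionNames) {
    node.aggregates.push_back({name, step});
  }
  return node;
}
```

In this sketch the Presto path is just the special case where all aggregates share one step, which is why the finer-grained API can serve both engines.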



Compatibility enhancements on AggregationNode #12830
