Skip to content

Test out early stop sort merge join to handle AS OF join? #360

Open
@CTCC1

Description

@CTCC1

I ran into some online benchmarks about AS OF join where in certain cases, "early stop sort merge join" can outperform UNION based AS OF join.

https://www.hopsworks.ai/post/a-spark-join-operator-for-point-in-time-correct-joins (fwiw, it mentioned tempo as the inspiration for the UNION based AS OF join)

open sourced implementations
https://github.com/Ackuq/spark-pit/blob/main/scala/src/main/scala/execution/Patterns.scala

Would be interested to see what the community / maintainers think.

Metadata

Metadata

Assignees

No one assigned

    Labels

    futureThis issue is in a backlog of ideas to possibly be done in the futureperformanceIssues with the time to execute operations or other performance issues

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions