Skip to content

to_pyarrow_table does not deduce partition filters from filter list #1997

@zalmane

Description

@zalmane

Description

In the documentation for to_pyarrow_table it is stated that if filters are used, partitions do not need to be specified.
However, looking at the code here:

def to_pyarrow_table(

it seems that only partitions are used when creating the dataset. Then, filters are used when converting to a table.

Is this a bug or expected behavior? Seems that in a large Delta table, we will be scanning a lot of irrelevant files in the first stage.

Use Case
Specify filters list and expect Delta-rs to find the relevant partition filters and apply those to the Dataset.

Related Issue(s)

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions