Skip to content

Invalid argument error: all columns in a record batch must have the specified row count #3167

Open
@ryzhyk

Description

@ryzhyk

Environment

Delta-rs version: 0.22.2

Binding: Rust

Environment:

  • Cloud provider: AWS
  • OS: Linux
  • Other:

Bug

We ran into this error when reading from a managed Databricks Delta Table:

Invalid argument error: all columns in a record batch must have the specified row count

This happens when running the select * from my_table query via datafusion. This occurs in the customer environment, so we don't have a reliable reproduction yet.

FWIW, I found a similar issue that someone ran into when reading from Iceberg, along with the explanation that the number of physical and logical records in a batch may not match:
apache/datafusion-comet#973

Is it possible that a similar issue exists with Delta?

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingmre-neededWhether an MRE needs to be provided

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions