Description
Environment
Delta-rs version: 0.22.2
Binding: Rust
Environment:
- Cloud provider: AWS
- OS: Linux
- Other:
Bug
We ran into this error when reading from a managed Databricks Delta Table:
Invalid argument error: all columns in a record batch must have the specified row count
This happens when running the following query via DataFusion:
select * from my_table
It occurs in a customer environment, so we don't have a reliable reproduction yet.
FWIW, I found a similar issue that someone ran into when reading from Iceberg, along with the explanation that the number of physical and logical records in a batch may not match:
apache/datafusion-comet#973
Is it possible that a similar issue exists with Delta?
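To make the suspected failure mode concrete, here is a minimal, std-only sketch of the invariant the error message describes. As far as I can tell, the message is raised when a record batch is built with an explicit row count and at least one column has a different length. This is an assumption based on the wording, not something confirmed against delta-rs. The code is a simplified model rather than the actual Arrow implementation, and `Batch` and `try_new_batch` are hypothetical names used only for illustration.

```rust
// Simplified model of the row-count check; NOT the real Arrow code.
// `Batch` and `try_new_batch` are hypothetical names for illustration.

#[derive(Debug)]
struct Batch {
    row_count: usize,
    columns: Vec<Vec<i64>>,
}

/// Builds a batch only if every column has exactly `row_count` rows.
fn try_new_batch(row_count: usize, columns: Vec<Vec<i64>>) -> Result<Batch, String> {
    if columns.iter().any(|c| c.len() != row_count) {
        return Err(
            "Invalid argument error: all columns in a record batch must have the specified row count"
                .to_string(),
        );
    }
    Ok(Batch { row_count, columns })
}

fn main() {
    // Two columns that each hold 2 rows (the "logical" row count).
    let columns = vec![vec![1, 2], vec![10, 20]];

    // Passing the matching row count succeeds.
    match try_new_batch(2, columns.clone()) {
        Ok(b) => println!("ok: {} rows, {} columns", b.row_count, b.columns.len()),
        Err(e) => println!("error: {e}"),
    }

    // Passing a different ("physical") row count of 3 fails the check.
    match try_new_batch(3, columns) {
        Ok(b) => println!("ok: {} rows, {} columns", b.row_count, b.columns.len()),
        Err(e) => println!("error: {e}"),
    }
}
```

If the reader hands over a row count taken from file metadata while the materialized columns contain a different number of rows, a check of this shape would fail in the same way.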
Thanks!