Description
Environment
Delta-rs version: 0.18.1
Binding: rust
Bug
What happened:
While running kafka-delta-ingest the service crashes with an error when trying to create a checkpoint:
DeltaTable { source: Arrow { source: JsonError("whilst decoding field 'add': while decoding field 'stats': offset overflow decoding Utf8") } }
The error pops up here: https://github.com/apache/arrow-rs/blob/49840ec0f110da5e9a21ce97affd32313d0b720f/arrow-json/src/reader/string_array.rs#L81
What you expected to happen:
The checkpoint will be created successfully
How to reproduce it:
Not sure how to reproduce this, presumably it would require a table with a large delta log and data volume
More details:
Not sure what additional information to provide:
Partitioning by year, month, day, and hour;
Current transaction number: 00000000000002020757;
Approximately 4TB are written to the table per day;
Each parquet file is on average 800MB