Insertion order difference between 0.15.1 and 0.24.0 #3226
-
I have been using delta-rs to accumulate timeserieses. I've generated a sample table with
inserting three timeseries (with its timestamp column in ascending order) to simulate my use case, in different transaction. Below is the content of actions table and data for 0.15.1 and 0.24.0, respectively. Output is the print out of
With 0.15.1,
Data is readily usable/scannable for my use case as it's timestmap is already in asending order. But, with 0.24.0,
Timestamp across parquet files are descending order while data in same transaction (same parauet file) in asending order. It seems the output data is aggregated in the order apperaed in the add action table. This is mentioned also in #2659 (comment) |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Delta lake doesn't guarantee insertion order |
Beta Was this translation helpful? Give feedback.
Delta lake doesn't guarantee insertion order