Open
Description
It's impossible to represent Column Mapping with a PyArrow Dataset (apache/arrow#36593), and Deletion Vectors cannot be represented in a Dataset either. PyArrow Tables are more flexible, but they are fully loaded into RAM. I think the correct abstraction would be a to_recordbatchreader() method on Delta tables that takes a (partition) filter parameter.
Use Case
Future support for deletion vectors and column mapping.