Does converting a DeltaTable into a Pandas DataFrame downloads the whole table?
#3169
-
|
I have the following Python code to connect to a Delta table and convert that into a Pandas from deltalake import DeltaTable
dt = DeltaTable(path, storage_options=storage_options)
df = dt.to_pandas()As I'm executing the above code remotely, will it download the whole table to create the |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
|
I thought pandas was in-memory only with no concept of lazy data? Doesn't that imply that it would have to? |
Beta Was this translation helpful? Give feedback.
-
|
Yes, it will. It uses |
Beta Was this translation helpful? Give feedback.
Yes, it will. It uses
pyarrowunderneath to read everything if you do not pass any filter. You can passfiltersorpartitionsfor even better efficiency.