Conversation
|
|
EDIT:
Would this be correct? And why do we want to test reading from a single partition, since the app never does? 👀 To document pyarrow behavior? |
|
I haven't checked the reason for the test in question. Can do that later after merging this PR since the microdata-tools is out, so we should not be runnning different version here in job-executor for too long 👍 |
DanielElisenberg
left a comment
There was a problem hiding this comment.
That makes sense 👍🏻 Reasonable conclusion. Let's look at whether reading the separate folders and containing parquet files is reasonable in a new PR 💯



One test for partitioned datasets read the .paquet file which returns also the partitioned column. Added a test for the usual case when just asking for "start_year=123" and not "start_year=123/04f4164ec1f247f2ad392fa9c03e71fe-0.parquet".