Most people want to have tpcds data in parquet rather than the original TPC "TBL" csv-like format.
This ticket tracks adding support for creating parquet files directly from the tpcds data. Note that the way we did it for tpch (and the way I recommend for tpcds) is to generate Arrow data directly
And then write that arrow data to parquet
Most people want to have tpcds data in parquet rather than the original TPC "TBL" csv-like format.
This ticket tracks adding support for creating parquet files directly from the tpcds data. Note that the way we did it for tpch (and the way I recommend for tpcds) is to generate Arrow data directly
And then write that arrow data to parquet