You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix: lazily initialize BatchCoalescer in CoalescedShuffleReaderStream to avoid schema type mismatch
The BatchCoalescer inside CoalescedShuffleReaderStream was eagerly initialized
with the declared schema from the execution plan. However, the actual IPC shuffle
data may have different Arrow types (e.g., string columns declared as LargeUtf8
in the plan but written as Utf8 by the CSV reader). When
InProgressPrimitiveArray<T>::copy_rows() tries to downcast the source array, the
type mismatch causes a panic: Internal("primitive array").
This applies the same lazy initialization pattern used for RepartitionExec
(spiceai/datafusion#135): defer BatchCoalescer creation until the first batch
arrives and use the batch's actual schema.
0 commit comments