Describe the bug
I was testing some workarounds for
And unfortunately I hit a panic
To Reproduce
- Create benchmark data
cd benchmarks
./bench.sh data clickbench_partitioned
cd ..
Run query with memory limited to 1G:
datafusion-cli -m 1G -c "SELECT \"UserID\", extract(minute FROM to_timestamp_seconds(\"EventTime\")) AS m, \"SearchPhrase\", COUNT(*) FROM 'benchmarks/data/hits_partitioned' GROUP BY \"UserID\", m, \"SearchPhrase\" ORDER BY COUNT(*) DESC LIMIT 10;"
This fails like this:
datafusion-cli -m 1G -c "SELECT \"UserID\", extract(minute FROM to_timestamp_seconds(\"EventTime\")) AS m, \"SearchPhrase\", COUNT(*) FROM 'benchmarks/data/hits_partitioned' GROUP BY \"UserID\", m, \"SearchPhrase\" ORDER BY COUNT(*) DESC LIMIT 10;"
DataFusion CLI v50.3.0
thread 'tokio-runtime-worker' panicked at datafusion/physical-plan/src/aggregates/group_values/multi_group_by/bytes_view.rs:466:53:
range end index 1943308 out of range for slice of length 193245
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
Error: Join Error
caused by
External error: task 246 panicked with message "range end index 1943308 out of range for slice of length 193245"
Expected behavior
No response
Additional context
No response