Update datafusion#10422
Merged
Merged
Conversation
Contributor
✅ Pull with Spice PassedPassing checks:
|
Contributor
✅ Pull with Spice Passed🏷️ Auto-applied labels:
Passing checks:
|
1 similar comment
Contributor
✅ Pull with Spice Passed🏷️ Auto-applied labels:
Passing checks:
|
Contributor
There was a problem hiding this comment.
Pull request overview
Updates the pinned spiceai/datafusion git revision to include changes from spiceai/datafusion#155.
Changes:
- Bump
datafusion(and relateddatafusion-*crates) gitrevin[patch.crates-io]to2e4b04b4e9dd4b949195529cb0d9b92cb8d75eaf. - Regenerate
Cargo.lockentries so alldatafusion*packages point at the new git revision.
Reviewed changes
Copilot reviewed 1 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| Cargo.toml | Updates the pinned DataFusion fork revision across all patched datafusion* crates. |
| Cargo.lock | Updates lockfile sources to match the new DataFusion git revision. |
sgrebnov
previously approved these changes
Apr 20, 2026
ewgenius
previously approved these changes
Apr 20, 2026
fix: Update test snapshots
508adb6
fix: Update test snapshots
peasee
previously approved these changes
Apr 27, 2026
Co-authored-by: Spice Snapshot Update Bot <spiceaibot@spice.ai>
peasee
previously approved these changes
Apr 28, 2026
peasee
approved these changes
Apr 28, 2026
lukekim
added a commit
that referenced
this pull request
Apr 28, 2026
* update datafusion * fix: Update test snapshots * fix: Update test snapshots * fixes for _score in vector UDTF * remove bas snapshots * fix: Update Search integration test snapshots * search: stabilize vector_search score column for rrf recency plans * revert these snapshots * fix: Update Search integration test snapshots * Apply suggestions from code review Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * update snapshot * remove * fix build * fix: remove unused col import The col function from datafusion_expr is no longer used in embeddings/udtf.rs after the upstream datafusion update; the lint job runs with -D warnings, so the unused import broke the build. * snapshot * update DF * Fix DF * fixes * improved testing for RRF * snapshots * normalize remove * fix: Update Search integration test snapshots (#10567) Co-authored-by: Spice Snapshot Update Bot <spiceaibot@spice.ai> * test: update insta snapshots --------- Co-authored-by: Spice Snapshot Update Bot <spiceaibot@spice.ai> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Evgenii Khramkov <evgenii@spice.ai> Co-authored-by: Luke Kim <80174+lukekim@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: claudespice <claude@spice.ai>
pull Bot
pushed a commit
to TheRakeshPurohit/spiceai
that referenced
this pull request
Apr 30, 2026
* Add self-hosted Spice connector support * Enhance RefreshTask to handle pre-delete rows for upserts and add tests for endpoint scheme validation * Refactor encode_data_update function to use lifetime annotations for better clarity * Refactor data exchange handling to support streaming snapshots and improve batch encoding * fix: enhance upsert handling with primary key validation and improve error reporting * fix: pass dataset name as a reference in upsert pre-delete rows function * Add DuckDB vector engine support * Address DuckDB vector engine PR feedback * Add DuckDB HNSW search integration coverage * Remove DuckDB ef_search alias * Remove DuckDB ef_construction alias * Address DuckDB vector engine PR feedback * Revert "Add self-hosted Spice connector support" This reverts commit 10af21f. * fix merge * better Error enum * Support views on DDL catalogs (spiceai#10554) * Support views on DDL catalogs * fix ref * fix variable * fix compile * fix: invert table_exists loop condition for view dependency wait The view dependency polling loop had an inverted condition: it retried while table_exists() returned true (table found) and broke when it returned false (table not found). This caused all view tests to fail — the loop would spin until the deadline with the table already present, log 'does not exist, retrying...' throughout, then exit on timeout and report the view as failed. Fix: negate the condition so the loop retries while the table is absent (!table_exists) and exits as soon as the table appears. * fix: table_exists takes &TableReference to satisfy clippy::needless_pass_by_value * bad merge * Update datafusion (spiceai#10422) * update datafusion * fix: Update test snapshots * fix: Update test snapshots * fixes for _score in vector UDTF * remove bas snapshots * fix: Update Search integration test snapshots * search: stabilize vector_search score column for rrf recency plans * revert these snapshots * fix: Update Search integration test snapshots * Apply suggestions from code review Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * update snapshot * remove * fix build * fix: remove unused col import The col function from datafusion_expr is no longer used in embeddings/udtf.rs after the upstream datafusion update; the lint job runs with -D warnings, so the unused import broke the build. * snapshot * update DF * Fix DF * fixes * improved testing for RRF * snapshots * normalize remove * fix: Update Search integration test snapshots (spiceai#10567) Co-authored-by: Spice Snapshot Update Bot <spiceaibot@spice.ai> * test: update insta snapshots --------- Co-authored-by: Spice Snapshot Update Bot <spiceaibot@spice.ai> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Evgenii Khramkov <evgenii@spice.ai> Co-authored-by: Luke Kim <80174+lukekim@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: claudespice <claude@spice.ai> * Improve full-text search indexing performance (spiceai#10464) * Improve tantivy FTS ingest performance * Improve tests * Rollback on error path * remove index as unnecessary --------- Co-authored-by: Jack Eadie <jack@spice.ai> * add to search integration tests * fix compile * update docs * Address DuckDB vector review feedback * fix boxed * Address DuckDB vector follow-up review feedback * Bound DuckDB vector search default limit * fix: Update Search integration test snapshots * fix finding duckdb index * feat: Enhance DuckDB vector query handling for empty projections and filter pushdown * fix: Update Search integration test snapshots * fixes * fixes * better docs * chore: update datafusion-table-providers to add ignored_index_prefixes Pin to fork branch spiceai-hnsw-index-drift which adds `TableDefinition::add_ignored_index_prefix` so externally-managed HNSW indexes (named `__spice_vss_*`) are excluded from the DuckDB writer's index drift check, preventing spurious refresh failures. * chore: update datafusion-table-providers to upstream merged commit Switch from Jeadie fork back to datafusion-contrib upstream at df7dbc64, which includes the merged ignored_index_prefixes fix. * formatting * clippy * fix clippy: return closure result directly instead of let binding --------- Co-authored-by: jeadie <jack@spice.ai> Co-authored-by: Spice Snapshot Update Bot <spiceaibot@spice.ai> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Evgenii Khramkov <evgenii@spice.ai> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: claudespice <claude@spice.ai> Co-authored-by: Sergei Grebnov <sergei.grebnov@gmail.com> Co-authored-by: Copilot <copilot@github.com> Co-authored-by: William <98815791+peasee@users.noreply.github.com>
1 task
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
📝 Summary
Update datafusion to include:
ExecutionPlan::fetchandExecutionPlan::with_fetchforProjectionExec#155
ProjectionExec::try_pushdown_sortagainst stale name/index metadata#156
For # 156 we also need to fix a planning bug where
rrf(...)queries usingvector_search(...)with recency settings (time_column, decay args) could produce an invalid physical plan (physical_plan_error: SanityCheckPlan) due to a stale_score@idxsort reference after optimizer rewrites.