[v1.5] fix: add primary keys and indexes to metadata tables for concurrent write safety by fuziontech · Pull Request #14 · PostHog/ducklake

fuziontech · 2026-05-18T22:24:46Z

Cherry-picked from #1 to v1.5-variegata

…rite safety When using PostgreSQL as the metadata catalog with multiple concurrent writers (e.g., Kafka Connect tasks), UPDATE/DELETE statements on tables without primary keys cause postgres_scanner to use ctid (physical row ID) for row identification. This leads to serialization failures because ctid changes when rows are updated due to PostgreSQL's MVCC. Tables that previously had no primary key now have one: | Table | Primary Key | |-------|-------------| | ducklake_table_stats | table_id | | ducklake_table_column_stats | (table_id, column_id) | | ducklake_file_column_stats | (data_file_id, column_id) | | ducklake_partition_info | partition_id | | ducklake_partition_column | (partition_id, partition_key_index) | | ducklake_file_partition_value | (data_file_id, partition_key_index) | | ducklake_files_scheduled_for_deletion | data_file_id | | ducklake_inlined_data_tables | (table_id, schema_version) | | ducklake_column_mapping | mapping_id | | ducklake_name_mapping | (mapping_id, column_id) | | ducklake_macro_impl | (macro_id, impl_id) | | ducklake_macro_parameters | (macro_id, impl_id, column_id) | For frequently queried columns (especially table_id lookups): - idx_data_file_table_snapshot: (table_id, begin_snapshot, end_snapshot) - idx_delete_file_table_snapshot: (table_id, begin_snapshot, end_snapshot) - idx_column_table: (table_id, end_snapshot) - idx_file_column_stats_table: (table_id, column_id) - idx_partition_info_table: (table_id) - idx_partition_column_table: (table_id) - idx_file_partition_value_table: (table_id) This eliminates "could not serialize access due to concurrent update" errors and improves query performance for table-scoped operations. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

fuziontech and others added 2 commits May 18, 2026 15:24

fix: remove primary keys from variant stats and partition value

d2f5747

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v1.5] fix: add primary keys and indexes to metadata tables for concurrent write safety#14

[v1.5] fix: add primary keys and indexes to metadata tables for concurrent write safety#14
fuziontech wants to merge 2 commits into
v1.5-variegatafrom
v1.5-fix-add-primary-key-to-table-stats

fuziontech commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

fuziontech commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant