
Spec: Allow the use of source-id in V3 #12644

Open · wants to merge 2 commits into base: main
Conversation

Contributor
@Fokko Fokko commented Mar 25, 2025

Changes around the multi-argument transforms, mainly two things:

  • Up for debate. The spec does not point out an actual implementation of a transform that accepts multiple arguments. Of the existing transforms, the only contender is the bucket transform. Should we include this in the V3 spec? It only allows pruning metadata if you use an equality expression on all of the fields that are part of the transform.
  • Along the way, we removed something that we did not intend. Originally we allowed writing source-id or source-ids based on the number of arguments. This was changed to allow only source-ids for V3 in a PR that introduces backward compatibility. I think this makes the JSON parsers/producers more complex than needed (specifically PyIceberg). Also, in Java we would need to plumb the table version down to PartitionSpecParser.java. I think it would be great to simplify this.

@github-actions github-actions bot added the Specification Issues that may introduce spec changes. label Mar 25, 2025
@@ -1414,12 +1414,16 @@ Each partition field in `fields` is stored as a JSON object with the following p

| V1 | V2 | V3 | Field | JSON representation | Example |
|----------|----------|----------|------------------|---------------------|--------------|
| required | required | omitted | **`source-id`** | `JSON int` | 1 |
Member

Can't we still have a reader that reads both source-id and source-ids? Basically, following our motto of reading potentially out-of-spec but writing in-spec, we can always check whether source-id is set as well as source-ids. The only time we need to break is if the two don't match.

Contributor

What should the value be if source-ids is present?

Member

These are implementation details, but I would suggest, if source-id and source-ids are both there:
  • source-ids == list(source-id): fine on read; on write, omit source-id
  • source-ids != list(source-id): error on read
  • source-id = x && source-ids == null: fine on read; promote to list(source-id) on write
  • source-id == null && source-ids = list: spec-compliant config
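A minimal sketch of the read-side resolution described above (the function name and dict-based JSON representation are hypothetical, not from the PR):

```python
def resolve_source_ids(field):
    """Resolve the source field IDs of a partition field, tolerating
    out-of-spec metadata that carries both keys, as long as they agree."""
    source_id = field.get("source-id")
    source_ids = field.get("source-ids")
    if source_ids is not None:
        # Both present: accept on read only when they are consistent.
        if source_id is not None and source_ids != [source_id]:
            raise ValueError("source-id and source-ids do not match")
        return source_ids
    if source_id is not None:
        # Legacy single-argument form: promote to a list.
        return [source_id]
    raise ValueError("either source-id or source-ids must be set")
```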

Contributor Author

Yes, I agree that we should break if they don't match. We have a similar situation with schema and schemas, where current-schema-id might point to two different schemas.

When we write both, we can resolve that, but I think we don't need them both?

Member

Yeah, my suggestion is that we still say in V3 that source-id is omitted, but in the implementation we can still handle out-of-spec metadata if we come across it. That way the spec reader code doesn't have to be version dependent.

Member

If both source-ids and source-id are written for single-arg transform, a v3-supported reader can take a shortcut on source-ids and only fall back to source-id when source-ids is missing. Isn't this a desirable feature? Though someone may argue that we have to check the consistency between them like the above discussion.

Contributor Author

@Fokko Fokko Mar 27, 2025

That way the spec reader code doesn't have to be version dependent.

My goal was to be completely independent of the table version, also on the write side. Why not remove this constraint entirely by writing source-id or source-ids based on the number of arguments? :) I think they should be written mutually exclusively, to avoid any confusion.
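The argument-count-based, version-independent serialization described here could look like the following sketch (the helper name and dict-based JSON representation are hypothetical):

```python
def serialize_partition_field(name, transform, field_id, source_ids):
    """Write source-id for single-argument transforms and source-ids
    for multi-argument ones, mutually exclusively, so the serializer
    never needs to know the table version."""
    field = {"name": name, "transform": transform, "field-id": field_id}
    if len(source_ids) == 1:
        field["source-id"] = source_ids[0]
    else:
        field["source-ids"] = list(source_ids)
    return field
```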

Contributor

Oh, sorry for the confusion; my question is about multi-arg transforms like bucket(a, b, c): which value should we write for source-id, since it's now changed to required?

Member

I think we can be strict on the write side ... unless we want to keep both fields indefinitely, which I'm not sure is super beneficial?

@@ -540,7 +540,7 @@ Notes:
2. The width, `W`, used to truncate decimal values is applied using the scale of the decimal column to avoid additional (and potentially conflicting) parameters.
3. Strings are truncated to a valid UTF-8 string with no more than `L` code points.
4. In contrast to strings, binary values do not have an assumed encoding and are truncated to `L` bytes.

5. For multi-argument bucketing, the hashes are `xor`'ed: `hash(col1) ⊕ hash(col2) ⊕ ... ⊕ hash(colN)) % W`.


I think it would be good to be explicit about how to handle nulls and dropped columns. Should it be calculated as if hash(null) == 0?

It also might be worth adding parentheses for those who don't recall the precedence order between XOR and modulo.

Also - good to meet you, and thanks for working on this! I'm a developer at Snowflake working on Iceberg functionality, and we really appreciate how detailed the spec is!

Member

Based on the fact that bucket(null) returns null, if any value is null then the entire hash value should be null? I agree that this should be explicit.

Contributor Author

@Fokko Fokko Mar 27, 2025

Great to meet you as well @sfc-gh-bhannel, thanks for jumping in here.

It also might be worth adding parentheses for those who don't recall the precedence order between XOR and modulo.

Thanks, there was a bracket missing actually ;)

The spec states (just below the table):

All transforms must return null for a null input value.

I think we can amend this for the multi-arg transforms so that we leave out null values but still hash the other non-null fields.

Suggested change
5. For multi-argument bucketing, the hashes are `xor`'ed: `hash(col1) ⊕ hash(col2) ⊕ ... ⊕ hash(colN)) % W`.
5. For multi-argument bucketing, the hashes for the not-null input values are `xor`'ed: `(hash(col1) ⊕ hash(col2) ⊕ ... ⊕ hash(colN)) % W`. The transform will return `null` when all input values are `null`.
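The suggested rule could be sketched as follows. The per-column hash function (Iceberg's bucket transform uses 32-bit Murmur3) is out of scope here, so the sketch takes precomputed hash values; masking to a non-negative value before the modulo is an assumption carried over from the single-argument bucket transform:

```python
def multi_arg_bucket(hashes, num_buckets):
    """Combine per-column hashes for a multi-argument bucket transform:
    xor the hashes of the non-null inputs, then take the modulo.
    `hashes` holds one 32-bit hash per input column, or None where the
    input value is null. Returns None when every input is null."""
    non_null = [h for h in hashes if h is not None]
    if not non_null:
        return None
    combined = 0
    for h in non_null:
        combined ^= h
    # Mask to a non-negative value before the modulo (assumption,
    # mirroring the single-argument bucket definition).
    return (combined & 0x7FFFFFFF) % num_buckets
```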

| | required | required | **`field-id`** | `JSON int` | 1000 |
| required | required | required | **`name`** | `JSON string` | `id_bucket` |
| required | required | required | **`transform`** | `JSON string` | `bucket[16]` |

Notes:

1. For partition fields with a transform with a single argument, the ID of the source field is set on `source-id`, and `source-ids` is omitted.


Would it be more consistent to always use source-ids for v3 writers? Or do you want to use source-id to allow v3 writers to be compatible with v2 readers if you don't use v3 features?

Contributor Author

This makes the serialization logic much easier, since we don't need to know about the table version. Otherwise, we would need to plumb down the table version into the serializer.

In v3 metadata, writers must use only `source-ids` because v3 requires reader support for multi-arg transforms.
Notes:

1. For sort fields with a transform with a single argument, the ID of the source field is set on `source-id`, and `source-ids` is omitted.


Did you mean to make the transform row refer to this footnote? I would expect it to be the source-id and source-ids rows.

Contributor Author

This was the idea:

[image: screenshot]

But I guess it is not obvious? I think @wgtmac raised the same question in #12644 (comment)?

@szehon-ho
Collaborator

Hi @Fokko, I haven't taken a look at the spec change yet, but for multi-bucket we had some discussions last year. For reference, the PR is here: #8259, with more discussion in #8579.

@@ -540,7 +540,7 @@ Notes:
5. For multi-argument bucketing, the hashes are `xor`'ed: `hash(col1) ⊕ hash(col2) ⊕ ... ⊕ hash(colN)) % W`.
Member

Is there a specific reason to choose xor but not other arithmetic operations?

Contributor Author

I took inspiration from Apache Hive. I think it plays well with the hashing.

@@ -1453,13 +1457,15 @@ Each sort field in the fields list is stored as an object with the following pro

| V1 | V2 | V3 | Field | JSON representation | Example |
|----------|----------|----------|------------------|---------------------|-------------|
| required | required | required | **`transform`** | `JSON string` | `bucket[4]` |
| required | required | omitted | **`source-id`** | `JSON int` | 1 |
| required | required | required¹ | **`transform`** | `JSON string` | `bucket[4]` |
Member

I guess the note is in the wrong place?

Co-authored-by: Gang Wu <[email protected]>
Labels: Specification Issues that may introduce spec changes.
6 participants