Refactor IcebergPartitionTransform by sfc-gh-abozkurt · Pull Request #141 · Snowflake-Labs/pg_lake

sfc-gh-abozkurt · 2026-01-09T16:11:11Z

Separate ParsedIcebergPartitionTransform (parser step) and IcebergPartitionTransform (analyzer step).

sfc-gh-abozkurt · 2026-01-09T16:11:32Z

 	if (minText != NULL && maxText != NULL)
 	{
-		*names = lappend(*names, colName);
+		*names = lappend(*names, pstrdup(colName));


unrelated warning fix


sfc-gh-okalaci

Thanks, I think this looks good in principle, but we should perhaps defer merging this until after the tag. Let's not add any optional changes during the release testing period.

sfc-gh-okalaci · 2026-01-12T06:36:16Z

 	if (minText != NULL && maxText != NULL)
 	{
-		*names = lappend(*names, colName);
+		*names = lappend(*names, pstrdup(colName));


colName comes from TextDatumGetCString, which already returns a freshly allocated copy, so doing another pstrdup seems confusing.

Can't we cast to non-const to avoid the warning?
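
To make the two options in this comment concrete, here is a minimal standalone sketch. It is not pg_lake code: `PtrList`, `PtrListAppend`, `AppendCopy`, and `AppendCast` are hypothetical names, and `malloc`/`realloc` stand in for PostgreSQL's `palloc`-based allocation so the snippet compiles without the server headers. It assumes the warning comes from passing a `const char *` to an append function that takes `void *`.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a list of untyped, non-const pointers. */
typedef struct PtrList
{
	void	  **items;
	size_t		length;
} PtrList;

/* Append one pointer; the list does not copy what it points to. */
void
PtrListAppend(PtrList *list, void *item)
{
	void	  **grown = realloc(list->items, (list->length + 1) * sizeof(void *));

	assert(grown != NULL);
	list->items = grown;
	list->items[list->length++] = item;
}

/* Option A: duplicate the string. No warning, but one extra allocation. */
void
AppendCopy(PtrList *list, const char *colName)
{
	size_t		size = strlen(colName) + 1;
	char	   *copy = malloc(size);

	assert(copy != NULL);
	memcpy(copy, colName, size);
	PtrListAppend(list, copy);
}

/* Option B: cast away const. No copy; the list shares the caller's buffer. */
void
AppendCast(PtrList *list, const char *colName)
{
	PtrListAppend(list, (char *) colName);
}
```

Option B is only safe if the buffer really is writable memory that outlives the list, which is the case the comment describes (the string was already allocated for this caller). Option A avoids that question at the cost of a second copy.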

sfc-gh-okalaci · 2026-01-12T06:40:02Z

-
-	/* transform name, e.g. bucket[3] */
-	const char *transformName;
+	IcebergPartitionSpecField *specField;


Hm, aren't we now introducing some edge cases when specField == NULL?

Can we use IcebergPartitionSpecField specField; instead?
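
The difference between the two member declarations can be sketched as follows. This is an illustration under assumptions, not the actual pg_lake definitions: `SpecFieldSketch` is a hypothetical, trimmed-down stand-in for `IcebergPartitionSpecField`, and the two wrapper structs and accessor functions are invented for the example.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, trimmed-down stand-in for IcebergPartitionSpecField. */
typedef struct SpecFieldSketch
{
	int			field_id;
	const char *transform;		/* e.g. "bucket[3]" */
} SpecFieldSketch;

/* Pointer member: every reader has to consider specField == NULL. */
typedef struct TransformWithPointer
{
	SpecFieldSketch *specField;
} TransformWithPointer;

/* Embedded member: the field always exists once the transform exists. */
typedef struct TransformWithValue
{
	SpecFieldSketch specField;
} TransformWithValue;

int
FieldIdFromPointer(const TransformWithPointer *transform, int fallback)
{
	assert(transform != NULL);

	/* The extra branch is the edge case the pointer member introduces. */
	if (transform->specField == NULL)
		return fallback;

	return transform->specField->field_id;
}

int
FieldIdFromValue(const TransformWithValue *transform)
{
	assert(transform != NULL);

	/* No NULL check needed: the struct is stored inline. */
	return transform->specField.field_id;
}
```

The embedded form removes the NULL branch, but assigning into it copies the struct by value, so any pointer members inside it are shared with the source unless they are copied separately.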

sfc-gh-okalaci · 2026-01-20T06:13:55Z

@sfc-gh-abozkurt can you please rebase?


sfc-gh-okalaci

I still think that switching from DataFileSchemaField *sourceField; to DataFileSchemaField sourceField; (same for the other struct) is safer. We are passing these structs around, and with pointers it would be hard to track the memory ownership.

So, let's make the copying safer.

Does that make sense to you as well?

But then we should do that properly.

sfc-gh-okalaci · 2026-01-22T10:46:45Z

-	transform->partitionFieldId = specField->field_id;
-	transform->partitionFieldName = pstrdup(specField->name);
-	transform->transformName = pstrdup(specField->transform);
+	transform->specField = *specField;


with the change from IcebergPartitionSpecField *specField to IcebergPartitionSpecField specField -- sorry I suggested that -- this is now a shallow copy.

IcebergPartitionSpecField has a source_ids pointer, so after this assignment both the original and the copy point to the same underlying array. Ownership becomes unclear and we could end up freeing it in multiple places.

We should perhaps add a DeepCopyIcebergPartitionSpecField() helper and use it here (and anywhere else we do this kind of assignment).
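
A rough sketch of what such a helper might look like, and of the aliasing it avoids. Everything here is an assumption made for illustration: `SpecFieldSketch` only mirrors the members mentioned in this thread (`field_id`, `name`, `transform`, `source_ids`), `source_ids_length` is a hypothetical length field, and `malloc` stands in for `palloc` so the snippet compiles on its own.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for IcebergPartitionSpecField. */
typedef struct SpecFieldSketch
{
	int32_t		field_id;
	char	   *name;
	char	   *transform;
	int32_t    *source_ids;
	size_t		source_ids_length;	/* assumed length field */
} SpecFieldSketch;

/* Duplicate a C string; NULL stays NULL. */
char *
CopyString(const char *source)
{
	size_t		size;
	char	   *copy;

	if (source == NULL)
		return NULL;

	size = strlen(source) + 1;
	copy = malloc(size);
	assert(copy != NULL);
	memcpy(copy, source, size);
	return copy;
}

/* Return a copy that shares no heap memory with the source struct. */
SpecFieldSketch
DeepCopySpecFieldSketch(const SpecFieldSketch *source)
{
	/* Start from a shallow copy so scalar members come along for free. */
	SpecFieldSketch copy = *source;

	/* Then replace every pointer member with its own allocation. */
	copy.name = CopyString(source->name);
	copy.transform = CopyString(source->transform);
	copy.source_ids = NULL;

	if (source->source_ids != NULL && source->source_ids_length > 0)
	{
		size_t		bytes = source->source_ids_length * sizeof(int32_t);

		copy.source_ids = malloc(bytes);
		assert(copy.source_ids != NULL);
		memcpy(copy.source_ids, source->source_ids, bytes);
	}

	return copy;
}
```

With a plain `copy = *source` assignment, `copy.source_ids` and `source->source_ids` are the same pointer, so a write or free through one is visible through the other. After the deep copy each struct owns its array, which is what makes ownership trackable when the transform is passed around.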

sfc-gh-okalaci · 2026-01-22T10:49:38Z

-		transform->sourceField = GetDataFileSchemaFieldById(schema, specField->source_id);
+		DataFileSchemaField *sourceField = GetDataFileSchemaFieldById(schema, specField->source_id);
+
+		transform->sourceField = *sourceField;


Same shallow copy issue here - DataFileSchemaField has several pointer members (name, type, etc.). After this assignment, both point to the same underlying strings/structs.
We could add a DeepCopyDataFileSchemaField() helper (there's already DeepCopyField() in field.h we could leverage).


sfc-gh-dachristensen · 2026-03-31T22:56:03Z

@sfc-gh-abozkurt @sfc-gh-okalaci is this something that is worth getting rebased and into the next release?

sfc-gh-abozkurt · 2026-04-01T08:12:25Z

> @sfc-gh-abozkurt @sfc-gh-okalaci is this something that is worth getting rebased and into the next release?

No, but it might be good to get #173 in.

sfc-gh-abozkurt force-pushed the aykut/refactor-transform branch from 374d30a to a080db8 Compare January 9, 2026 16:14

sfc-gh-abozkurt requested a review from sfc-gh-okalaci January 9, 2026 16:14

sfc-gh-okalaci reviewed Jan 12, 2026

sfc-gh-abozkurt added 2 commits January 20, 2026 14:36

Refactor IcebergPartitionTransform

48d8555

IcebergPartitionTransform can embed a IcebergPartitionSpecField by removing some individual fields, which makes it less verbose. Signed-off-by: Aykut Bozkurt <aykut.bozkurt@snowflake.com>

separate structs for parse and analyze

a905847

Signed-off-by: Aykut Bozkurt <aykut.bozkurt@snowflake.com>

sfc-gh-abozkurt force-pushed the aykut/refactor-transform branch from 0cbf801 to a905847 Compare January 20, 2026 11:36

sfc-gh-abozkurt requested a review from sfc-gh-okalaci January 20, 2026 11:36

sfc-gh-okalaci requested changes Jan 22, 2026

sfc-gh-abozkurt force-pushed the aykut/refactor-transform branch from 937c534 to 8f41a5b Compare January 26, 2026 13:02

deep copy spec field

d7db991

Signed-off-by: Aykut Bozkurt <aykut.bozkurt@snowflake.com>

sfc-gh-abozkurt force-pushed the aykut/refactor-transform branch from 8f41a5b to d7db991 Compare January 26, 2026 13:19

sfc-gh-abozkurt requested a review from sfc-gh-okalaci February 11, 2026 21:03

Refactor IcebergPartitionTransform #141: sfc-gh-abozkurt wants to merge 3 commits into main from aykut/refactor-transform
