Skip to content

Conversation

@zikangh
Copy link
Contributor

@zikangh zikangh commented Oct 17, 2025

🥞 Stacked PR

Use this link to review incremental changes.


Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

This PR is adds rate limiting to getFileChange() which reads delta metadata to determines what data to process (offset management).

How was this patch tested?

Parameterized tests verifying parity between DSv1 (DeltaSource) and DSv2 (SparkMicroBatchStream).

Does this PR introduce any user-facing changes?

No

@zikangh zikangh changed the title minor change [WIP] getFileChangesWithRateLimit() Oct 17, 2025
@zikangh zikangh changed the title [WIP] getFileChangesWithRateLimit() [WIP] Add rate limiting to getFileChanges() for the dsv2 streaming source Oct 20, 2025
@zikangh zikangh changed the title [WIP] Add rate limiting to getFileChanges() for the dsv2 streaming source [kernel-spark] Add rate limiting to getFileChanges() for DSv2 streaming Oct 20, 2025
@zikangh
Copy link
Contributor Author

zikangh commented Oct 20, 2025

Hi @huan233usc @gengliangwang @tdas @jerrypeng could you please review this PR?


/**
* Interface for files that can be admitted by admission control in Delta streaming sources.
* This abstraction allows both Scala and Java IndexedFile implementations to be used with
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: Scala and Java -> v1 and v2

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.


/**
* Returns the size of the file in bytes.
* For files without a file action or files with unknown size, returns 0.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we just make this api can only be called when file action exists?

V1's current impl will just throw NPE if hasFileAction() is false

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

@zikangh zikangh requested a review from huan233usc October 23, 2025 01:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants