Add checkpoint during progress reporting. #34828

shunping · 2025-05-03T03:53:42Z

addresses #33815

I verified that the previous stuck sample code in #33815 (comment) and #33815 (comment) is working with this PR's changes.

shunping · 2025-05-03T04:35:02Z

sdks/go/pkg/beam/runners/prism/internal/stage.go

-					continue progress
-				}
+				fraction = 0.5
+			} else if checkpointReady && unsplit {


I am wondering if we should do this checkpointing for both bounded and unbounded cases.

@lostluck: WDYT?

My intuition is telling me that we should not to add it for the bounded case. But the essence of Beam is to unify batch and streaming so it's probably fine.

I guess I'm worried about the situation where we're just oversplitting and forcing a checkpoint on a perfectly fine running Batch DoFn (processing efficiently and per element fast).

As it stands, this code looks like it will checkpoint a fast moving bundle after 1 second. (10 ticks, at 100ms per tick, since a fast moving bundle won't get a slower progress request rate).

But that's sort of wasteful. A fast moving bundle shouldn't be stopped. We might just want to only do this if the bundle is moving fast, but the input index isn't moving? Perhaps this is a reason why Dataflow has explicit Batch and Streaming modes of execution.

One almost wants to do it based on the number or amount of output data instead, in order to allow the watermark to progress.... But that would be much harder, and is overthinking it for now.

Checkpointing is always a trade-off. In theory, we don't want to checkpoint too often to hurt performance, while we also want to checkpoint sufficiently enough so the hard work can be materialized and saved.

A fast moving bundle shouldn't be stopped.

I think we can consider using the checkpoint ticks AND the amount or rate of output data ("totalCount") as the criteria to identify a fast-moving bundle (thousands of events per tick) that lasts reasonably long. Instead of 1 second, we can change it to 10 (or even longer) seconds for example.

Even if it is fast moving, we may still want to checkpoint to make sure we don't need to repeat the previous 10-second work if something goes bad.

After we have bundle retrying implemented, we can adjust the threshold of checkpoint ticks to longer or shorter based on how often we see an error in the bundle, how long does it take to check point, etc.

shunping · 2025-05-03T04:37:12Z

sdks/go/pkg/beam/runners/prism/internal/stage.go

+				continue progress
+			}
+			// Save residual roots for checkpoint. After checkpointing is successful,
+			// the bundle will be marked as finished and no residual roots will be


This is kind of a surprise. When a bundle finishes due to splitting with 0.0 fraction, no residual roots in the response. Is this by design?

It is expected and by design.

Were you seeing errors due to returning the residuals early in the split response for checkpointing case?

There are a few different cases to think about, but they're aligned with the two FnAPI calls in question.

Normal ProcessBundleResponse: Returns when the primary is completed, there are no residuals to worry about.

Split Response + ProcessBundleResponse:
The Split Response contains the confirmation of the primary (what the bundle will finish processing), and the residual that needs to be processed later. ProcessBundleResponse will not contain any residuals at this time, since they were already persisted by the split response (per the above).

Self Checkpointed ProcessBundleResponse: This is when the DoFn itself returns a process continuation for a specific element (eg. Resume in 10s or similar). The Primary is by definition completed, but there may be residuals to process later. That's what's returned and scheduled.

You're seeing 2 in this case. We shouldn't need to do any additional residual handling and processing after the bundle is finished here. I'd be a bit concerned that there is a data duplication risk when doing it this way (the same residuals getting "returned" twice.)

I see. So my current code is correct then, since I am using the Residual from SplitResponse (rather than Residual from ProcessBundleResponse in the original code) to compute watermark and residual data outside of the progress for loop.

QQ: On case 2, what if we have a split at fraction 0.5? Prior to my change, I think the code is relying on Residual from ProcessBundleResponse to update the watermark. However isn't the residual empty there after we have a split response?

So residualRoots is set on line 260, but the ResigdualRoots is called again and processed and sent back to the EM on line 282.

That variable is never un-set after processing for bundles.

Then after the bundle finishes, the residual roots are only overridden If and only if the final bundle has residual roots already. Therefore the cached roots from the split response might be processed a second time, being sent to the EM as part of PersistBundle, which then also reschedules them. So it may duplicate the residual data.

IIUC the better fix is to handle the output watermark estimate in the em.ReturnResidual call. Right now it's only happening in PersistBundle.

PersistBundle call: https://github.com/apache/beam/blob/master/sdks/go/pkg/beam/runners/prism/internal/engine/elementmanager.go#L905

ReturnResiduals call:
https://github.com/apache/beam/blob/master/sdks/go/pkg/beam/runners/prism/internal/engine/elementmanager.go#L1023

Alternatively the MinOutputWatermarks for residuals is independent of the specific data, so it should also be valid to collect/update them with splits, but not persist them until the PersistBundle call. That matches closely what you have here (and what works), but without the duplicated data.

MinOutputWatermarks are collected here: https://github.com/apache/beam/pull/34828/files#diff-c799dce79559a70660d7abb42fcbff8455ba41452bd9483fc5c58dfcf156ee8cR343

Map is created just above currently: https://github.com/apache/beam/pull/34828/files#diff-c799dce79559a70660d7abb42fcbff8455ba41452bd9483fc5c58dfcf156ee8cR322

So we'd just need to ensure we don't have "stale" watermarks being persisted for this holding things back by accident.

github-actions · 2025-05-03T04:38:45Z

Assigning reviewers. If you would like to opt out of this review, comment assign to next reviewer:

R: @jrmccluskey for label go.

Available commands:

stop reviewer notifications - opt out of the automated review tooling
remind me after tests pass - tag the comment author after tests pass
waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

The PR bot will only process comments in the main thread (not review comments).

lostluck · 2025-05-05T01:47:30Z

sdks/go/pkg/beam/runners/prism/internal/stage.go

+					slog.LogAttrs(context.TODO(), slog.LevelError, "returned empty residual application", slog.Any("bundle", rb))
+					panic("sdk returned empty residual application")
 				}
+				// TODO what happens to output watermarks on splits?


Well at least i left a note about the output watermarks when I last touched this.

I think the "correct" thing to do here is to collect them and apply them accordingly with PersistBundle, instead of reprocessing the whole set of residuals later (which is how the phone is rendering it. I'll need to reread it).

Right. I didn't quite follow the part where residual from split response is handled in the original code.

The latter part outside of the progress for loop makes more sense to me.

The Estimated input elements bit is tricky since it's about how to estimate where to split for Unbounded SDFs and how big a bundle "is". I can't recall exactly why it ended up with the "filter out residuals that are before the end of data", cases... but apparently it had to do with timers?

https://github.com/apache/beam/blame/master/sdks/go/pkg/beam/runners/prism/internal/stage.go#L253

github-actions · 2025-05-13T12:15:29Z

Reminder, please take a look at this pr: @jrmccluskey

github-actions · 2025-05-16T12:15:44Z

Assigning new set of reviewers because Pr has gone too long without review. If you would like to opt out of this review, comment assign to next reviewer:

R: @lostluck for label go.

Available commands:

stop reviewer notifications - opt out of the automated review tooling
remind me after tests pass - tag the comment author after tests pass
waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

shunping · 2025-05-16T12:32:48Z

waiting on author

shunping · 2025-05-16T12:34:20Z

Assign that back to me as I need to make some more changes.

github-actions · 2025-05-24T12:14:15Z

Reminder, please take a look at this pr: @lostluck

github-actions · 2025-05-28T12:15:33Z

Assigning new set of reviewers because Pr has gone too long without review. If you would like to opt out of this review, comment assign to next reviewer:

R: @jrmccluskey for label go.

Available commands:

stop reviewer notifications - opt out of the automated review tooling
remind me after tests pass - tag the comment author after tests pass
waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

shunping · 2025-06-02T12:48:59Z

Converting this back to draft for now as it needs some more thoughts.

github-actions · 2025-06-10T12:16:42Z

Reminder, please take a look at this pr: @jrmccluskey

github-actions · 2025-06-13T12:15:38Z

Assigning new set of reviewers because Pr has gone too long without review. If you would like to opt out of this review, comment assign to next reviewer:

R: @lostluck for label go.

Available commands:

stop reviewer notifications - opt out of the automated review tooling
remind me after tests pass - tag the comment author after tests pass
waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

github-actions · 2025-06-20T12:15:52Z

Reminder, please take a look at this pr: @lostluck

lostluck · 2025-06-23T20:08:38Z

waiting on author

(since the bot doesn't know how drafts work)

derrickaw · 2025-08-06T17:58:07Z

waiting on author

shunping · 2025-08-07T01:11:43Z

It is a draft PR, and I don't have bandwidth to move forward with that recently. Feel free to close it if the bot is making too much noise.

github-actions · 2025-08-14T12:16:44Z

Reminder, please take a look at this pr: @lostluck

github-actions · 2025-08-19T12:15:24Z

Assigning new set of reviewers because Pr has gone too long without review. If you would like to opt out of this review, comment assign to next reviewer:

R: @jrmccluskey for label go.

Available commands:

stop reviewer notifications - opt out of the automated review tooling
remind me after tests pass - tag the comment author after tests pass
waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

danowar2 · 2025-08-19T12:29:06Z

The place in the center

shunping · 2025-08-19T12:35:34Z

Closing this for now.

Add periodic checkpointing during progress reporting.

c2070db

shunping self-assigned this May 3, 2025

github-actions bot added go runners prism labels May 3, 2025

shunping changed the title ~~Add periodic checkpointing during progress reporting.~~ Add periodic checkpoint during progress reporting. May 3, 2025

shunping changed the title ~~Add periodic checkpoint during progress reporting.~~ Add checkpoint during progress reporting. May 3, 2025

shunping marked this pull request as ready for review May 3, 2025 04:33

shunping requested a review from lostluck May 3, 2025 04:33

shunping commented May 3, 2025

View reviewed changes

github-actions bot added the Next Action: Reviewers label May 3, 2025

lostluck reviewed May 5, 2025

View reviewed changes

github-actions bot added the slow-review label May 13, 2025

github-actions bot added reassigned-reviewers and removed slow-review labels May 16, 2025

github-actions bot added Next Action: Author and removed Next Action: Reviewers labels May 16, 2025

github-actions bot added Next Action: Reviewers and removed Next Action: Author labels May 16, 2025

github-actions bot added the slow-review label May 24, 2025

github-actions bot removed the slow-review label May 28, 2025

shunping marked this pull request as draft June 2, 2025 12:48

github-actions bot added the slow-review label Jun 10, 2025

github-actions bot removed the slow-review label Jun 13, 2025

github-actions bot added the slow-review label Jun 20, 2025

github-actions bot added Next Action: Author and removed Next Action: Reviewers slow-review labels Jun 23, 2025

github-actions bot added Next Action: Reviewers and removed Next Action: Author labels Aug 7, 2025

github-actions bot added the slow-review label Aug 14, 2025

github-actions bot removed the slow-review label Aug 19, 2025

shunping closed this Aug 19, 2025

Add checkpoint during progress reporting. #34828

Add checkpoint during progress reporting. #34828

Uh oh!

Conversation

shunping commented May 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shunping May 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

shunping May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

shunping May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented May 3, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented May 13, 2025

Uh oh!

github-actions bot commented May 16, 2025

Uh oh!

shunping commented May 16, 2025

Uh oh!

shunping commented May 16, 2025

Uh oh!

github-actions bot commented May 24, 2025

Uh oh!

github-actions bot commented May 28, 2025

Uh oh!

shunping commented Jun 2, 2025

Uh oh!

github-actions bot commented Jun 10, 2025

Uh oh!

github-actions bot commented Jun 13, 2025

Uh oh!

github-actions bot commented Jun 20, 2025

Uh oh!

lostluck commented Jun 23, 2025

Uh oh!

derrickaw commented Aug 6, 2025

Uh oh!

shunping commented Aug 7, 2025

Uh oh!

github-actions bot commented Aug 14, 2025

Uh oh!

github-actions bot commented Aug 19, 2025

Uh oh!

danowar2 commented Aug 19, 2025

Uh oh!

shunping commented Aug 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

shunping commented May 3, 2025 •

edited

Loading

shunping May 3, 2025 •

edited

Loading

shunping May 5, 2025 •

edited

Loading

shunping May 5, 2025 •

edited

Loading