ingest storage: Refactor `ConcurrentFetchers.run` #13809

seizethedave · 2025-12-10T23:45:04Z

What this PR does

This is a followup from a comment to #12222 from @dimitarvdimitrov:

Block-builder: adopt concurrent fetcher from ingest storage. #12222 (comment)

which also makes me think that these selects should be somehow abstracted away - there are already 4 slightly different versions of that select in this function. idk if you have time to think about this or i should. either way - probably something for a follow-up PR

So this PR makes run() somewhat less complex by untangling its control flow and moving the delicately coded resource freeing into separate methods that use defer.

Checklist

{n/a} Tests updated.
{n/a} Documentation added.
{n/a} CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]. If changelog entry is not needed, please add the changelog-not-needed label to the PR.
{n/a} about-versioning.md updated with experimental features.

Note

Refactors ConcurrentFetchers.run by extracting fetch handling into handleWant, performAttempt, and isDone, which simplifies control flow and moves channel cleanup into defers; also renames a casHWM parameter.

Ingest/Kafka fetcher (pkg/storage/ingest/fetcher.go):
- Refactor ConcurrentFetchers.run:
  - Delegates per-want processing to new handleWant().
  - Introduces performAttempt() to encapsulate a single fetch attempt (including HWM updates, merge, backoff reset, and result dispatch).
  - Adds isDone() helper for shutdown checks.
  - Uses defer in handleWant to close(w.result) and finish spans; consolidates select/send logic for result delivery and waiting.
- Minor:
  - Rename function parameter in casHWM to highWatermark for clarity.

Written by Cursor Bugbot for commit 127f700.


Copilot

Pull request overview

This PR refactors the ConcurrentFetchers.run() method in the Kafka fetch logic by extracting complex inline logic into separate helper methods for improved code organization and readability.

Key Changes:

- Extracted fetch want handling into handleWant() method
- Extracted individual fetch attempt logic into performAttempt() method
- Added isDone() helper method for cleaner shutdown checking
- Fixed typo in casHWM function parameter name (highWwatermark → highWatermark)
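A rough sketch of how the extracted pieces could fit together, for readers without the diff at hand. It is not the code from pkg/storage/ingest/fetcher.go: the `fetcher` type, its `done` channel, the `fetchAll` helper, and the fields of `fetchWant` and `fetchResult` are assumptions made for illustration, and the real performAttempt also covers HWM updates, merging, and backoff, all omitted here.

```go
package main

import (
	"context"
	"fmt"
)

// fetchResult and fetchWant are simplified stand-ins; their fields are assumptions.
type fetchResult struct {
	Records []int
}

type fetchWant struct {
	startOffset, endOffset int64
	result                 chan fetchResult
}

// fetcher is a hypothetical, stripped-down ConcurrentFetchers.
type fetcher struct {
	done chan struct{}
}

// isDone reports whether shutdown was requested, without blocking.
func (f *fetcher) isDone(ctx context.Context) bool {
	select {
	case <-ctx.Done():
		return true
	case <-f.done:
		return true
	default:
		return false
	}
}

// performAttempt simulates one fetch attempt: it "fetches" a single offset,
// delivers it, and reports whether handleWant should attempt again.
func (f *fetcher) performAttempt(ctx context.Context, w *fetchWant) bool {
	res := fetchResult{Records: []int{int(w.startOffset)}}
	w.startOffset++
	select {
	case w.result <- res:
	case <-ctx.Done():
		return false
	case <-f.done:
		return false
	}
	return w.startOffset < w.endOffset
}

// handleWant owns w.result: the deferred close runs on every return path.
func (f *fetcher) handleWant(ctx context.Context, w fetchWant) {
	defer close(w.result)
	for w.startOffset < w.endOffset && !f.isDone(ctx) {
		if !f.performAttempt(ctx, &w) {
			return
		}
	}
}

// run is reduced to a loop that delegates each want to handleWant.
func (f *fetcher) run(ctx context.Context, wants <-chan fetchWant) {
	for {
		select {
		case <-ctx.Done():
			return
		case <-f.done:
			return
		case w, ok := <-wants:
			if !ok {
				return
			}
			f.handleWant(ctx, w)
		}
	}
}

// fetchAll drives a single want through run and collects what it delivers.
func fetchAll(start, end int64) []int {
	f := &fetcher{done: make(chan struct{})}
	wants := make(chan fetchWant)
	go f.run(context.Background(), wants)
	w := fetchWant{startOffset: start, endOffset: end, result: make(chan fetchResult)}
	wants <- w
	close(wants)
	var got []int
	for res := range w.result {
		got = append(got, res.Records...)
	}
	return got
}

func main() {
	fmt.Println(fetchAll(3, 6)) // prints [3 4 5]
}
```

Because the close lives in a single defer, the consumer's range loop ends however handleWant returns, which is the kind of cleanup the PR description says used to be delicately coded inline.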



Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.


Copilot · 2025-12-11T00:44:28Z

pkg/storage/ingest/fetcher.go

+		if !continueAttempts {
+			break
+		}
+	}


The loop may exit without sending any buffered records to the result channel. If the loop condition becomes false (e.g., errBackoff.Ongoing() returns false, isDone() returns true, or w.endOffset <= w.startOffset), any records in bufferedResult will be discarded instead of being sent to w.result. In the original code, bufferedResult was always sent to the channel before checking these conditions again. Consider sending any remaining bufferedResult before the handleWant function exits.

Suggested change

-	}
+	}
+	// Send any remaining bufferedResult before exiting.
+	if len(bufferedResult.Records) > 0 {
+		w.result <- bufferedResult
+	}

Yeah, interesting, I'll think about that.
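To make the concern in this thread concrete, here is a small simulation of a loop that buffers records and only hands them over when the consumer is ready. It is a sketch, not the fetcher's code: `handleWant` here takes hypothetical `batches` and `ready` parameters (where `ready[i]` stands in for whether a send succeeded after attempt i), and `delivered` is an illustrative helper. Whether the real loop could actually exit with a non-empty buffer is not settled in this thread.

```go
package main

import "fmt"

// fetchResult is a simplified stand-in; its field is an assumption.
type fetchResult struct {
	Records []int
}

// handleWant merges each batch into a buffer and sends the buffer only when
// ready[i] is true. flushOnExit toggles the final send from the suggestion.
func handleWant(batches [][]int, ready []bool, result chan<- fetchResult, flushOnExit bool) {
	defer close(result)
	var buffered fetchResult
	for i, batch := range batches {
		buffered.Records = append(buffered.Records, batch...)
		if ready[i] {
			result <- buffered
			buffered = fetchResult{}
		}
	}
	// Without this, records buffered after the last successful send are lost.
	if flushOnExit && len(buffered.Records) > 0 {
		result <- buffered
	}
}

// delivered counts how many records reach the consumer.
func delivered(flushOnExit bool) int {
	batches := [][]int{{1, 2}, {3}, {4, 5}}
	ready := []bool{true, false, false}
	// Capacity covers every in-loop send plus the final flush, so the
	// synchronous call below never blocks.
	result := make(chan fetchResult, len(batches)+1)
	handleWant(batches, ready, result, flushOnExit)
	n := 0
	for res := range result {
		n += len(res.Records)
	}
	return n
}

func main() {
	fmt.Println(delivered(false), delivered(true)) // prints 2 5
}
```

In this model three of the five records are dropped without the final send, which matches the scenario the review comment describes.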


Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.


seizethedave · 2025-12-11T01:17:55Z

pkg/storage/ingest/fetcher.go

 func (r *ConcurrentFetchers) run(ctx context.Context, wants chan fetchWant, logger log.Logger, highWatermark *atomic.Int64) {
 	defer r.wg.Done()

 	errBackoff := backoff.New(ctx, r.fetchBackoffConfig)


should every want get a fresh backoff?

would different wants get different behaviour from the kafka brokers? i don't think so. if they start getting errors, they slow down faster if they use a single backoff

dimitarvdimitrov

SGTM, this looks valid
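The trade-off discussed above can be illustrated with a toy exponential backoff. This is a sketch under assumptions: `expBackoff` and `delays` are hypothetical and are not the backoff type constructed by backoff.New in the snippet, and the simulation only models consecutive failures, not the reset that happens after a successful attempt.

```go
package main

import (
	"fmt"
	"time"
)

// expBackoff is a hypothetical, minimal exponential backoff.
type expBackoff struct {
	min, max time.Duration
	retries  int
}

// next returns the delay for the current failure and records the failure.
func (b *expBackoff) next() time.Duration {
	d := b.min << b.retries
	if d > b.max || d <= 0 {
		d = b.max
	}
	b.retries++
	return d
}

// reset is what a successful attempt would call.
func (b *expBackoff) reset() { b.retries = 0 }

// delays simulates `wants` consecutive wants that each fail `failures` times
// and returns the delay applied before each retry.
func delays(wants, failures int, shared bool) []time.Duration {
	var out []time.Duration
	b := &expBackoff{min: 100 * time.Millisecond, max: 2 * time.Second}
	for w := 0; w < wants; w++ {
		if !shared {
			// A fresh backoff per want starts again from the minimum delay.
			b = &expBackoff{min: b.min, max: b.max}
		}
		for f := 0; f < failures; f++ {
			out = append(out, b.next())
		}
	}
	return out
}

func main() {
	fmt.Println("shared:  ", delays(2, 2, true))
	fmt.Println("per-want:", delays(2, 2, false))
}
```

With a shared backoff the second want continues from where the first left off, so a run of errors slows the fetcher down sooner; with a fresh backoff each want begins at the minimum delay again.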

seizethedave added 4 commits December 8, 2025 20:47

Typo.

f0f6d8a

initial restructure.

738bb71

Revert "initial restructure."

bf843f2

This reverts commit 738bb71.

Retool run() so it's not so fragile.

2e13d34

seizethedave requested a review from Copilot December 10, 2025 23:45

Copilot started reviewing on behalf of seizethedave December 10, 2025 23:45

Copilot AI reviewed Dec 10, 2025


seizethedave added 4 commits December 10, 2025 16:22

ref semantics for fetchResult; return modified 16:22 up 5 days, 5:50,…

4e1d653

… 2 users, load averages: 3.31 3.61 3.22 USER TTY FROM LOGIN@ IDLE WHAT davidgrant console - Fri10 5days - davidgrant s000 - Fri10 - tmux when retrying.

making it to the end of performAttempt means we should re-attempt.

17904f0

Update doc comment.

10a1be5

Fix fetchResult reference problem. Replacing the value wasn't seen in…

daf2bc3

… reattempts.
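The commit messages above mention a reference problem with fetchResult. The diff itself is not shown here, so the following is only a general Go illustration of how replacing a struct passed by value can go unseen by the caller; `mergeByValue`, `mergeByPointer`, and `attempts` are hypothetical names, not functions from the PR.

```go
package main

import "fmt"

// fetchResult is a simplified stand-in; its field is an assumption.
type fetchResult struct {
	Records []int
}

// mergeByValue receives a copy, so replacing it never reaches the caller.
func mergeByValue(buffered fetchResult, fetched []int) {
	buffered = fetchResult{Records: append(buffered.Records, fetched...)}
	_ = buffered
}

// mergeByPointer writes through the pointer, so the caller sees the merge.
func mergeByPointer(buffered *fetchResult, fetched []int) {
	*buffered = fetchResult{Records: append(buffered.Records, fetched...)}
}

// attempts runs two simulated attempts and reports how many records the
// caller's buffer holds afterwards.
func attempts(usePointer bool) int {
	var buffered fetchResult
	for _, batch := range [][]int{{1}, {2, 3}} {
		if usePointer {
			mergeByPointer(&buffered, batch)
		} else {
			mergeByValue(buffered, batch)
		}
	}
	return len(buffered.Records)
}

func main() {
	fmt.Println(attempts(false), attempts(true)) // prints 0 3
}
```

Returning the modified value to the caller, as one of the commit titles hints, is the other common way to make such an update visible across attempts.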

seizethedave requested a review from Copilot December 11, 2025 00:34

Copilot started reviewing on behalf of seizethedave December 11, 2025 00:35

seizethedave changed the title ~~Davidgrant/ingest refact~~ Refactor ConcurrentFetchers.run Dec 11, 2025

seizethedave added the changelog-not-needed label (PRs that don't need a CHANGELOG.md entry) Dec 11, 2025

Copilot AI reviewed Dec 11, 2025


seizethedave and others added 2 commits December 10, 2025 16:54

Tidying.

ec7e339

Update pkg/storage/ingest/fetcher.go

127f700

Co-authored-by: Copilot <[email protected]>

seizethedave requested a review from Copilot December 11, 2025 00:57

Copilot started reviewing on behalf of seizethedave December 11, 2025 00:58

Copilot AI reviewed Dec 11, 2025


seizethedave marked this pull request as ready for review December 11, 2025 01:08

seizethedave requested a review from a team as a code owner December 11, 2025 01:08

seizethedave changed the title ~~Refactor ConcurrentFetchers.run~~ ingest storage: Refactor ConcurrentFetchers.run Dec 11, 2025

seizethedave commented Dec 11, 2025


dimitarvdimitrov approved these changes Dec 11, 2025

