fix: preallocate compaction task id range per loop and assert bounds #24654

Li0k · 2026-01-30T07:58:23Z

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

As title

Checklist

I have written necessary rustdoc comments.
I have added necessary unit tests and integration tests.
I have added test labels as necessary.
I have added fuzzing tests or opened an issue to track them.
My PR contains breaking changes.
My PR changes performance-critical code, so I will run (micro) benchmarks and present the results.
I have checked the Release Timeline and Currently Supported Versions to determine which release branches I need to cherry-pick this PR into.

Documentation

My PR needs documentation updates.

Release note

Copilot

Pull request overview

This PR optimizes compaction task ID allocation by pre-allocating a range of IDs per loop iteration instead of fetching them individually. This reduces the number of SQL transactions needed when generating multiple compaction tasks.

Changes:

Added next_compaction_task_id_interval function to allocate multiple task IDs at once
Modified compaction task selection loop to pre-allocate IDs and assert bounds
Updated iceberg compaction manager to use the new helper function
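
The idea behind the changes above can be sketched as follows. This is a minimal illustration rather than the code in this PR: `MockSequence`, `next_interval`, and `assign_task_ids` are hypothetical names, and an in-memory counter stands in for the SQL-backed sequence. It assumes the interval call returns the first ID of the reserved range, as the diff's use of `next_compaction_task_id_interval` suggests.

```rust
/// Hypothetical in-memory stand-in for the SQL-backed id sequence.
/// `round_trips` counts how many simulated transactions were issued.
struct MockSequence {
    next: u64,
    round_trips: u32,
}

impl MockSequence {
    fn new(start: u64) -> Self {
        Self { next: start, round_trips: 0 }
    }

    /// Reserve `interval` consecutive ids in one round trip and return the first.
    /// The caller then owns the half-open range [start, start + interval).
    fn next_interval(&mut self, interval: u32) -> u64 {
        let start = self.next;
        self.next += u64::from(interval);
        self.round_trips += 1;
        start
    }
}

/// Hand out one id per group from a single reserved range,
/// instead of one sequence call per group.
fn assign_task_ids(seq: &mut MockSequence, group_count: usize) -> Vec<u64> {
    let capacity = u32::try_from(group_count).expect("group count fits in u32");
    let start = seq.next_interval(capacity);
    let end = start + u64::from(capacity);
    (start..end).collect()
}
```

With three groups, the sketch issues one simulated round trip instead of three.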

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File	Description
src/meta/src/hummock/manager/sequence.rs	Added new function to allocate multiple compaction task IDs in a single call
src/meta/src/hummock/manager/compaction/mod.rs	Pre-allocates task ID range before loop and replaces per-iteration allocation with local increment
src/meta/src/manager/iceberg_compaction.rs	Refactored to use the existing helper function instead of direct sequence call

Copilot · 2026-01-30T07:58:56Z

src/meta/src/hummock/manager/compaction/mod.rs


-            // StoredIdGenerator already implements ids pre-allocation by ID_PREALLOCATE_INTERVAL.
-            let task_id = next_compaction_task_id(&self.env).await?;
+            assert!(next_task_id < task_id_end);


The assertion should check next_task_id < task_id_end BEFORE using task_id, but the assignment happens after the assert. If the assert fails, we'd still have incremented past the end. Consider moving the assignment of task_id after the assertion, or restructure to check the condition before both the assignment and increment.
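
One way to make the ordering explicit is to wrap the range in a small helper that checks the bound, reads the ID, and only then advances. The sketch below is illustrative only; `TaskIdRange`, `take`, and `remaining` are hypothetical names that do not appear in the PR.

```rust
/// Hypothetical helper: a half-open range [next, end) of pre-allocated ids.
struct TaskIdRange {
    next: u64,
    end: u64,
}

impl TaskIdRange {
    fn new(start: u64, capacity: u32) -> Self {
        Self { next: start, end: start + u64::from(capacity) }
    }

    /// Check the bound first, then read the id, then advance.
    /// Panics if the range is exhausted; `next` is not modified in that case.
    fn take(&mut self) -> u64 {
        assert!(self.next < self.end, "pre-allocated task id range exhausted");
        let id = self.next;
        self.next += 1;
        id
    }

    /// Number of ids still available in the range.
    fn remaining(&self) -> u64 {
        self.end - self.next
    }
}
```

Because the assertion runs before the read and the increment, a failed check can never leave the cursor past `end`.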

Copilot · 2026-01-30T07:58:57Z

src/meta/src/hummock/manager/compaction/mod.rs

+        // Pre-allocate task ids for this loop to avoid per-group SQL transactions.
+        let task_id_capacity = compaction_groups.len() as u32;
+        let mut next_task_id =
+            next_compaction_task_id_interval(&self.env, task_id_capacity).await?;
+        let task_id_end = next_task_id + u64::from(task_id_capacity);


Pre-allocating exactly compaction_groups.len() task IDs may be insufficient. The loop can exit early if pick_tasks.len() >= max_select_count or if a group selector returns None, meaning not all allocated IDs will be used. However, if the loop continues past allocated capacity (e.g., if groups are processed multiple times or the logic changes), the assertion at line 410 will panic. Consider either: (1) allocating min(compaction_groups.len(), max_select_count) IDs, or (2) adding logic to handle exhaustion of pre-allocated IDs gracefully rather than panicking.

Li0k · 2026-01-30T07:59:12Z

@zwang28 @hzxa21 Do you remember why each group fetches only one normal-task per pull?

Preallocate compaction task ID range per loop to avoid per-group SQL transactions. Capacity is min(group_count, max_select_count) since each group consumes at most one task ID (trivial tasks share the same ID with normal task). Fallback to single ID allocation if pre-allocated range is exhausted.
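
The capacity rule and the fallback described above could look roughly like this. It is a sketch under assumptions, not the merged implementation: `CountingSequence`, `TaskIdAllocator`, `reserve`, and `next_id` are hypothetical names, and an in-memory counter replaces the real sequence so the number of round trips can be observed.

```rust
/// Hypothetical stand-in for the id sequence; counts simulated round trips.
struct CountingSequence {
    next: u64,
    round_trips: u32,
}

impl CountingSequence {
    /// Reserve `count` consecutive ids and return the first one.
    fn reserve(&mut self, count: u32) -> u64 {
        let start = self.next;
        self.next += u64::from(count);
        self.round_trips += 1;
        start
    }
}

/// Hypothetical allocator holding a pre-allocated half-open range [next, end).
struct TaskIdAllocator {
    next: u64,
    end: u64,
}

impl TaskIdAllocator {
    /// Pre-allocate min(group_count, max_select_count) ids in one round trip.
    fn new(seq: &mut CountingSequence, group_count: usize, max_select_count: usize) -> Self {
        let capacity = u32::try_from(group_count.min(max_select_count))
            .expect("capacity fits in u32");
        let start = seq.reserve(capacity);
        Self { next: start, end: start + u64::from(capacity) }
    }

    /// Use the pre-allocated range while it lasts; once it is exhausted,
    /// fall back to reserving a single id instead of panicking.
    fn next_id(&mut self, seq: &mut CountingSequence) -> u64 {
        if self.next < self.end {
            let id = self.next;
            self.next += 1;
            id
        } else {
            seq.reserve(1)
        }
    }
}
```

In this sketch, five groups with a `max_select_count` of two reserve only two IDs up front; a third request still succeeds, at the cost of one extra round trip.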

Li0k added 2 commits January 30, 2026 15:53

fix: preallocate compaction task id range per loop and assert bounds

fe81d45

typo

c394583

Li0k requested review from Copilot, hzxa21 and zwang28 January 30, 2026 07:58

github-actions bot added type/fix Type: Bug fix. Only for pull requests. ci/run-e2e-single-node-tests ci/run-e2e-test-other-backends ci/run-e2e-iceberg-tests labels Jan 30, 2026

Copilot AI reviewed Jan 30, 2026
