feat: flownode inc query checkpoint by discord9 · Pull Request #8132 · GreptimeTeam/greptimedb

discord9 · 2026-05-19T07:26:22Z

I hereby agree to the terms of the GreptimeDB CLA.

Refer to a related PR or issue link (optional)

What's changed and what's your intention?

PR Checklist

Please convert it to a draft if some of the following conditions are not met.

I have written the necessary rustdoc comments.
I have added the necessary unit tests and integration tests.
This PR requires documentation updates.
API changes are backward compatible.
Schema or data changes are backward compatible.

Signed-off-by: discord9 <discord9@163.com>

gemini-code-assist

Code Review

This pull request implements incremental read support for batching flows by introducing a state machine that manages transitions between full snapshot and incremental modes based on region watermarks. Key changes include logic to analyze and rewrite aggregate plans into delta-left-join-sink merges, extensions to the FrontendClient for handling terminal metrics, and the addition of comprehensive integration tests. Feedback from the review focuses on improving the robustness of error handling by replacing brittle string parsing with structured error variants and simplifying the logic for validating incremental checkpoint advancement to remove redundant checks.

gemini-code-assist · 2026-05-19T07:35:40Z

+    pub fn inspect_query_error(err: &Error) -> FlowQueryFailure {
+        let debug = format!("{err:?}");
+        let stale_cursor = parse_stale_cursor_detail(&debug);
+        FlowQueryFailure { stale_cursor }
+    }


Parsing the debug representation of an error (format!("{err:?}")) is brittle and can easily break if the error formatting changes in the future. It would be more robust to introduce a specific error variant for "stale cursor" and propagate it as a structured error. This is especially important for correctly identifying connection-related errors, as only those should trigger a connection reset. While the TODO comment acknowledges this, this should be prioritized to improve the robustness of error handling.

References

Only reset a client connection for connection-related errors. Robust error identification is necessary to follow this rule.

gemini-code-assist · 2026-05-19T07:35:40Z

+    pub fn can_advance_incremental_checkpoints_with_participation(
+        &self,
+        participating_regions: &BTreeSet<u64>,
+        watermark_map: &HashMap<u64, u64>,
+    ) -> bool {
+        !self.checkpoints.is_empty()
+            && !participating_regions.is_empty()
+            && participating_regions.len() == watermark_map.len()
+            && participating_regions
+                .iter()
+                .all(|region_id| self.checkpoints.contains_key(region_id))
+            && participating_regions.iter().all(|region_id| {
+                let checkpoint = self.checkpoints.get(region_id);
+                watermark_map
+                    .get(region_id)
+                    .zip(checkpoint)
+                    .is_some_and(|(seq, checkpoint)| seq >= checkpoint)
+            })
+    }


This function's logic is a bit complex and can be simplified for better readability and maintainability. The check participating_regions.iter().all(|region_id| self.checkpoints.contains_key(region_id)) is redundant, as the subsequent check already covers this. Using a match statement can also make the logic more explicit and easier to understand than zip().is_some_and().

pub fn can_advance_incremental_checkpoints_with_participation( &self, participating_regions: &BTreeSet<u64>, watermark_map: &HashMap<u64, u64>, ) -> bool { !self.checkpoints.is_empty() && !participating_regions.is_empty() && participating_regions.len() == watermark_map.len() && participating_regions.iter().all(|region_id| { match (self.checkpoints.get(region_id), watermark_map.get(region_id)) { (Some(checkpoint), Some(seq)) => seq >= checkpoint, _ => false, } }) }

Signed-off-by: discord9 <discord9@163.com>

discord9 added 6 commits May 13, 2026 15:46

feat: inc query join rewrite helper

efd98df

Signed-off-by: discord9 <discord9@163.com>

chore: rm unwrap

3094a78

Signed-off-by: discord9 <discord9@163.com>

refactor: per review

c3f378d

Signed-off-by: discord9 <discord9@163.com>

chore

4b3efba

Signed-off-by: discord9 <discord9@163.com>

per review

0251113

Signed-off-by: discord9 <discord9@163.com>

refactor: per review

84c5bb9

Signed-off-by: discord9 <discord9@163.com>

github-actions Bot added size/XXL docs-not-required This change does not impact docs. labels May 19, 2026

gemini-code-assist Bot reviewed May 19, 2026

View reviewed changes

discord9 added 5 commits May 19, 2026 17:52

refactor: pre review

6d1a3b0

Signed-off-by: discord9 <discord9@163.com>

more per review

d4e9ec2

Signed-off-by: discord9 <discord9@163.com>

test: add expected plan test

90a119c

Signed-off-by: discord9 <discord9@163.com>

fix: disallow having

5ccfca5

Signed-off-by: discord9 <discord9@163.com>

feat: flownode inc query handle

4824025

Signed-off-by: discord9 <discord9@163.com>

discord9 force-pushed the flow-inc-pr3-flow-checkpoint branch from 02071a4 to 4824025 Compare May 20, 2026 03:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: flownode inc query checkpoint#8132

feat: flownode inc query checkpoint#8132
discord9 wants to merge 11 commits into
GreptimeTeam:mainfrom
discord9:flow-inc-pr3-flow-checkpoint

discord9 commented May 19, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 19, 2026

Uh oh!

gemini-code-assist Bot May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

discord9 commented May 19, 2026

Refer to a related PR or issue link (optional)

What's changed and what's your intention?

PR Checklist

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant