continuous-test: Add a handful of verification queries for OOO data by alexweav · Pull Request #14394 · grafana/mimir

alexweav · 2026-02-17T22:55:31Z

What this PR does

This PR adds several queries that assert on the new OOO data that the continuous tester exercises.
We assert:

A range and mixture of instants on in-order written points, spanning our OOO window.
A range with a finer step asserting on the dense, partially OOO written region
A few instant queries on the "border" between inorder and out-of-order data

We never enable results cache, and we minimize assertions on data that the regular continuous test might catch.

These queries have been tested on and off in a dev cell since into last week. This seemed to be a good balance of a small set of queries, that asserts the mixture of inorder and out-of-order samples in the same series.

Remember that this test is still disabled by default, as we harden it further.

Which issue(s) this PR fixes or relates to

Contrib https://github.com/grafana/mimir-squad/issues/3373

Checklist

Tests updated.
Documentation added.
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]. If changelog entry is not needed, please add the changelog-not-needed label to the PR.
about-versioning.md updated with experimental features.

Note

Low Risk
Changes are isolated to the optional continuous test and mostly add extra query/verification logic and unit tests; main risk is increased query load when the test is enabled.

Overview
Adds post-write verification to the WriteReadOOOTest by issuing additional instant and range queries against the OOO sine-wave metric and validating results via verifySamplesSum (always with results cache disabled).

Introduces helpers to compute query windows for both in-order (last 24h + instants) and out-of-order dense regions (24h window ending at the OOO lag border + instants near/before the border), including clamping to MaxQueryAge and step-alignment to avoid false positives; adds span-based logging for query executions.

Expands tests to cover the new time-range selection behavior and to assert that Run performs the expected write(s) and query calls under empty and partial history scenarios.

^{Written by Cursor Bugbot for commit 6d929f4. This will update automatically on new commits. Configure here.}

…norder samples

…uery generation

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

cursor · 2026-02-17T23:07:40Z

pkg/continuoustest/write_read_ooo_test.go

+	flagext.DefaultValues(&cfg)
+	cfg.MaxQueryAge = 2 * 24 * time.Hour
+
+	now := time.Unix(int64((10*24*time.Hour)+(2*time.Second)), 0)


Test timestamp computed from nanoseconds instead of seconds

Low Severity

time.Unix(int64((10*24*time.Hour)+(2*time.Second)), 0) converts a time.Duration (which is nanoseconds) to int64 and passes it to time.Unix which expects seconds. This creates a timestamp ~27 million years in the future instead of the intended "10 days + 2 seconds." The pre-existing test on line 35 uses the correct pattern: time.Unix(10*86400, 0). The tests still pass because all assertions use relative time comparisons, but the now value is wildly different from what was intended.

Additional Locations (1)

pkg/continuoustest/write_read_ooo_test.go#L342-L343

Haha, good catch, but all these tests are very careful to not depend on an overly specific definition of now. So, it's of little consequence other than aesthetic.

I think, this suggestion was quite good, actually. Without a code comment, that mentions that this now value is meaningless, it could be confusing for a future code reader, who may think it's an honest bug.

Could we move the duration to nsec argument, to keep it both readable and sound:

// 10d and 10s (864002e9 nanos) after epoch now := time.Unix(0, int64(10*24*time.Hour + 2*time.Second))

narqo

I've left one nit, but the changes work for me, overall 🔥

narqo · 2026-02-19T10:09:48Z

pkg/continuoustest/write_read_ooo_test.go

+	flagext.DefaultValues(&cfg)
+	cfg.MaxQueryAge = 2 * 24 * time.Hour
+
+	now := time.Unix(int64((10*24*time.Hour)+(2*time.Second)), 0)


I think, this suggestion was quite good, actually. Without a code comment, that mentions that this now value is meaningless, it could be confusing for a future code reader, who may think it's an honest bug.

Could we move the duration to nsec argument, to keep it both readable and sound:

// 10d and 10s (864002e9 nanos) after epoch now := time.Unix(0, int64(10*24*time.Hour + 2*time.Second))

alexweav added 13 commits February 13, 2026 14:38

dry-run query inorder samples

397cb4f

force the random instant query to be minute-aligned so it only hits i…

d30872a

…norder samples

dry-run out of order queries

17530a3

add write summary showing border for debugging

6d65d92

minor refactor

966eb40

execute inorder instant queries

a08ba2b

validate instant query results

55334b2

start validating ooo samples

2f6c92f

issue range queries

82ab2ea

add a limiter to a query that grows to too long; add tests covering q…

0391140

…uery generation

tiny comment nudge

98b721b

write a couple more e2e-adjacent tests that exercise the full Run flow

da2eceb

cleanup

6d929f4

alexweav added the changelog-not-needed PRs that don't need a CHANGELOG.md entry label Feb 17, 2026

alexweav requested a review from a team as a code owner February 17, 2026 22:55

cursor bot reviewed Feb 17, 2026

View reviewed changes

narqo approved these changes Feb 19, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

continuous-test: Add a handful of verification queries for OOO data#14394

continuous-test: Add a handful of verification queries for OOO data#14394
alexweav wants to merge 13 commits intomainfrom
alexweav/ooo-cont-test-query

alexweav commented Feb 17, 2026 •

edited by cursor bot

Loading

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Feb 17, 2026

Uh oh!

alexweav Feb 17, 2026 •

edited

Loading

Uh oh!

narqo Feb 19, 2026

Uh oh!

narqo left a comment

Uh oh!

narqo Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

alexweav commented Feb 17, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does

Which issue(s) this PR fixes or relates to

Checklist

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Feb 17, 2026

Choose a reason for hiding this comment

Test timestamp computed from nanoseconds instead of seconds

Uh oh!

alexweav Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

narqo Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

narqo left a comment

Choose a reason for hiding this comment

Uh oh!

narqo Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

alexweav commented Feb 17, 2026 •

edited by cursor bot

Loading

alexweav Feb 17, 2026 •

edited

Loading