
Fix Flaky Tests#28

Merged
ikolomi merged 9 commits into valkey-io:main from jeremyprime:flaky-tests
Nov 28, 2025
Conversation

@jeremyprime
Collaborator

@jeremyprime jeremyprime commented Nov 25, 2025

Summary

Updates the following flaky tests that sometimes fail:

  • ValkeyGlideClusterConnectionCommandsIntegrationTests.testClusterGetClusterInfo:278 (expects 4, but gets 5)
  • ReactiveValkeyMessageListenerContainerIntegrationTests.multipleListenShouldTrackSubscriptions (expects null, but gets message)
  • DefaultHyperLogLogOperationsIntegrationTests.sizeShouldCountValuesCorrectly:96 (expects 3, but gets 2)
  • ClusterSlotHashUtilsTests.localCalculationShouldMatchServers:60 » JedisConnection Unexpected end of stream. (connection is stale/already closed)

Note there is another change in #23 to fix the Maven cache, which may help with network-related flaky tests:

  • Could not transfer artifact org.jetbrains.kotlin:kotlin-compiler:jar:1.9.25 from/to central
  • gzip: stdin: not in gzip format when getting engine binary

Closes #26.

Testing

Tests pass locally and have passed in CI a few times (these tests failed only intermittently, so they will need to be observed over time).

Signed-off-by: Jeremy Parr-Pearson <jeremy.parr-pearson@improving.com>
@jeremyprime jeremyprime requested a review from ikolomi November 25, 2025 22:19

// Verify node count
assertThat(clusterInfo.getKnownNodes()).isEqualTo(EXPECTED_TOTAL_NODES);
// Wait for all cluster nodes to be available
Collaborator

I don't like retrying in the tests - we need to understand why it happens...

Collaborator Author

I'm not sure of the underlying reason, but getKnownNodes sometimes reports a higher count than it should (5 instead of 4). The node count appears to be temporarily inconsistent while the cluster is initializing. A short sleep or retry eventually settles on the expected number of nodes (4).
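The "poll until the cluster view converges" idea can be sketched as a small self-contained helper. This is an illustrative stand-in, not the project's test code: `pollUntilEquals` and the plain `IntSupplier` are hypothetical, and the real test would poll `clusterInfo.getKnownNodes()` against the expected node count instead.

```java
import java.util.function.IntSupplier;

public class ClusterSettle {
    // Poll the supplier until it returns the expected value or the timeout
    // elapses; returns whether the value settled on the expected number.
    static boolean pollUntilEquals(IntSupplier actual, int expected, long timeoutMs) {
        long deadline = System.currentTimeMillis() + timeoutMs;
        while (System.currentTimeMillis() < deadline) {
            if (actual.getAsInt() == expected) {
                return true; // view has converged
            }
            try {
                Thread.sleep(50); // brief back-off between polls
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                break;
            }
        }
        return actual.getAsInt() == expected;
    }

    public static void main(String[] args) {
        // Simulate a node count that is temporarily inflated (5) before
        // settling to the expected 4, as described above.
        long start = System.currentTimeMillis();
        IntSupplier knownNodes = () -> (System.currentTimeMillis() - start < 200) ? 5 : 4;
        System.out.println(pollUntilEquals(knownNodes, 4, 5000)); // true
    }
}
```

Compared with a fixed sleep, this bounds the wait but returns as soon as the count is correct, so the happy path stays fast.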

assertThat(c1Collector.poll(5, TimeUnit.SECONDS)).isNotNull();
assertThat(c2Collector.poll(100, TimeUnit.MILLISECONDS)).isNull();
// Wait for active subscription to receive message
await().atMost(Duration.ofSeconds(5))
Collaborator

Let's not use .untilAsserted in the test unless we understand the root cause.

Collaborator Author

It takes time for the message to propagate and for the disposed subscriber to stop listening, which is why the original test had a 0.5 s sleep. Sometimes that is not enough, and the disposed subscriber still receives the message when it should not. A longer sleep or a retry allows the subscription cleanup to complete before the message is sent.
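The assertions quoted above hinge on poll-with-timeout semantics: `poll` waits up to the timeout for a message and returns null if none arrives. A minimal sketch of that behavior, using plain `LinkedBlockingQueue` collectors as stand-ins for the test's message collectors (assumed names, not the project's code):

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

public class PollSemantics {
    public static void main(String[] args) throws InterruptedException {
        LinkedBlockingQueue<String> active = new LinkedBlockingQueue<>();
        LinkedBlockingQueue<String> disposed = new LinkedBlockingQueue<>();

        // Deliver a message asynchronously to the still-active subscription only.
        new Thread(() -> {
            try {
                Thread.sleep(100);
                active.offer("message");
            } catch (InterruptedException ignored) { }
        }).start();

        // Active subscription: poll blocks until the message arrives (non-null).
        System.out.println(active.poll(5, TimeUnit.SECONDS));          // message
        // Disposed subscription: nothing arrives, poll times out, returns null.
        System.out.println(disposed.poll(100, TimeUnit.MILLISECONDS)); // null
    }
}
```

The flakiness arises when subscription teardown has not finished before the publish: the "disposed" queue then still receives the message, and the second poll is non-null.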

}
}

jedis.close();
Collaborator

Why is this code problematic? Shouldn't the jedis object be explicitly closed?

Collaborator Author

This issue happens less often, but it seems to be caused by the connection pool occasionally closing the connection prematurely, so the test fails when jedis.close() is explicitly called on an already-closed connection. Try-with-resources closes the connection without throwing an error if it was already closed.
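The underlying pattern can be sketched with a stand-in `Connection` class (hypothetical, not the Jedis API): an idempotent close means a connection that the pool already reclaimed can still be closed safely on scope exit.

```java
public class SafeClose {
    // Stand-in for a pooled connection; close() is idempotent by design.
    static class Connection implements AutoCloseable {
        private boolean closed = false;

        // The pool may close the connection before the test does.
        void closeFromPool() { closed = true; }

        @Override
        public void close() {
            if (closed) {
                return; // already closed elsewhere: no-op instead of throwing
            }
            closed = true;
        }
    }

    public static void main(String[] args) {
        Connection pooled = new Connection();
        // try-with-resources guarantees close() runs on scope exit, even if
        // the pool already reclaimed the connection in the meantime.
        try (Connection c = pooled) {
            pooled.closeFromPool(); // simulate the pool closing it prematurely
        } // c.close() here is a safe no-op
        System.out.println("no exception thrown");
    }
}
```

The key point is that the double-close becomes harmless, rather than a stale-connection error surfacing at the explicit `close()` call.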

@ikolomi
Collaborator

ikolomi commented Nov 26, 2025

  1. I would not touch the existing bugs in SpringDataRedis until we stabilize, or at least until after alpha.
  2. .untilAsserted() should be used as a last resort - why do we even need to retry? Let's try to fix the root cause.

@jeremyprime
Collaborator Author

  1. I would not touch the existing bugs in SpringDataRedis until we stabilize, or at least until after alpha.
  2. .untilAsserted() should be used as a last resort - why do we even need to retry? Let's try to fix the root cause.

Most failing tests seem to be timing issues that only occur in our CI. ValkeyGlideClusterConnectionCommandsIntegrationTests.testClusterGetClusterInfo fails most often, but some of the existing spring-data-redis tests fail frequently as well, especially ReactiveValkeyMessageListenerContainerIntegrationTests.multipleListenShouldTrackSubscriptions.

I replaced untilAsserted with a sleep (or just increased the existing sleep) to simplify things.

@ikolomi ikolomi merged commit 36e7276 into valkey-io:main Nov 28, 2025
106 checks passed
@jeremyprime jeremyprime deleted the flaky-tests branch December 15, 2025 19:11


Development

Successfully merging this pull request may close these issues.

ValkeyGlideClusterConnectionCommandsIntegrationTests.testClusterGetClusterInfo is flaky with github runner
