integration: direct node connectivity test #1189

wprzytula · 2025-02-01T15:26:25Z

Problem

I noticed that we lacked a test that requires direct connectivity to all nodes in the cluster in order to pass. Other tests would often succeed even if only one node (the initial contact point) was directly reachable.

The issue manifested when testing serverless cloud while working on rustls support: tests would pass even with address translation disabled...

Side note

One test that does exercise direct connectivity to all nodes is tablets.rs, yet it can be enabled only for the non-cloud case, as it uses the proxy.

Solution

I wrote a test that iterates through all targets (pairs (node, shard)) and sends a request directly to each of them (using a load balancing policy that produces singleton query plans).
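The idea can be sketched with a small self-contained model. This is not the test from this PR and does not use the driver's API: `NodeInfo`, `Target`, `all_targets`, `singleton_plan` and `unreachable_targets` are hypothetical names, and the `send` callback stands in for a real request. The point it illustrates is that a one-element plan leaves no other node to fall back to, so an unreachable target shows up as a failure.

```rust
/// Hypothetical description of a node: an address plus its shard count.
pub struct NodeInfo {
    pub address: String,
    pub shard_count: u32,
}

/// A target is a (node, shard) pair, as in the PR description.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Target {
    pub address: String,
    pub shard: u32,
}

/// Enumerates every (node, shard) pair in the modelled cluster.
pub fn all_targets(nodes: &[NodeInfo]) -> Vec<Target> {
    nodes
        .iter()
        .flat_map(|node| {
            (0..node.shard_count).map(move |shard| Target {
                address: node.address.clone(),
                shard,
            })
        })
        .collect()
}

/// A singleton query plan: exactly one target and no fallback.
pub fn singleton_plan(target: &Target) -> Vec<Target> {
    vec![target.clone()]
}

/// Sends one request per target through `send` and returns the targets
/// for which the request failed.
pub fn unreachable_targets<F>(nodes: &[NodeInfo], mut send: F) -> Vec<Target>
where
    F: FnMut(&Target) -> Result<(), String>,
{
    let mut failed = Vec::new();
    for target in all_targets(nodes) {
        let plan = singleton_plan(&target);
        // The plan has a single entry, so there is exactly one attempt.
        if plan.iter().any(|t| send(t).is_err()) {
            failed.push(target);
        }
    }
    failed
}
```

With a plan of several nodes, a failure on one target could be masked by a successful attempt on the next one, which is how other tests could pass with only the contact point reachable.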

Bonus

I extracted and refactored some utils from tablets.rs to utils.rs, so that they can be used by other tests.

Pre-review checklist

I have split my patch into logically separate commits.
All commit messages clearly explain what they change and why.
I added relevant tests for new features and bug fixes.
All commits compile, pass static checks and pass tests.
PR description sums up the changes and reasons why they should be introduced.
~~[ ] I have provided docstrings for the public items that I want to introduce.~~
~~[ ] I have adjusted the documentation in ./docs/source/.~~
~~[ ] I added appropriate Fixes: annotations to PR description.~~

github-actions · 2025-02-01T15:31:57Z

cargo semver-checks found no API-breaking changes in this PR.
Checked commit: 05470af

Lorak-mmk · 2025-02-03T14:32:06Z

scylla/tests/integration/tablets.rs


            // I expect Scylla to not send feedback for unprepared queries,
            // as such queries cannot be token-aware anyway
-            send_unprepared_query_everywhere(
+            execute_unprepared_statement_everywhere(
                &session,
                session.get_cluster_data().as_ref(),
                &Query::new(format!("INSERT INTO {ks}.t (a, b, c) VALUES (1, 1, 'abc')")),


🤔 I don't really like "execute" used with unprepared statements, because EXECUTE is a CQL command to execute prepared statements. Would "send_unprepared_statement_everywhere" be an acceptable name?

I've been thinking that once we do the execution API refactor for 2.0, there will be only one method for all kinds of statements: execute. Do you agree? If so, then why not use it here?

I've been thinking about it lately, and I came to the conclusion that even in the new execution API we should keep separate methods for unprepared statements, prepared statements, and batches.
Why? The choice of the query type should be more conscious, and a single method makes it less obvious.
Also, the interface would be simpler to learn: the user would still have methods on the struct that accept simple types, rather than traits for which the user has to research what actually implements them.

What I think the request execution refactor should be mostly about is enabling configuration of the request, meaning we can do session.execute(something).with_timestamp(....).paging_iter().await instead of having an exponential number of methods.
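A minimal sketch of that builder shape, assuming hypothetical names throughout: `RequestBuilder`, `RequestOptions`, `with_page_size`, `with_tracing` and `build` are invented for illustration and are not the driver's API. Only `with_timestamp` echoes the call chain quoted above. The sketch is synchronous and sends nothing; it only shows how each optional setting becomes one chainable method instead of doubling the number of entry points.

```rust
/// Hypothetical per-request options; not the driver's actual types.
#[derive(Debug, Default, Clone, PartialEq)]
pub struct RequestOptions {
    pub timestamp: Option<i64>,
    pub page_size: Option<i32>,
    pub tracing: bool,
}

/// Hypothetical builder that an `execute`-like method could return.
pub struct RequestBuilder {
    statement: String,
    options: RequestOptions,
}

impl RequestBuilder {
    pub fn new(statement: &str) -> Self {
        Self {
            statement: statement.to_string(),
            options: RequestOptions::default(),
        }
    }

    /// Each setting is a chainable method that consumes and returns the builder.
    pub fn with_timestamp(mut self, timestamp: i64) -> Self {
        self.options.timestamp = Some(timestamp);
        self
    }

    pub fn with_page_size(mut self, page_size: i32) -> Self {
        self.options.page_size = Some(page_size);
        self
    }

    pub fn with_tracing(mut self, tracing: bool) -> Self {
        self.options.tracing = tracing;
        self
    }

    /// Terminal step. A real driver would send the request here (and be
    /// async); this sketch only returns what would be sent.
    pub fn build(self) -> (String, RequestOptions) {
        (self.statement, self.options)
    }
}
```

Three optional settings need three methods here, whereas one dedicated method per combination would need eight.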

OK, makes sense.

OTOH, the fact that the CQL protocol has some specific names for execution of prepared and unprepared statements does not imply that those names are a good fit for the names of high-level user-facing API functions. It's not intuitive at all for anyone not well-versed in the CQL protocol that "query" is related to unprepared statements and "execute" to prepared statements.

If in the future we were to have distinct names for execution of different types of statements, then I'd go for:

execute_unprepared(),

execute_prepared(),

execute_batch().

The names then make it perfectly clear that the action is execution and the object is a particular kind of a statement.

I agree that the names do not need to be the same (but in that case the documentation should clearly state which CQL command it corresponds to).
The proposed names are a bit too long for my taste.

Can you agree with those names, now that we have established that execute() is going to be the common name for executing any kind of request in the future?

wprzytula · 2025-06-06T07:36:22Z

Rebased on main.

scylla/tests/integration/load_balancing/tablets.rs

scylla/tests/integration/utils.rs

wprzytula · 2025-06-06T10:15:31Z

Addressed the comments.

`execute_prepared_statement_everywhere` and `execute_unprepared_statement_everywhere` share similar logic that can be extracted to a generic function. Extracting `for_each_target_execute` will matter even more in the next commit, which makes its logic a bit more complex in order not to `unwrap()` the Sharder.

Before, `for_each_target_execute` would panic if a Node had no Sharder. There are two main possible causes: 1) the node is unsharded (e.g., a Cassandra node), 2) there are no open connections to the node, e.g. because the node is down. In both cases, we prefer not to panic in `for_each_target_execute` but rather to assume the node is unsharded and proceed. In case 1), execution of the request should succeed, whereas in case 2) we should get the ConnectionPoolError.
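The fallback described above could look roughly like the following simplified model. It is a sketch under stated assumptions, not the helper from this PR: `Node`, `ExecError` and the synchronous `execute` callback are hypothetical stand-ins, and the real helper works on the driver's node and sharder types.

```rust
/// Hypothetical stand-in for a node as seen by the test helper.
pub struct Node {
    pub name: String,
    /// `None` models a missing Sharder: the node is either unsharded
    /// or has no open connections.
    pub shard_count: Option<u32>,
}

/// Hypothetical error type; the variant name mirrors the error mentioned above.
#[derive(Debug, PartialEq)]
pub enum ExecError {
    ConnectionPool(String),
}

/// Calls `execute` once per (node, shard) target and returns the number of
/// executions. A node without shard information is treated as unsharded and
/// gets a single execution with no shard, instead of causing a panic.
pub fn for_each_target_execute<F>(nodes: &[Node], mut execute: F) -> Result<usize, ExecError>
where
    F: FnMut(&Node, Option<u32>) -> Result<(), ExecError>,
{
    let mut executed = 0;
    for node in nodes {
        match node.shard_count {
            Some(count) => {
                for shard in 0..count {
                    execute(node, Some(shard))?;
                    executed += 1;
                }
            }
            None => {
                // Previously this case would hit `unwrap()`; now the request
                // is attempted and any error comes from the execution itself.
                execute(node, None)?;
                executed += 1;
            }
        }
    }
    Ok(executed)
}
```

In this model an unsharded node gets a single execution that can succeed, while a node that is down surfaces whatever error the callback returns, which corresponds to the two cases listed above.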

wprzytula added the area/testing label (Related to tests - unit, integration, or even out of repo) Feb 1, 2025

wprzytula requested review from Lorak-mmk and muzarski February 1, 2025 15:26

wprzytula self-assigned this Feb 1, 2025

Lorak-mmk reviewed Feb 3, 2025

wprzytula mentioned this pull request Apr 14, 2025

Merge the hackathon branch and PRs #1271

Open

wprzytula added 2 commits June 6, 2025 08:35

integration: move query helpers from tablets to utils

5777e74

The following helpers are all usable universally, not only for tablets: - send_statement_everywhere, - send_unprepared_query_everywhere, so it makes sense to extract them to utils.rs.

integration: rename "everywhere" query helpers

f59d16e

The following helpers were renamed to better convey their semantics: - `send_statement_everywhere` -> `execute_prepared_statement_everywhere`, - `send_unprepared_query_everywhere` -> `execute_unprepared_statement_everywhere`.

wprzytula force-pushed the direct-connectivity-test branch from fdaa97e to 1ed71b8 Compare June 6, 2025 07:35

wprzytula requested a review from Lorak-mmk June 6, 2025 07:36

wprzytula added this to the 1.3.0 milestone Jun 6, 2025

wprzytula removed the request for review from muzarski June 6, 2025 08:05

Lorak-mmk requested changes Jun 6, 2025

scylla/tests/integration/load_balancing/tablets.rs (outdated)

scylla/tests/integration/utils.rs (outdated)

scylla/tests/integration/utils.rs (outdated)

integration: enhance execute_unprepared_statement_everywhere

9524ee4

`execute_unprepared_statement_everywhere` was limited to empty values; now any value list is supported.

wprzytula force-pushed the direct-connectivity-test branch from 1ed71b8 to 8504f52 Compare June 6, 2025 10:15

wprzytula requested a review from Lorak-mmk June 6, 2025 10:15

wprzytula added 4 commits June 6, 2025 12:27

cluster/state: document ClusterState

05470af

The struct was, interestingly, missing a docstring, despite being a core part of the cluster state management and introspection.

wprzytula force-pushed the direct-connectivity-test branch from 8504f52 to 05470af Compare June 6, 2025 10:27

Lorak-mmk approved these changes Jun 6, 2025

wprzytula merged commit 64d721e into scylladb:main Jun 6, 2025
12 checks passed

wprzytula deleted the direct-connectivity-test branch June 6, 2025 10:47
