Manager 3.5.0 release updates #10586
Since Manager 3.5 is out, the upgrade tests now need to start from 3.4.* versions. Manager 3.4.1 and 3.4.2 are the two versions currently used in production in Cloud.
Scylla 2023.1 is not officially supported by the latest Manager release. Because of that, the older enterprise jobs are switched to run with 2024.1. All Debian jobs are set to run with 2024.2 to keep coverage for this release after the majority of jobs switch to 2025.1.
I would add a 2025.1 backport, due to the repair issue we encountered.
Manager 3.5.0, the new minor release, is now the default version used in tests. Together with that, Scylla 2025.1 is added to the manager_versions.yaml config and becomes the default Scylla version to test with Manager 3.5.0. The test_sdcm_mgmt_common.py tests have been updated accordingly.
In the previous implementation, the test tried to set rf=2 for each DC while one of the DCs had only one node. As a result, the test failed with an error (1) like "Datacenter us-west-2scylla_node_west doesn't have enough token-owning nodes for replication_factor=2". The new approach dynamically determines the number of nodes per DC and uses this value as the DC's replication factor. refs: 1. https://jenkins.scylladb.com/job/manager-3.5/job/ubuntu22-sanity-test/1/
Also, can you run a 4h longevity with the Manager nemeses, i.e. repair, backup and restore? And if this is backported, it should be tested at least in this PR, and maybe again on the backport PR.
In addition to the version, some error messages that changed in 3.5.0 need to be updated.
Triggered Manager ops specific Nemesis:
SCT redefines SnitchConfiguration here if Rerunning with
Why is this an issue? Is a backup bound to a rack as well as the DC?
So Manager 3.5 won't work with simulated racks?
When Manager does schema restore, it applies the schema from a backup snapshot (
where replication is defined as At the same time, the cluster under test has dc/racks configuration defined in
where dc name is eu-west-1. As a result, we have dc names mismatch and the Manager fails to restore the schema reporting
It's not related to this particular Manager 3.5 release but to the fact whether we use To confirm it, rerunning the previous test with
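To make the mismatch described above concrete, here is a minimal sketch of a pre-restore check that compares the DC names in a keyspace's replication map with the DC names the cluster actually has. The function name and the backup DC name `hypothetical-backup-dc` are assumptions for illustration only (the real name from the snapshot is not shown in this thread); only `eu-west-1` comes from the discussion. It is not how Manager or SCT performs the check.

```python
def missing_datacenters(schema_replication, cluster_dcs):
    """Return DC names referenced by the schema but absent from the cluster.

    schema_replication: replication options of a keyspace as a dict,
    e.g. {"class": "NetworkTopologyStrategy", "<dc name>": <rf>}.
    cluster_dcs: set of DC names reported by the cluster under test.
    """
    return sorted(
        dc for dc in schema_replication
        if dc != "class" and dc not in cluster_dcs
    )


# Hypothetical replication map taken from a backup snapshot.
backup_replication = {
    "class": "NetworkTopologyStrategy",
    "hypothetical-backup-dc": 3,
}
# DC name of the cluster under test, as mentioned in the thread.
cluster_dcs = {"eu-west-1"}

unknown = missing_datacenters(backup_replication, cluster_dcs)
if unknown:
    print(f"Schema restore would reference unknown DCs: {unknown}")
```

A non-empty result means the schema from the snapshot cannot be applied as-is to the target cluster.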
@pehala Could you please send me some links to Nemesis jobs used in release testing where we run
This is a problem regardless, we will switch to using simulated_racks as default very soon, so we need to fix this incompatibility with a high degree of importance. Please open an issue for it. But given it is not tied to Manager 3.5, I think we can continue with merging this and resolve the simulated_racks issue separately.
Just to prioritize it properly, when are you going to switch: in one week, one month, or a couple of months?
Was aiming for this week. We can discuss how to proceed once you create the issue and we know what is actually the problem.
The issue is here (scylladb/scylla-manager#4346).
LGTM
@vponomaryov Could you please merge given it is approved and passing all the checks?
Closes scylladb/scylla-manager#4343
Manager tests, configuration files, and CI updates to introduce the new Manager version 3.5.0 into SCT.
Manager 3.5.0 is the default version used in tests
Manager 3.5.0, the new minor release, is now the default version used in tests.
Together with that, Scylla 2025.1 is added to the manager_versions.yaml config and becomes the default Scylla version to test with Manager 3.5.0.
Changed the Manager version to upgrade from to 3.4.*
Since Manager 3.5 is out, the upgrade tests now need to start from 3.4.* versions.
Manager 3.4.1 and 3.4.2 are the two versions currently used in production in Cloud.
Covered Scylla 2024.1/2024.2 in some of the Manager jobs
Scylla 2023.1 is not officially supported by the latest Manager release. Because of that, the older enterprise jobs are switched to run with 2024.1. All Debian jobs are set to run with 2024.2 to keep coverage for this release after the majority of jobs switch to 2025.1.
Get rf dynamically for repair test with multiDC cluster
In the previous implementation, the test tried to set rf=2 for each DC while one of the DCs had only one node.
As a result, the test failed with an error like:
Datacenter us-west-2scylla_node_west doesn't have enough token-owning nodes for replication_factor=2
The new approach dynamically determines the number of nodes per DC and uses this value as the DC's replication factor.
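The idea can be sketched roughly as follows. This is not the SCT implementation: the helper names, the keyspace name `ks`, and the DC name `us-eastscylla_node_east` are assumptions made for illustration; only `us-west-2scylla_node_west` appears in the error above. The sketch counts nodes per DC and uses each count as that DC's replication factor, so a single-node DC gets rf=1 instead of a hard-coded rf=2.

```python
from collections import Counter


def replication_factor_per_dc(node_dcs):
    """Map each DC name to the number of nodes it holds.

    node_dcs: iterable with one DC name per node in the cluster.
    """
    return dict(Counter(node_dcs))


def build_keyspace_cql(keyspace, node_dcs):
    """Build a CREATE KEYSPACE statement with rf equal to the node count per DC."""
    rf = replication_factor_per_dc(node_dcs)
    options = ", ".join(f"'{dc}': {count}" for dc, count in sorted(rf.items()))
    return (
        f"CREATE KEYSPACE IF NOT EXISTS {keyspace} WITH replication = "
        f"{{'class': 'NetworkTopologyStrategy', {options}}}"
    )


# Hypothetical topology: two nodes in one DC, a single node in the other.
node_dcs = [
    "us-eastscylla_node_east",
    "us-eastscylla_node_east",
    "us-west-2scylla_node_west",
]
print(build_keyspace_cql("ks", node_dcs))
```

Setting rf to the full node count is the simplest choice that always satisfies the "enough token-owning nodes" constraint; a test that needs a lower rf could cap the value instead.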
Testing
PR pre-checks (self review)
backport
labels