improvement(nemesis): speed up nemesis method discovering #10551

vponomaryov · 2025-04-01T09:22:27Z

Before, calling get_list_of_disrupt_methods method it was spending ~170ms for each nemesis class inspection.
And we have about hundred of such classes which get parsed.

So, speed it up about 2.7 times by making the following changes:

Inspect source code just once for whole module - before the class processing loop.
Compile regexes before the class processing loop.

As a result, it started taking ~5.8s instead of 15.5s.

Testing

[ ]

PR pre-checks (self review)

I added the relevant backport labels
I didn't leave commented-out/debugging code

Reminders

Add New configuration option and document them (in sdcm/sct_config.py)
Add unit tests to cover my changes (under unit-test/ folder)
Update the Readme/doc folder relevant to this change (if needed)

Before, calling 'get_list_of_disrupt_methods' method it was spending ~170ms for each nemesis class inspection. And we have about hundred of such classes which get parsed. So, speed it up about 2.7 times by making the following changes: - Inspect source code just once for whole module - before the class processing loop. - Compile regexes before the class processing loop. As a result, it started taking ~5.8s instead of 15.5s.

vponomaryov · 2025-04-01T09:22:53Z

Inspired by the following change:

improvement(nemesis_jobs): Rework nemesis jobs generation #10385

pehala · 2025-04-01T09:27:11Z

I think we can do even better by searching only in disrupt method

pehala · 2025-04-01T09:31:09Z

https://github.com/scylladb/scylla-cluster-tests/pull/10502/files#diff-594b8a1eab5ee7e3ca8fa60f868a966d96d6cb570586215d03201e4ae30d8a7aR16 This is the change I did, which improved the time to miliseconds

vponomaryov · 2025-04-01T09:54:45Z

https://github.com/scylladb/scylla-cluster-tests/pull/10502/files#diff-594b8a1eab5ee7e3ca8fa60f868a966d96d6cb570586215d03201e4ae30d8a7aR16 This is the change I did, which improved the time to miliseconds

Nice, but how do you measure it?
The 15.5, as well as 5.8 seconds - it is call of whole command in CLI which includes python init.
Saying milliseconds you definitely calculate different things, because only python process + env init takes more.

pehala · 2025-04-01T11:37:51Z

Nice, but how do you measure it?
The 15.5, as well as 5.8 seconds - it is call of whole command in CLI which includes python init.
Saying milliseconds you definitely calculate different things, because only python process + env init takes more.

You are absolutely right, my initial measurements were for the discovery part alone (and were not precise even at that). I redid the measurements by extracting the patch into pehala@75a1736. I used time to measure command speed:

Before:
time pytest unit_tests/test_nemesis_sisyphus.py

real    0m13,516s
user    0m12,883s
sys     0m0,541s

After:
time pytest unit_tests/test_nemesis_sisyphus.py

real    0m2,535s
user    0m2,203s
sys     0m0,322s

fruch · 2025-04-01T15:18:16Z

Why not do both things ?

pehala · 2025-04-01T15:20:07Z

Why not do both things ?

We can, lets see the improvement when we combine both.

vponomaryov · 2025-04-02T13:34:28Z

Why not do both things ?

We can, lets see the improvement when we combine both.

The referenced change

https://github.com/scylladb/scylla-cluster-tests/pull/10502/files#diff-594b8a1eab5ee7e3ca8fa60f868a966d96d6cb570586215d03201e4ae30d8a7aR16 This is the change I did, which improved the time to miliseconds

Deletes the code that gets updated here and implements new approach.
So, how exactly both are expected to be used?

fruch · 2025-04-02T13:51:08Z

Why not do both things ?

We can, lets see the improvement when we combine both.

The referenced change

https://github.com/scylladb/scylla-cluster-tests/pull/10502/files#diff-594b8a1eab5ee7e3ca8fa60f868a966d96d6cb570586215d03201e4ae30d8a7aR16 This is the change I did, which improved the time to miliseconds

Deletes the code that gets updated here and implements new approach. So, how exactly both are expected to be used?

you removed one call to get_disrupt_method_from_class, the other two place can still benfit from @pehala suggestion

vponomaryov · 2025-04-02T13:54:32Z

Why not do both things ?

We can, lets see the improvement when we combine both.

The referenced change

https://github.com/scylladb/scylla-cluster-tests/pull/10502/files#diff-594b8a1eab5ee7e3ca8fa60f868a966d96d6cb570586215d03201e4ae30d8a7aR16 This is the change I did, which improved the time to miliseconds

Deletes the code that gets updated here and implements new approach. So, how exactly both are expected to be used?

you removed one call to get_disrupt_method_from_class, the other two place can still benfit from @pehala suggestion

The #10502 just removed the code I update here.
So, I am saying that if we decide to use the #10502 then this PR is not needed because it's code will be just deleted.

vponomaryov · 2025-04-23T15:24:55Z

Superseded by the following change:

improvement(nemesis): Rework nemesis discovery #10502

github-actions bot assigned vponomaryov Apr 1, 2025

vponomaryov added the backport/2025.1 label Apr 1, 2025

vponomaryov requested review from fruch and pehala April 1, 2025 09:23

vponomaryov mentioned this pull request Apr 1, 2025

improvement(nemesis_jobs): Rework nemesis jobs generation #10385

Merged

3 tasks

vponomaryov closed this Apr 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

improvement(nemesis): speed up nemesis method discovering #10551

improvement(nemesis): speed up nemesis method discovering #10551

Uh oh!

vponomaryov commented Apr 1, 2025 •

edited

Loading

Uh oh!

vponomaryov commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

vponomaryov commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

fruch commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

vponomaryov commented Apr 2, 2025

Uh oh!

fruch commented Apr 2, 2025

Uh oh!

vponomaryov commented Apr 2, 2025

Uh oh!

vponomaryov commented Apr 23, 2025

Uh oh!

Uh oh!

improvement(nemesis): speed up nemesis method discovering #10551

improvement(nemesis): speed up nemesis method discovering #10551

Uh oh!

Conversation

vponomaryov commented Apr 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing

PR pre-checks (self review)

Reminders

Uh oh!

vponomaryov commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

vponomaryov commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

fruch commented Apr 1, 2025

Uh oh!

pehala commented Apr 1, 2025

Uh oh!

vponomaryov commented Apr 2, 2025

Uh oh!

fruch commented Apr 2, 2025

Uh oh!

vponomaryov commented Apr 2, 2025

Uh oh!

vponomaryov commented Apr 23, 2025

Uh oh!

Uh oh!

vponomaryov commented Apr 1, 2025 •

edited

Loading