feature(perf): support latte in gradual grow throughput test #10901

vponomaryov · 2025-05-15T10:33:47Z

Gradual throughput test uses several substitution variables and 2 of them,
called $threads and $throttle, are not compatible as-is with the latte benchmarking tool.

First, Latte has --threads=N and --concurrency=M parameters
whereas cassandra-stress has only --threads.
So, latte's values must be multiplied to get CS's value.

Second, latte uses --rate=100 format not fixed=100/s or throttle=100s like CS does.

Knowing above, add appropriate parsing of the latte commands
to support existing substitution values.

Example:

Following CS command:

  cassandra-stress mixed cl=QUORUM duration=$duration -mode cql3 native \
    -rate 'threads=$threads $throttle' \
    -col 'size=FIXED(1024) n=FIXED(1)' -pop seq=15000001..20000000

Can be simulated by the following Latte command:

  latte run --function=write,read --tag=mixed --sampling=10s --duration=$duration \
  $throttle --threads=7 --concurrency=$threads --consistency=QUORUM \
  --start-cycle 15000001 --end-cycle 20000000 \
  -P key_size=10 -P column_size=1024 -P column_count=1 -P row_count=20000000 \
  data_dir/latte/latte_cs_alike.rn

In case of Latte we specify --threads explicitly as (N-1) value where N is number of CPU cores on loader nodes.

Testing

scylla-staging/valerii/vp-perf-regression-predefined-throughput-steps-rust-vnodes#11

PR pre-checks (self review)

I added the relevant backport labels
I didn't leave commented-out/debugging code

Reminders

Add New configuration option and document them (in sdcm/sct_config.py)
Add unit tests to cover my changes (under unit-test/ folder)
Update the Readme/doc folder relevant to this change (if needed)

Copilot

Pull Request Overview

This PR introduces support for the Latte benchmarking tool in the gradual throughput tests by parsing its unique command line options for threads and rate.

Added a regex-based function to extract latte thread parameters.
Introduced a helper function to adjust the CS thread count based on latte threads.
Updated throttle formatting logic to distinguish between Latte and legacy CS commands.

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
sdcm/stress/latte_thread.py	Added regex and extraction function to determine Latte's thread count.
performance_regression_gradual_grow_throughput.py	Integrated Latte thread count logic and updated throttle handling in test steps.

performance_regression_gradual_grow_throughput.py

vponomaryov · 2025-05-15T10:58:02Z

Tested here: scylla-staging/valerii/vp-perf-regression-predefined-throughput-steps-rust-vnodes#11

Reason for the changes is here: https://github.com/scylladb/qa-tasks/issues/1659

soyacz

IIUC currently, still user needs to watch how many CPUs loaders have.

for me, the ultimate solution would be to introduce 'latte_cs_thread' that could parse prefixed by latte c-s command like: latte <full cassandra stress command>

This way, the thread could take proper rune scripts, translate c-s command to latte one and we wouldn't have to learn new syntax for running tests with latte (when need c-s like workload). WDYT?

vponomaryov · 2025-05-16T11:35:58Z

IIUC currently, still user needs to watch how many CPUs loaders have.

It is only about efficiency. And it is very easy. Instance type is known, CPU core count is known, do (n - 1) for high rate commands.

for me, the ultimate solution would be to introduce 'latte_cs_thread' that could parse prefixed by latte c-s command like: latte <full cassandra stress command>

It will limit all the flexibility latte provides.

This way, the thread could take proper rune scripts,

Each separate rune script has unique set of parameters for schema creation and for queries.

translate c-s command to latte one and we wouldn't have to learn new syntax for running tests with latte (when need c-s like workload). WDYT?

Parsing of CS command into latte is not really good idea for performance scenarios.
It would be ok only for some load in longevities.

And, anyway, potential feature of parsing CS into latte should be a separate PR.
It should not be a blocker for this one.

Gradual throughput test uses several substitution variables and 2 of them, called "$threads" and "$throttle", are not compatible "as-is" with the latte benchmarking tool. First, Latte has "--threads=N" and "--concurrency=M" parameters whereas cassandra-stress has only "--threads". So, latte's values must be multiplied to get CS's value. Second, latte uses "--rate=100" not "fixed=100/s" or "throttle=100s" like CS does. Knowing above, add appropriate parsing of the latte command to support existing substitution values. Example: Following CS command: cassandra-stress mixed cl=QUORUM duration=$duration -mode cql3 native \ -rate 'threads=$threads $throttle' \ -col 'size=FIXED(1024) n=FIXED(1)' -pop seq=15000001..20000000 Can be simulated by the following Latte command: latte run --function=write,read --tag=mixed --sampling=10s --duration=$duration \ $throttle --threads=7 --concurrency=$threads --consistency=QUORUM --start-cycle 15000001 --end-cycle 20000000 \ -P key_size=10 -P column_size=1024 -P column_count=1 -P row_count=20000000 data_dir/latte/latte_cs_alike.rn In case of Latte we specify "--threads" explicitly as (N-1) value where N is number of CPU cores on loader nodes.

vponomaryov · 2025-05-27T15:55:07Z

@fruch , @soyacz
So, what do we do with this?

fruch · 2025-05-27T16:51:44Z

@fruch , @soyacz
So, what do we do with this?

I'm not sure we want this computation

We will need to experiment with those parameters, and set them optimally for each workload, 1:1 mapping to the thread numbers in c-s isn't exactly what is needed.

soyacz · 2025-05-28T07:21:15Z

Sorry for the late reply,

for me, the ultimate solution would be to introduce 'latte_cs_thread' that could parse prefixed by latte c-s command like: latte <full cassandra stress command>

It will limit all the flexibility latte provides.

I'm not saying to replace LatteStressThread, only add another wrapper just for c-s compatibility at SCT level. Still could use current approach for complex cases.

This way, the thread could take proper rune scripts,

Each separate rune script has unique set of parameters for schema creation and for queries.

That's the hard part (at the beginning) and this could alleviate it. People don't know how things work in latte and need some learning to use it properly.

translate c-s command to latte one and we wouldn't have to learn new syntax for running tests with latte (when need c-s like workload). WDYT?

Parsing of CS command into latte is not really good idea for performance scenarios. It would be ok only for some load in longevities.

some load is actually what we do in most of the cases.

vponomaryov · 2025-05-28T10:16:35Z

@fruch , @soyacz
So, what do we do with this?

I'm not sure we want this computation

We will need to experiment with those parameters, and set them optimally for each workload, 1:1 mapping to the thread numbers in c-s isn't exactly what is needed.

This computation is made just to keep compatibility with existing interfaces for configuration.
Then, the number of latte worker threads is configured explicitly, this is already huge diff.

Then, we can update the number of CS threads which get considered in calculation for latte configurations.
So, I see only gains here:

The interface compatibility is kept
We still can configure any number of latte workers/threads and concurrency for it

for me, the ultimate solution would be to introduce 'latte_cs_thread' that could parse prefixed by latte c-s command like: latte <full cassandra stress command>

It will limit all the flexibility latte provides.

I'm not saying to replace LatteStressThread, only add another wrapper just for c-s compatibility at SCT level. Still could use current approach for complex cases.

I don't mind to have such a wrapper, just not in scope of this PR.

This way, the thread could take proper rune scripts,

Each separate rune script has unique set of parameters for schema creation and for queries.

That's the hard part (at the beginning) and this could alleviate it. People don't know how things work in latte and need some learning to use it properly.

translate c-s command to latte one and we wouldn't have to learn new syntax for running tests with latte (when need c-s like workload). WDYT?

Parsing of CS command into latte is not really good idea for performance scenarios. It would be ok only for some load in longevities.

some load is actually what we do in most of the cases.

As said above, I am ok to have just in a separate task/PR.

github-actions bot assigned vponomaryov May 15, 2025

vponomaryov added backport/perf-v15 backport/perf-v16 backport/2025.1 backport/2025.2 labels May 15, 2025

vponomaryov requested a review from Copilot May 15, 2025 10:34

Copilot AI reviewed May 15, 2025

View reviewed changes

performance_regression_gradual_grow_throughput.py Outdated Show resolved Hide resolved

performance_regression_gradual_grow_throughput.py Outdated Show resolved Hide resolved

vponomaryov force-pushed the support-latte-for-gradual-perf-tests branch from d121a79 to 8802a35 Compare May 15, 2025 10:55

vponomaryov requested review from fruch, soyacz and juliayakovlev May 15, 2025 10:56

vponomaryov mentioned this pull request May 15, 2025

ci(perf): add gradual thrpt testing CI job using latte #10903

Open

2 tasks

soyacz reviewed May 16, 2025

View reviewed changes

vponomaryov force-pushed the support-latte-for-gradual-perf-tests branch from 8802a35 to 76f3804 Compare May 16, 2025 12:56

vponomaryov force-pushed the support-latte-for-gradual-perf-tests branch from 76f3804 to f3ff8bd Compare May 26, 2025 13:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feature(perf): support latte in gradual grow throughput test #10901

feature(perf): support latte in gradual grow throughput test #10901

Uh oh!

vponomaryov commented May 15, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

vponomaryov commented May 15, 2025

Uh oh!

soyacz left a comment

Uh oh!

vponomaryov commented May 16, 2025

Uh oh!

vponomaryov commented May 27, 2025

Uh oh!

fruch commented May 27, 2025

Uh oh!

soyacz commented May 28, 2025

Uh oh!

vponomaryov commented May 28, 2025

Uh oh!

Uh oh!

feature(perf): support latte in gradual grow throughput test #10901

Are you sure you want to change the base?

feature(perf): support latte in gradual grow throughput test #10901

Uh oh!

Conversation

vponomaryov commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing

PR pre-checks (self review)

Reminders

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

vponomaryov commented May 15, 2025

Uh oh!

soyacz left a comment

Choose a reason for hiding this comment

Uh oh!

vponomaryov commented May 16, 2025

Uh oh!

vponomaryov commented May 27, 2025

Uh oh!

fruch commented May 27, 2025

Uh oh!

soyacz commented May 28, 2025

Uh oh!

vponomaryov commented May 28, 2025

Uh oh!

Uh oh!

vponomaryov commented May 15, 2025 •

edited

Loading