Continuous Benchmarking using Github Actions #2134

pablo-gf · 2025-05-05T16:32:25Z

This PR aims to provide a new benchmarking approach for liboqs. It uses the Continuous Benchmarking action from Marketplace, like the mlkem-native repository. For speed benchmarking of the current algorithms in the library, 3 new files are included:

scripts/parse_liboqs_speed.py : Retrieves the benchmarking data from speed_kem and speed_sig and outputs it in a json file that matches the format required by the continuous benchmarking action.
workflows/kem-bench.yml: Iterates through the different KEM algorithms executing the speed test and gathering its information using parse_liboqs_speed.py. It then pushes the output json file to a gh-pages branch using the Continuous Benchmarking action.
workflows/sig-bench.yml: Same kem-bench.yml but for signature algorithms.

To complete the benchmarking, it is required to create a new gh-pages branch so that the workflows generate and continuously update a Github page with the visualization of the benchmarking results. I have adapted the html file to include some additional features here . I can include these changes once the new gh-pages branch for liboqs is set up. You can see an example of what the final output should look like here.

Let me know if you have any questions or suggestions!

Does this PR change the input/output behaviour of a cryptographic algorithm (i.e., does it change known answer test values)? (If so, a version bump will be required from x.y.z to x.(y+1).0.)
Does this PR change the list of algorithms available -- either adding, removing, or renaming? Does this PR otherwise change an API? (If so, PRs in fully supported downstream projects dependent on these, i.e., oqs-provider will also need to be ready for review and merge by the time this is merged.)

Signed-off-by: Pablo Gutiérrez Félix <[email protected]>

.github/workflows/kem-bench.yml

.github/workflows/sig-bench.yml

SWilson4

Thanks for your patience, @pablo-gf! I've taken a look at the GitHub warnings; I think I have an idea how to resolve each of them.

.github/workflows/kem-bench.yml

dstebila · 2025-05-20T17:30:50Z

I think it would be good if the build & configuration information that's currently in the expandable "Latest commit build information" is displayed directly at the top of the page.

In the pop-ups that show when you hover over a datapoint, it looks like all the commits have been authored by you. Is that placeholder information? Or should something else be showing here?

Signed-off-by: Pablo Gutiérrez <[email protected]>

pablo-gf · 2025-05-21T16:34:34Z

@dstebila I have added the build information at the top, let me know if that works: https://pablo-gf.github.io/liboqs/dev/bench/. As for your second comment, that is placeholder information. The idea is that those pop-ups will contain the details of each commit made to the library starting from the first commit after this continuous benchmarking framework is deployed.

.github/workflows/kem-bench.yml

dstebila · 2025-05-21T17:03:39Z

@dstebila I have added the build information at the top, let me know if that works: https://pablo-gf.github.io/liboqs/dev/bench/. As for your second comment, that is placeholder information. The idea is that those pop-ups will contain the details of each commit made to the library starting from the first commit after this continuous benchmarking framework is deployed.

Looks good, thanks!

Signed-off-by: Pablo Gutiérrez <[email protected]>

pablo-gf · 2025-05-23T11:47:12Z

@SWilson4 I fixed the security warnings that popped-up after my last commit. Let me know if you have any comments or suggestions. As I mentioned at the beginning, to make the entire process work we would also need to create a new gh-pages branch so that the workflows generate and continuously update a Github page with the visualization of the benchmarking results.

SWilson4 · 2025-05-23T20:45:48Z

Thanks for the updates, @pablo-gf! Are you able to merge this branch into main of your fork so we can test the commit-to-main flow and see if it's working as expected?

Signed-off-by: Pablo Gutiérrez <[email protected]>

pablo-gf · 2025-05-27T15:08:33Z

@SWilson4 @dstebila The tests now run successfully in the main branch of my fork (except for the basic downstream tests, as expected). I had to adjust a couple of minor details. Let me know if you have any suggestions before moving this PR to ready for review.

SWilson4

LGTM. @pablo-gf is there anything remaining to be done from an admin point of view to enable this?

pablo-gf · 2025-06-11T13:17:29Z

Thank you @SWilson4! What's left is to create a branch called gh-pages so that the benchmarking results are posted there.

SWilson4 · 2025-06-11T13:23:25Z

Thank you @SWilson4! What's left is to create a branch called gh-pages so that the benchmarking results are posted there.

Done! I also set up branch protection so that we don't accidentally delete it.

pablo-gf · 2025-06-11T13:31:16Z

Thank you @SWilson4! What's left is to create a branch called gh-pages so that the benchmarking results are posted there.

Done! I also set up branch protection so that we don't accidentally delete it.

Awesome! Should be good to merge now.

SWilson4 · 2025-06-11T13:38:38Z

Merging, thanks @pablo-gf for the contribution!

dstebila · 2025-06-12T14:51:20Z

In a merge today we got a whole bunch of alerts about speed regression, which I think is related to this continuous benchmarking PR landing. @pablo-gf do you have any idea what's going on here?

pablo-gf · 2025-06-12T15:17:18Z

In a merge today we got a whole bunch of alerts about speed regression, which I think is related to this continuous benchmarking PR landing. @pablo-gf do you have any idea what's going on here?

Yes, @dstebila. There is an option to throw an alert if the algorithm speed decreases a certain value (I believe it's a specific percentage). I can definitely look into that if it's something we are not interested in.

dstebila · 2025-06-12T15:22:04Z

Yes, @dstebila. There is an option to throw an alert if the algorithm speed decreases a certain value (I believe it's a specific percentage). I can definitely look into that if it's something we are not interested in.

In principle I think we'd be interested in that. But the alerts being thrown in commit I linked to seem to be too sensitive. Would you be able to check how the thresholds are configured?

pablo-gf · 2025-06-13T11:44:43Z

Yes, @dstebila. There is an option to throw an alert if the algorithm speed decreases a certain value (I believe it's a specific percentage). I can definitely look into that if it's something we are not interested in.

In principle I think we'd be interested in that. But the alerts being thrown in commit I linked to seem to be too sensitive. Would you be able to check how the thresholds are configured?

Yes @dstebila , here is some information from the page:

"This action can raise an alert to the commit when its benchmark results are worse than previous exceeding a specified threshold. By default, this action marks the result as performance regression when it is worse than the previous exceeding 200% threshold. For example, if the previous benchmark result was 100 iter/ns and this time it is 230 iter/ns, it means 230% worse than the previous and an alert will happen. The threshold can be changed by alert-threshold input."

I believe the current parameters are set to alert-threshold: 50%. I set it that way to make sure it was working during testing, but let me know the value you would like it to be increased to. I believe the default value in the ml-kem repository is 103%.

SWilson4 · 2025-06-13T13:46:12Z

In the past we deemed 15% as an acceptable variation. I think I would lower it now that we have some stable algorithms in the library—a 14% performance drop might not be cause for alarm for MAYO, but it certainly would be for ML-KEM. How about setting alert-threshold to 105% for now? If we find that we're getting a bunch of warnings for more experimental algs, we can raise it further.

pablo-gf · 2025-06-16T11:09:30Z

@SWilson4 Sounds good. Would you like me to create a new PR for that?

SWilson4 · 2025-06-16T13:00:44Z

@SWilson4 Sounds good. Would you like me to create a new PR for that?

That would be great, thanks @pablo-gf.

pablo-gf added 3 commits May 5, 2025 16:23

Added workflows and script for speed beanchmarking

3de7a36

Signed-off-by: Pablo Gutiérrez Félix <[email protected]>

changed branch push to main

91aeb66

Signed-off-by: Pablo Gutiérrez Félix <[email protected]>

Added SPDX-License-Identifer

301c72a

Signed-off-by: Pablo Gutiérrez Félix <[email protected]>

github-advanced-security bot found potential problems May 5, 2025

View reviewed changes

SWilson4 reviewed May 20, 2025

View reviewed changes

Fixed github security warnings

996d826

Signed-off-by: Pablo Gutiérrez <[email protected]>

github-advanced-security bot found potential problems May 21, 2025

View reviewed changes

.github/workflows/kem-bench.yml Fixed Show fixed Hide fixed

.github/workflows/kem-bench.yml Fixed Show fixed Hide fixed

.github/workflows/kem-bench.yml Fixed Show fixed Hide fixed

Fixed github security warnings 2

2f6657a

Signed-off-by: Pablo Gutiérrez <[email protected]>

Fixes after commit-to-main tests

4f29070

Signed-off-by: Pablo Gutiérrez <[email protected]>

pablo-gf mentioned this pull request May 27, 2025

Adding benchmarking page open-quantum-safe/www#275

Merged

pablo-gf marked this pull request as ready for review June 10, 2025 14:10

pablo-gf requested review from dstebila and baentsch as code owners June 10, 2025 14:10

dstebila approved these changes Jun 10, 2025

View reviewed changes

SWilson4 approved these changes Jun 11, 2025

View reviewed changes

SWilson4 merged commit d745d35 into open-quantum-safe:main Jun 11, 2025
79 checks passed

This was referenced Jun 11, 2025

liboqs benchmarking still running 0.9.0-rc1 open-quantum-safe/profiling#110

Closed

Switch open-quantum-safe/profiling to read-only open-quantum-safe/tsc#186

Closed

pablo-gf mentioned this pull request Jun 12, 2025

Adjust HTML file for the continuous benchamarking page #2164

Merged

2 tasks

pablo-gf mentioned this pull request Jun 16, 2025

Increase alert threshold for continuous benchmarking #2166

Merged

2 tasks

Continuous Benchmarking using Github Actions #2134

Continuous Benchmarking using Github Actions #2134

Uh oh!

Conversation

pablo-gf commented May 5, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SWilson4 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dstebila commented May 20, 2025

Uh oh!

pablo-gf commented May 21, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dstebila commented May 21, 2025

Uh oh!

pablo-gf commented May 23, 2025

Uh oh!

SWilson4 commented May 23, 2025

Uh oh!

pablo-gf commented May 27, 2025

Uh oh!

SWilson4 left a comment

Choose a reason for hiding this comment

Uh oh!

pablo-gf commented Jun 11, 2025

Uh oh!

SWilson4 commented Jun 11, 2025

Uh oh!

pablo-gf commented Jun 11, 2025

Uh oh!

SWilson4 commented Jun 11, 2025

Uh oh!

Uh oh!

dstebila commented Jun 12, 2025

Uh oh!

pablo-gf commented Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dstebila commented Jun 12, 2025

Uh oh!

pablo-gf commented Jun 13, 2025

Uh oh!

SWilson4 commented Jun 13, 2025

Uh oh!

pablo-gf commented Jun 16, 2025

Uh oh!

SWilson4 commented Jun 16, 2025

Uh oh!

Uh oh!

pablo-gf commented Jun 12, 2025 •

edited

Loading