[container_benchmark] MacOS and Linux Experiments #870
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
Walkthrough

Configuration file updated with inline documentation for Docker Desktop remote-host behavior and a new field (

Changes
Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Possibly related PRs
Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)
✅ Passed checks (1 passed)
Skipping CI for Draft Pull Request.
18ad5c0 to 31fd1dd
/test cont_bench-jump-ci

/test cont_bench-jump-ci

🟢 Test of 'container_bench test test_ci' succeeded after 05 hours 42 minutes 07 seconds. 🟢 • Link to the test results. • Link to the reports index. Test configuration:

/test cont_bench-jump-ci

🟢 Test of 'container_bench test test_ci' succeeded after 04 hours 30 minutes 49 seconds. 🟢 • Link to the test results. • Link to the reports index. Test configuration:

/test cont_bench-jump-ci

🔴 Test of 'container_bench test prepare_ci' failed after 00 hours 00 minutes 01 seconds. 🔴 • Link to the test results. • No reports index generated... Test configuration:

🔴 Test of 'container_bench test prepare_ci' failed after 00 hours 00 minutes 06 seconds. 🔴 • Link to the test results. • No reports index generated... Test configuration: Failure indicator: Empty. (See run.log)

🔴 Test of 'container_bench test test_ci' failed after 03 hours 57 minutes 34 seconds. 🔴 • Link to the test results. • No reports index generated... Test configuration:

🟢 Test of 'container_bench test test_ci' succeeded after 04 hours 31 minutes 45 seconds. 🟢 • Link to the test results. • Link to the reports index. Test configuration:

/test cont_bench-jump-ci

🟢 Test of 'container_bench test test_ci' succeeded after 12 hours 29 minutes 34 seconds. 🟢 • Link to the test results. • Link to the reports index. Test configuration:
Actionable comments posted: 2
♻️ Duplicate comments (2)
projects/container_bench/toolbox/container_bench_iperf_host_to_container_benchmark/tasks/main.yml (1)

13-29: Consistent with other iperf tasks; same optional concerns apply

The `sleep_cmd` and switch to `docker.io/networkstatic/iperf3:latest` mirror the net-bridge benchmark and look consistent. The same optional considerations apply here: pinning a concrete image tag and using a readiness check instead of a fixed 5s sleep if you later want more deterministic runs.

projects/container_bench/toolbox/container_bench_iperf_net_host_benchmark/tasks/main.yml (1)

13-29: Net-host iperf changes are consistent; image/tag and sleep handling match other tasks

The cross-platform `sleep_cmd` and unified use of `docker.io/networkstatic/iperf3:latest` are consistent with the other iperf benchmarks. As with those files, you might later consider pinning a specific tag and replacing the fixed 5s sleep with a readiness check if you hit flakiness or need strict reproducibility.
🧹 Nitpick comments (4)
projects/container_bench/toolbox/container_bench_iperf_net_bridge_benchmark/tasks/main.yml (1)

13-32: Cross-platform sleep and image switch look fine; consider making them more deterministic

The `sleep_cmd` fact and switch to `docker.io/networkstatic/iperf3:latest` are consistent and will work across OSes. Two optional refinements you might consider later:

- Avoid `:latest` for reproducibility (pin a known-good tag and reference it via a single variable).
- Replace the fixed `sleep 5` / `Start-Sleep -Seconds 5` with a readiness check (`wait_for` on the TCP port, or a retry loop) to reduce flakiness on slow or loaded hosts.

projects/container_bench/testing/config.yaml (3)
111-118: iperf3 image in prepare container_images now uses `networkstatic/iperf3:latest`

This aligns with the task files and keeps a single image reference across configuration and playbooks. As elsewhere, if you later need stricter reproducibility, consider pinning a specific tag instead of `:latest`.
153-162: Increasing podman machine CPUs to 12; just ensure it matches host capacity

Bumping `prepare.podman.machine.configuration.cpus` to 12 is reasonable for heavier benchmarks, but on hosts with fewer physical cores it may either over-commit or cause podman-machine configuration/start failures. Worth double-checking that all CI machines backing this config have at least that many cores.
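As a rough sketch of that capacity check (a hypothetical helper, not part of the repo), the requested CPU count could be clamped to what the host actually has before configuring the machine:

```python
import os


def clamp_machine_cpus(requested: int) -> int:
    """Never request more podman-machine CPUs than the host actually has."""
    host_cpus = os.cpu_count() or 1  # os.cpu_count() may return None
    return min(requested, host_cpus)
```

With this in place, a `cpus: 12` setting degrades gracefully on an 8-core CI machine instead of over-committing or failing machine start.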
195-235: Container benchmark runs increased to 10; verify CI runtime and noise

Raising all container benchmark `runs` from 1 to 10 will materially increase total test time, which is great for smoothing variance but may stress CI timeouts or resource usage. Make sure your CI presets and time budgets are adjusted so this doesn't introduce flakes from timeouts.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (6)
- projects/container_bench/testing/config.yaml (7 hunks)
- projects/container_bench/testing/container_engine.py (2 hunks)
- projects/container_bench/toolbox/container_bench_capture_system_state/tasks/windows.yml (1 hunks)
- projects/container_bench/toolbox/container_bench_iperf_host_to_container_benchmark/tasks/main.yml (1 hunks)
- projects/container_bench/toolbox/container_bench_iperf_net_bridge_benchmark/tasks/main.yml (1 hunks)
- projects/container_bench/toolbox/container_bench_iperf_net_host_benchmark/tasks/main.yml (1 hunks)
🧰 Additional context used
🧬 Code graph analysis (1)
projects/container_bench/testing/container_engine.py (1)
projects/container_bench/testing/config_manager.py (3)
ConfigManager (82-263), get_podman_machine_config (154-166), is_windows (92-93)
🔇 Additional comments (4)
projects/container_bench/toolbox/container_bench_capture_system_state/tasks/windows.yml (1)
18-22: Windows system version formatting looks good

Using `OsName` and `OsVersion` from `Get-ComputerInfo` with `WindowsBuildLabEx` is a clear, stable way to capture the Windows OS string and keeps the format aligned with the macOS-style summary.

projects/container_bench/testing/config.yaml (3)
59-62: Docker Desktop remote-host comment is a useful clarification

The added note about `base_work_dir` vs `home_is_base_work_dir` for Docker Desktop remote runs clarifies an easy-to-miss requirement; no changes needed.
300-321: Podman repo_version matrix and matbenchmarking wiring look consistent; confirm versions

Adding `test.podman.repo_version` values (`v5.4.2`, `v5.7.0`) and including them in `iterable_test_fields`, mapped to `prepare.podman.repo.version`, is structurally consistent with the existing matbenchmarking setup. Please just confirm that:

- These versions exist in the referenced podman repo/artifacts, and
- The templated path `" {@prepare.podman.repo.version}.zip"` matches the actual filenames.
341-350: `generate_plots_from_pr_args: true`; ensure downstream tooling expects this

Enabling `generate_plots_from_pr_args` here is fine; just verify the visualization/CI entrypoint reads this flag and behaves correctly when it's true (especially for non-PR or local runs).
```python
def stop(self):
    cmd = f"{self.get_cmd_env()} {get_podman_binary(self.base_work_dir)} machine stop {self.machine_name}"
    if ConfigManager.is_windows():
        # Immediate end of ssh session may lead to machine not stopping properly on Windows.
        cmd = f"{cmd}; Start-Sleep -Seconds 30"
    machine_config = ConfigManager.get_podman_machine_config()
    is_wsl = machine_config['env_containers_machine_provider'] == "wsl"
    if ConfigManager.is_windows() and is_wsl:
        # There is a bug in WSL that doesn't unlock ports after stopping usage of that port.
        # https://github.com/microsoft/WSL/issues/10601
        cmd = f"{cmd}; wsl --shutdown"
    remote_access.run_with_ansible_ssh_conf(self.base_work_dir, cmd)
```
🧩 Analysis chain
PodmanMachine.stop now shuts down all WSL; confirm that this side effect is acceptable
The new logic detects `env_containers_machine_provider == "wsl"` and, on Windows, appends `wsl --shutdown` to the stop command.
This will fix the WSL port‑unlock bug on dedicated CI machines, but it also shuts down every running WSL distro on that host, which can impact any other workloads using WSL.
Consider gating this behind a dedicated config flag or at least logging very prominently that a full WSL shutdown is occurring, so it’s an explicit opt‑in behaviour for environments where WSL is shared.
I'll verify the review comment by examining the codebase context and checking the related code changes.
---
🏁 Script executed:

```shell
#!/bin/bash
# Find and inspect container_engine.py around the PodmanMachine.stop() method
fd -n "container_engine.py" --type f -x cat {} | head -300
```

Length of output: 230

🏁 Script executed:

```shell
#!/bin/bash
# Search for DockerDesktopMachine class and its stop() method
rg -n "class DockerDesktopMachine" -A 100 | head -150
```

Length of output: 4980

🏁 Script executed:

```shell
#!/bin/bash
# Search for all occurrences of wsl --shutdown in the codebase
rg -n "wsl --shutdown" -B 3 -A 3
```

Length of output: 1602

🏁 Script executed:

```shell
#!/bin/bash
# Check for ConfigManager usage and wsl-related config flags
rg -n "env_containers_machine_provider\|wsl" -C 2 --type py
```

Length of output: 0

🏁 Script executed:

```shell
#!/bin/bash
# Search for the stop() method implementations in machine-related classes
rg -n "def stop\(" -B 2 -A 10 --type py | head -200
```

Length of output: 6354
I need to dig deeper into the configuration and deployment context to fully validate the review comment.
---
🏁 Script executed:

```shell
#!/bin/bash
# Search for ConfigManager and get_podman_machine_config implementation
rg -n "class ConfigManager\|def get_podman_machine_config" -A 20 --type py | head -100
```

Length of output: 0

🏁 Script executed:

```shell
#!/bin/bash
# Look for any environment variables or flags controlling wsl shutdown behavior
rg -n "wsl\|WSL" --type py -C 3 | grep -i "env\|flag\|config\|disable\|opt" | head -50
```

Length of output: 0

🏁 Script executed:

```shell
#!/bin/bash
# Check where PodmanMachine and DockerDesktopMachine are instantiated/used
rg -n "PodmanMachine\|DockerDesktopMachine" --type py | grep -v "class " | head -30
```

Length of output: 0

🏁 Script executed:

```shell
#!/bin/bash
# Look for any comments or documentation about deployment context
rg -n "CI\|shared\|dedicated\|host" projects/container_bench/testing/container_engine.py -C 2 | head -50
```

Length of output: 0

🏁 Script executed:

```shell
#!/bin/bash
# Check if there's a way to disable or control the wsl shutdown behavior
rg -n "shutdown\|SHUTDOWN" --type py -C 2 | head -50
```

Length of output: 0
Based on the code verification, I found the review comment is partially inaccurate. Let me provide the corrected version:
DockerDesktopMachine.stop() unconditionally shuts down all WSL distros on Windows; PodmanMachine.stop() is properly gated
The implementations diverge significantly:
- PodmanMachine.stop() (lines 259-267): properly gated; appends `wsl --shutdown` only when `ConfigManager.is_windows()` AND `is_wsl` (machine_config provider check). Safer than the review comment suggested.
- DockerDesktopMachine.stop() (lines 341-347): unconditionally appends `wsl --shutdown` on any Windows host (`if ConfigManager.is_windows()` only), with no provider validation. This assumes Docker Desktop always uses WSL on Windows and will shut down all distros regardless.
The core risk remains valid: `wsl --shutdown` affects every running WSL distro on the host. However:

- PodmanMachine is correctly scoped to its WSL provider only.
- DockerDesktopMachine lacks this safeguard and poses the actual cross-workload risk. Consider adding a provider check or explicit config flag to gate this behavior on Windows.
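A minimal sketch of the suggested gating, assuming a hypothetical `allow_wsl_shutdown` config flag and helper name (neither exists in the repo), could look like:

```python
def build_stop_cmd(base_cmd: str, is_windows: bool, provider: str,
                   allow_wsl_shutdown: bool = False) -> str:
    """Append Windows-specific teardown steps to a machine-stop command."""
    cmd = base_cmd
    if is_windows:
        # Give the machine time to stop before the ssh session ends.
        cmd = f"{cmd}; Start-Sleep -Seconds 30"
    if is_windows and provider == "wsl" and allow_wsl_shutdown:
        # Opt-in only: this shuts down EVERY WSL distro on the host.
        cmd = f"{cmd}; wsl --shutdown"
    return cmd
```

With `allow_wsl_shutdown` defaulting to False, a full WSL shutdown becomes an explicit opt-in for environments where WSL is shared, and the provider check keeps Hyper-V-backed machines unaffected.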
Signed-off-by: Jan Rodák <[email protected]>
Summary by CodeRabbit
New Features
Documentation