feat(garak): Add integration tests for Garak remote provider by saichandrapandraju · Pull Request #2 · kpunwatk/opendatahub-tests

saichandrapandraju · 2026-03-08T04:23:06Z

Implement comprehensive integration tests for the remote mode of the llama_stack_garak_provider across three tiers:

- smoke (TestGarakRemoteQuickScan): predefined quick benchmark registration, eval job submission, status polling, and result retrieval
- tier1 (TestGarakRemoteCustomBenchmark): custom benchmark with explicit garak_config metadata, probe selection, and result validation
- tier2 (TestGarakRemoteShieldScan): shield registration with FMS guardrails orchestrator, benchmark with shield_ids, and shielded eval execution
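
The submit, poll, and retrieve flow that the smoke tier exercises can be sketched roughly as follows. This is a minimal illustration, not the PR's actual utility: `wait_for_job`, the `get_status` callable, and the status names in `TERMINAL_STATES` are hypothetical placeholders, and the real provider's job states may differ.

```python
import logging
import time
from typing import Callable

LOGGER = logging.getLogger(__name__)

# Hypothetical status names; the real provider's job states may differ.
TERMINAL_STATES = {"completed", "failed", "cancelled"}


def wait_for_job(
    get_status: Callable[[], str],
    timeout: float = 600.0,
    interval: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_status() until it returns a terminal state or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        status = get_status()
        # Log every poll so a stuck job is visible in the test output.
        LOGGER.info("eval job status: %s", status)
        if status in TERMINAL_STATES:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job still '{status}' after {timeout}s")
        sleep(interval)
```

A test would pass a closure that queries the job status through the client and then assert on the returned state before fetching results. The `sleep` parameter is injectable only so the loop can be exercised without real waiting.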

Key changes:

- Support distribution_image override in llama_stack_server_config fixture to use specific LlamaStack 0.5.x images
- Pre-generate CR names for consistent LlamaStack service URL construction
- Add deployment namespace to NetworkPolicy allowedFrom for KFP pod access
- Add guardrails orchestrator service URL fixture for in-cluster communication
- Use provider-qualified model IDs (vllm-inference/<model>) for LlamaStack 0.5.x
- Add eval job utilities with enhanced status logging and result validation
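
Two of the changes above, provider-qualified model IDs and a service URL derived from a pre-generated CR name, can be illustrated with a small sketch. The helper names, the `-service` suffix, and port 8321 are assumptions made for the example and are not taken from the PR; only the `<service>.<namespace>.svc.cluster.local` form is standard Kubernetes DNS.

```python
import uuid


def qualify_model_id(provider_id: str, model_name: str) -> str:
    """Return '<provider_id>/<model_name>', leaving already-qualified IDs untouched."""
    prefix = f"{provider_id}/"
    return model_name if model_name.startswith(prefix) else prefix + model_name


def generate_cr_name(prefix: str = "llama-stack") -> str:
    """Hypothetical helper: pick the CR name up front so URLs can be built before deployment."""
    return f"{prefix}-{uuid.uuid4().hex[:8]}"


def in_cluster_service_url(service_name: str, namespace: str, port: int, scheme: str = "http") -> str:
    """Build a URL from the standard Kubernetes in-cluster DNS name of a Service."""
    return f"{scheme}://{service_name}.{namespace}.svc.cluster.local:{port}"


# Assumption: the Service is named '<cr-name>-service' and listens on 8321.
cr_name = generate_cr_name()
url = in_cluster_service_url(f"{cr_name}-service", "my-namespace", 8321)
model_id = qualify_model_id("vllm-inference", "my-model")
```

Generating the name before the resource exists means the same value can be handed both to the deployment and to anything that needs to reach it, such as pipeline pods.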

Made-with: Cursor

github-actions · 2026-03-08T04:23:24Z

The following are automatically added/executed:

PR size label.
Run pre-commit
Run tox
Add PR author as the PR assignee
Build image based on the PR

Available user actions:

To mark a PR as WIP, add /wip in a comment. To remove the WIP mark, comment /wip cancel.
To block merging of a PR, add /hold in a comment. To unblock merging, comment /hold cancel.
To mark a PR as approved, add /lgtm in a comment. To remove the approval, add /lgtm cancel.
The lgtm label is removed on each new commit push.
To mark a PR as verified, comment /verified. To un-verify, comment /verified cancel.
The verified label is removed on each new commit push.
To cherry-pick a merged PR, comment /cherry-pick <target_branch_name>. If <target_branch_name> is valid
and the current PR is merged, a cherry-picked PR is created and linked to the current PR.
To build and push an image to quay, add /build-push-pr-image in a comment. This creates an image with the tag
pr-<pr_number> in the quay repository. The image tag is deleted when the PR is merged or closed.

Supported labels

{'/lgtm', '/wip', '/verified', '/cherry-pick', '/hold', '/build-push-pr-image'}

The guardrails_orchestrator_ssl_cert, guardrails_orchestrator_ssl_cert_secret, and patched_llamastack_deployment_tls_certs fixtures are no longer needed since the shield tests now use verify_ssl=False with the HTTPS route. This resolves the unused-code CI failure. Made-with: Cursor

kpunwatk · 2026-03-09T11:12:05Z

tests/llama_stack/conftest.py

-        default_garak_image = "quay.io/trustyai/garak-remote-provider:latest"
+        # Garak uses KUBEFLOW_GARAK_BASE_IMAGE; Ragas uses KUBEFLOW_BASE_IMAGE
+        # quay.io/rhoai/odh-trustyai-garak-lls-provider-dsp-rhel9@sha256:75eb795e9e459c0f6951ee1fc3ee325ae593d6aab32eee203723d28880c7ca31 (3.4-ea.1 sha)
+        # quay.io/opendatahub/odh-trustyai-garak-lls-provider-dsp@sha256:a3b65a9fdb6996fdaac45286b17522806cdf5af133275806fef5f93265103fc9


The images are commented out.

Those are just for reference; all of those images have the same content. Since I used the tag, I pasted the links with the sha just to be safe.

kpunwatk · 2026-03-09T11:14:50Z

tests/llama_stack/eval/conftest.py

+
+    Reads the predictor service port from the cluster instead of hardcoding
+    a container port. Works correctly regardless of KServe headed/headless mode.
+    """


Can we use this? https://github.com/opendatahub-io/opendatahub-tests/blob/d476993db9f5d471c3caf24e50eecca7df1710aa/tests/fixtures/inference.py#L89

Can we modify that fixture to dynamically use the port from the svc instead of hardcoding it?

Yeah, you're right, the fixture hardcodes port 8032. We could have modified that fixture, but it does make sense to create a new one as per the test requirement.
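
For illustration, reading the port from the Service object rather than hardcoding it might look like the sketch below. It works on a plain dict shaped like a Kubernetes Service manifest (`spec.ports[].port`); the function name and the sample values are hypothetical, and the actual fixture in this PR may read the Service differently.

```python
from typing import Any, Mapping, Optional


def service_port(service: Mapping[str, Any], port_name: Optional[str] = None) -> int:
    """Return a port from a Service manifest: the named one, or the first if no name is given."""
    ports = service.get("spec", {}).get("ports", [])
    if not ports:
        raise ValueError("Service defines no ports")
    if port_name is None:
        return int(ports[0]["port"])
    for entry in ports:
        if entry.get("name") == port_name:
            return int(entry["port"])
    raise ValueError(f"Service has no port named '{port_name}'")


# Illustrative manifest fragment; the values are made up for the example.
predictor_service = {
    "metadata": {"name": "my-model-predictor"},
    "spec": {"ports": [{"name": "http", "port": 80, "targetPort": 8032}]},
}
port = service_port(predictor_service)
```

Because the value comes from whatever the cluster reports, the same fixture keeps working if the Service exposes a different port in another deployment mode.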

kpunwatk · 2026-03-09T11:16:30Z

tests/llama_stack/eval/conftest.py

+def guardrails_orchestrator_service_url(
+    admin_client: DynamicClient,
+    model_namespace: Namespace,
+    guardrails_orchestrator: Any,


Why do we need this fixture?

We are setting up the guardrails orchestrator to use for shields testing.

Then can we use the existing guardrail_orch fixtures? https://github.com/opendatahub-io/opendatahub-tests/blob/292742e7d6e0113c223c4a428fdc98030dfe15f0/tests/fixtures/guardrails.py#L128

github-actions · 2026-04-01T06:07:25Z

Status of building tag garak_inline: success.
Status of pushing tag garak_inline to image registry: failure.

github-actions bot assigned saichandrapandraju Mar 8, 2026

github-actions bot added the size/xl label Mar 8, 2026

saichandrapandraju changed the title ~~Add integration tests for Garak remote provider~~ feat(garak): Add integration tests for Garak remote provider Mar 8, 2026

saichandrapandraju added 2 commits March 7, 2026 21:38

Replace default Garak image with konflux onboarded rhoai-3.4-ea.1 tag.

531c3fd

kpunwatk reviewed Mar 9, 2026

kpunwatk merged commit 58d6c04 into kpunwatk:garak_inline Apr 1, 2026
10 of 12 checks passed

kpunwatk merged 3 commits into kpunwatk:garak_inline from saichandrapandraju:pr-1132
