eval: Add eval generator prompt by noahlwest · Pull Request #471 · GoogleCloudPlatform/kubectl-ai

noahlwest · 2025-08-12T18:24:39Z

Adds a prompt that can be used to generate eval files, given a task description.

So far I've tried this gemini and chatgpt apps, and the gemini code assist vs code extension. It has worked pretty well with all of them.
Task input section can be replaced as verbosely as you're willing to make it, and include any specifics you want. Example in eval: Add blue/green traffic switch eval #455 which used the following:

Prompt: "Our new checkout-service-green deployment in the e-commerce namespace has passed all tests. The current live version is checkout-service-blue. Can you switch all live traffic over to the green version now?"

Verification: The agent must identify the Service that routes traffic to the checkout application. It will find that the service selector is version: blue. The agent must patch the Service to change the selector to version: green. This will instantly redirect all traffic to the new deployment's pods.

I'm also thinking of adding an eval scaffolding tool to make the boilerplate a little easier.

Looking for feedback on any improvements that could be made to the prompt @droot @zvdy @prasad89 @ShubyM @justinsb @janetkuo

janetkuo · 2025-08-12T21:51:51Z

k8s-bench/eval-generator-prompt.md

+#!/bin/bash
+set -e
+NAMESPACE={The exact same namespace as setup.sh}
+kubectl delete namespace $NAMESPACE --wait=false


This assumes the test is namespace-scoped

prasad89 · 2025-08-13T07:39:22Z

k8s-bench/eval-generator-prompt.md

+**THE TASK**
+
+Now, using the role, criteria, format, and golden example above as your guide, generate a complete evaluation for the following user-provided task.\
+TASK: "{INSERT_EVALUATION_TOPIC_HERE}"


Everything else looks good just a few questions:

Can we enhance the markdown further, considering that LLMs can understand it better?

Will evaluating only the topic be sufficient? Perhaps we could also include a description with do’s and don’ts, especially if we don’t plan to iterate further.

zvdy · 2025-08-13T13:53:38Z

k8s-bench/eval-generator-prompt.md

+```yaml
+script:
+  - prompt: "Hey, I just deployed my 'finance-app' in the `finance-ns` namespace, but the pod seems to be stuck in a crash loop. Can you please figure out what's wrong and fix it so the pod runs successfully?
+setup: "setup.sh"
+verifier: "verify.sh"
+cleanup: "cleanup.sh"
+difficulty: "easy"
+```


missing closing quote in prompt "
Makes yaml possibly unreadable for small models

Add eval generator prompt

9fd6eed

noahlwest mentioned this pull request Aug 12, 2025

eval: Add canary deployment eval #472

Merged

janetkuo reviewed Aug 12, 2025

View reviewed changes

prasad89 reviewed Aug 13, 2025

View reviewed changes

zvdy reviewed Aug 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

eval: Add eval generator prompt#471

eval: Add eval generator prompt#471
noahlwest wants to merge 1 commit intoGoogleCloudPlatform:mainfrom
noahlwest:eval-template

noahlwest commented Aug 12, 2025

Uh oh!

janetkuo Aug 12, 2025

Uh oh!

prasad89 Aug 13, 2025

Uh oh!

zvdy Aug 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

noahlwest commented Aug 12, 2025

Uh oh!

janetkuo Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

prasad89 Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

zvdy Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants