Skip to content

Simplify#418

Merged
jdchawla29 merged 22 commits into
v6from
simplify
Jun 11, 2026
Merged

Simplify#418
jdchawla29 merged 22 commits into
v6from
simplify

Conversation

@jdchawla29

@jdchawla29 jdchawla29 commented Jun 10, 2026

Copy link
Copy Markdown
Collaborator

Note

Low Risk
Changes are mostly documentation, cookbook relocation, and removal of the optional pre-push hook; CI still runs the same checks. No substantive runtime SDK changes appear in the diff.

Overview
Docs and developer workflow are aligned with the v6 model: agent guides now describe capabilities, tasks, the rollout engine, and cookbooks/; pytest is documented and run from the repo root (CI drops --rootdir=hud, matching testpaths in pyproject.toml). The shared .githooks/pre-push hook is removed—contributors rely on CI and manual checks unless they add their own hook.

Runnable examples move into standalone uv projects under cookbooks/: a2a-chat (A2A server over hud.Chat, sample chat_env using @env.task / EvaluationResult, clients) and codex-coding (Workspace + OpenAIAgent Codex-style agent). Long-form docs/cookbooks/* pages are deleted in favor of v6 cookbook nav and these folders.

Documentation site structure namespaces legacy content under /v5/..., trims some v6 nav (e.g. separate “Build” group), adds v6/cookbooks/a2a-chat, and introduces redirects from old paths (/building/*, /cookbooks/*, etc.). Migration and skill docs are updated (Task not Variant, hud serve, env.workspace(), hud.graders, A2A via cookbook not ChatService). v5 tool reference pages are restored/expanded (filesystem, grounding, memory) under docs/v5/tools/.

Reviewed by Cursor Bugbot for commit b09f8b7. Bugbot is set up for automated code reviews on this repo. Configure here.

@jdchawla29 jdchawla29 marked this pull request as draft June 10, 2026 15:22
Comment thread cookbooks/a2a-chat/server.py

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0577a2581b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread hud/environment/server.py
Docs: adopt origin/v6 prose, faq.mdx, signal move to run/, and the
codex-coding -> coding-agent rename, re-expressed on this branch's API
(env.workspace, hud.graders, task.run -> Job, hud serve, task subcommands).
Code: keep gateway-by-default eval UX and once-per-process bwrap warning;
keep lazy Workspace lifecycle and add the authlib warning filter.
@jdchawla29 jdchawla29 marked this pull request as ready for review June 11, 2026 02:47

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f141da1f68

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread hud/eval/sync.py
…ks and update related tests. Enhance task upload payload to ensure correct scenario formatting.

@cursor cursor Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

Fix All in Cursor

❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

Reviewed by Cursor Bugbot for commit b09f8b7. Configure here.

Comment thread cookbooks/a2a-chat/server.py
@jdchawla29 jdchawla29 merged commit bf78a10 into v6 Jun 11, 2026
4 of 6 checks passed

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b09f8b7b9a

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread hud/cli/sync.py
Comment on lines +193 to +194
if force:
return Taskset(target_ref, [])

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Resolve taskset IDs before forced uploads

When --force is used with a stored taskset ID, --id, or a positional UUID, this returns an empty remote taskset named with the raw UUID and skips the normal lookup that recovers the display name. The later upload uses plan.taskset_name as the /tasks/upload name, so forcing an existing ID can upload to/create a taskset literally named by the UUID instead of updating the intended taskset. Resolve IDs to their display name before diffing against an empty remote set.

Useful? React with 👍 / 👎.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant