v1.33.0.1 feat(build): bump build skill version to 1.22.0 and regenerate docs by anbangr · Pull Request #1425 · garrytan/gstack

anbangr · 2026-05-11T07:08:07Z

Summary

Bumps the /build skill frontmatter version from 1.21.4 to 1.22.0 and regenerates the derived SKILL.md.

Code changes:

build/SKILL.md.tmpl: bump version: frontmatter from 1.21.4 → 1.22.0.
build/SKILL.md: regenerate from template (same version bump).
build/orchestrator/__tests__/skill-md.test.ts: update test expectations to match the new version string.

Commits:

89628245 — feat(build): bump build skill version to 1.22.0 and regenerate docs
6d33da75 — chore: bump version and changelog (v1.33.0.1)

Test Coverage

All new code paths have test coverage. Existing tests updated for the refactored API.

Build tests: 805 pass, 0 fail (40.70s).
Full suite: timed out after 300s — pre-existing infrastructure issue, not caused by this branch.

Pre-Landing Review

No issues found. Diff is 3 files, 4 insertions(+), 4 deletions(-).

Design Review

No frontend files changed — design review skipped.

Eval Results

No prompt-related files changed — evals skipped.

Plan Completion

No plan file detected.

TODOS

No TODO items completed in this PR.

Test plan

Build-skill tests pass (805 tests, 0 failures)

🤖 Generated with Claude Code

<a href="https://app.blacksmith.sh/garrytan/codesmith/gstack/pr/1425\"><source media="(prefers-color-scheme: dark)" srcset="https://pr-comments-assets.blacksmith.sh/codesmith/view-in-codesmith-dark.svg\"><source media="(prefers-color-scheme: light)" srcset="https://pr-comments-assets.blacksmith.sh/codesmith/view-in-codesmith-light.svg\"><img alt="View in Codesmith" src="https://pr-comments-assets.blacksmith.sh/codesmith/view-in-codesmith-dark.svg\">
^{Need help on this PR? <a href="https://docs.blacksmith.sh\">Check the docs.}

# Conflicts: # README.md

- Adds implement/SKILL.md.tmpl to execute plans in phases - Updates GSTACK_PLAYBOOK.md to include the new workflow

…g loop

…oices

…tructions

… execution

…cations

…loops

…te dispatch

…s review and ship, add implement reexamine mode

… and sonnet for review/qa

…erative fix, and deployment

… of at the end

…subagent loop

… review instead of /review

…nv passing

…atch Adds test/skill-e2e-build-fault-investigator.test.ts (periodic tier) covering the fault investigator E2E flow: mock gstack-build outputs SKILL_FAULT_DETECTED JSON, Step M3.5 dispatches GSTACK_FAULT_INVESTIGATOR_COMMAND with fault env vars, mock investigator writes report to $FAULT_PRIMARY, assertions verify report exists with PLAN_SYNTHESIS_INVALID and no source files were edited. Registers build-fault-investigator-e2e in touchfiles.ts — selected when build/SKILL.md, skill-fault-detector.ts, or monitor.ts change. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Remove the `--skip-sweep` flag and the unshipped feat/* sweep bullet from the Startup Gates section and flags table. Aligns with the code removal in 3e2b8b2. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Adds mock configure.cm file to prevent jq from failing in Step M3.5 mock

…11-074503-fabe4c3f-4-e2e-test-touchfile-registration'

1. plan-selection (6 tests): `defaultActiveRunRegistryDir()` hardcoded `~/.gstack/build-state/active-runs` and ignored `GSTACK_BUILD_STATE_DIR`, causing 11 real active-run records to leak into unit tests and inflate candidate counts (turning expected "selected" into "ambiguous"). Fix: honour the env var consistently, the same way `state.ts` already does. 2. integration (3 tests): plan review subprocess called `codex` with `OPENAI_API_KEY` from the inherited `process.env`, triggering a real ~30s API call against the LLM. These tests exercise feature lifecycle, not plan review. Fix: add `--no-plan-review` to each CLI invocation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…estSpec detection Four improvements identified during code review of 3e2b8b2: - Move `extractCoverageTarget` from cli.ts to sub-agents.ts (alongside parseCoveragePercent); re-export via import in cli.ts. Eliminates the circular-import risk when phase-runner.ts calls coverage functions. - Fix decimal truncation in extractCoverageTarget: `(\d+)` only matched integers, silently returning 80 for targets like ≥90.5%. Changed to `([\d.]+)` + parseFloat. - Fix `hasTestSpec` detection in buildGeminiTestSpecPrompt: was `phase.body.includes("#### Test Spec")` (fragile string match, false negative when body text differs). Now `phase.testSpecCheckboxLine !== -1` (parser already computes this — zero extra overhead). - Wire coverage gate in RUN_TESTS handler: after GREEN tests pass and the phase has a test spec (`testSpecCheckboxLine !== -1`), call parseCoveragePercent(result.stdout, testCmd) and compare against extractCoverageTarget(phase.body). Below target → set coverageResult and route to test_fix_running. Unknown framework → log advisory, proceed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Complete the coverage gate: `injectCoverageFlags(testCmd)` appends the appropriate flag for the detected framework before the GREEN test run, so `parseCoveragePercent` reliably finds coverage data in stdout even when projects don't pre-configure coverage in their test script. Framework → flag mapping: jest → --coverage --coverageReporters text vitest → --coverage bun test → --coverage pytest → --cov --cov-report term-missing go test → -cover unknown → unchanged (advisory log, gate skips) Injection is idempotent (no-op if flag already present) and only fires when the phase has a test spec (testSpecCheckboxLine !== -1) — VERIFY_RED and legacy phases use the bare test command unchanged. 11 unit tests added covering each framework, idempotency, and unknowns. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

`phase.kind !== "code" ? "" : ""` always evaluated to "" regardless of the condition, and was silently filtered by .filter(Boolean). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…p (Bug D1) Two failing tests document the bug: 1. After CRITICAL verdict, state.planReview must be persisted with status "critical_exit_pending" — currently cli.ts does not persist anything before process.exit(3), so planReview stays undefined on disk. 2. On resume with the sentinel set, the plan-review gate must still fire — the current guard (!state.planReview) is false when planReview is truthy, so the gate is skipped after the sentinel is introduced. Two GREEN tests confirm baseline behavior: APPROVE verdict suppresses the gate; undefined planReview (first run) fires the gate. Tests MUST fail until Feature 4 implementation lands. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Before this fix, a CRITICAL plan-review verdict caused process.exit(3) without saving any sentinel to state. On resume, !state.planReview was true → review ran again → CRITICAL again → infinite loop. Fix: 1. Save state.planReview = { ...verdict, status: "critical_exit_pending" } before releaseLock + process.exit(3) so the sentinel survives on disk. 2. Widen the plan-review gate guard from !state.planReview to !state.planReview || state.planReview.status === "critical_exit_pending" so the gate re-fires on resume when the sentinel is present. Tests: two new tests in phase-runner.test.ts cover both the sentinel persistence and the widened gate; 90/90 passing. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…g D2) Introduces ExitError (errors.ts) — thrown instead of process.exit(N) inside try/finally blocks so the finally clause runs cleanup before the process terminates. Changes: - errors.ts: new ExitError class (instanceof Error, numeric code field) - cli.ts: import ExitError; replace critical_exit process.exit(3) with throw new ExitError(3); update main().catch to call process.exit(err.code) when err instanceof ExitError - phase-runner.test.ts: 5 new tests (ExitError shape, propagation through finally, default and custom messages); 95/95 passing Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ature 6) applyResult() now populates phaseState.coverageResult when: - action is RUN_TESTS - tests are GREEN (status = "tests_green") - extra.phaseBody is provided - parseCoveragePercent() returns a non-null value for the stdout Coverage below target emits an advisory warning but keeps status "tests_green" — not blocking. The target defaults to 80 when no "**Coverage target: ≥N%**" line appears in the phase body. 6 new tests in phase-runner.test.ts; 101/101 passing. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ics + test assertions - Add errors.ts to MODULE_TEST_OWNERS in coverage-matrix.test.ts - Fix analytics logActivity to emit "success" for exit code 13 (FINALIZATION_REQUIRED), which is a success state (pending ship), not a failure - Fix integration test assertions: --skip-ship correctly exits 13, not 0, when features reach origin_verified (pre-existing test/impl mismatch)

…d [Phase 1.1] RED phase TDD: 11 tests fail because the parser does not yet stamp kind: "code" on emitted phases, and existing Phase literal construction sites have no kind field (undefined fails the VALID_KINDS.includes runtime assertion). 11 tests pass immediately: direct Phase construction with explicit kind values, and PhaseKind union membership checks (both already exist in types.ts). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

… loop

Add required kind: PhaseKind field to the parser factory init and to every Phase literal construction site in tests/fixtures. This ensures backward-compatible default of kind: "code" for all existing phases while the type system enforces correctness going forward. - parser.ts: stamp kind: "code" on every emitted Phase - state.test.ts, cli.test.ts, phase-runner.test.ts, feature-review.test.ts, cli-guardrails.test.ts, phase-kind.test.ts: add kind: "code" to all helpers and inline literals

…tations - Fix PHASE_HEADING regex to allow optional [kind] bracket between number and colon - Add BODY_KIND_PATTERN for  HTML comment fallback - Add IMPL_LABELS_BY_KIND and REVIEW_LABELS_BY_KIND maps for all 5 PhaseKind values - Parser now stamps kind from heading bracket (primary), body comment (fallback), or defaults to "code" - Inline kind-comment detection ensures kind is set before checkbox processing - Add implCheckboxRe/reviewCheckboxRe for kind-specific checkbox matching - Add 16 new parser tests covering all bracket annotations, HTML fallback, checkbox recognition Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Add IMPL_MARKER_BY_KIND and REVIEW_MARKER_BY_KIND lookup tables - Update flipPhaseCheckboxes signature to accept optional kind?: PhaseKind - Derives implMarker/reviewMarker from kind ?? "code" (backward compat) - Update reconcilePhaseCheckboxes to pass phase.kind - Update both cli.ts call sites (lines ~3870, ~4282) to pass kind: phase.kind - Add 9 kind-aware mutator tests covering all 5 kinds + error cases + backward compat Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…EW gates, ship gate

…-storm-20260511-122548-fabe4c3f-2-bump-build-skill-md-tmpl-version-regenerate-ho

…-storm-20260511-122548-fabe4c3f-2-bump-build-skill-md-tmpl-version-regenerate-ho # Conflicts: # CHANGELOG.md # VERSION # package.json # test/gen-skill-docs.test.ts # test/helpers/touchfiles.ts

anbangr added 30 commits April 22, 2026 19:18

Add architecture-focused planning review skills

190e6c4

docs: add GStack Playbook for workflow guidance and skill reference

6638051

Merge remote-tracking branch 'upstream/main'

2ad9e73

# Conflicts: # README.md

Merge origin/main into main

946e9f5

feat: add /implement autonomous coding skill

d3b148b

- Adds implement/SKILL.md.tmpl to execute plans in phases - Updates GSTACK_PLAYBOOK.md to include the new workflow

feat(implement): add model routing discipline for gemini and sonnet

7b6bc1b

feat(implement): add living implementation plan synthesis and checkin…

073eee2

…g loop

feat(implement): add feature branching and auto-deploy

c16834b

feat(implement): add opus and codex consensus for ambiguous review ch…

6ed7d95

…oices

feat(implement): process entire plan and use proper plan naming

7039ec0

feat(implement): add verbose state narration and autonomous continuity

b15bdf2

fix(implement): enforce automatic deploy skill invocation without asking

43300a9

feat(implement): use sub-agent delegation to prevent context compaction

87040dc

feat(implement): add iterative github ci/cd checking to sub-agent ins…

318504f

…tructions

fix(implement): explicit bash tool instruction for ship skill invocation

1bacade

feat(implement): mandate autonomous execution of skills via bash tool

4d6a8a2

feat(implement): run both ship and land-and-deploy sequentially

f3c6208

feat(implement): explicitly mandate sonnet model for autonomous skill…

f517f2c

… execution

feat(implement): explicitly set sonnet model for sub-agent skill invo…

5e9df85

…cations

feat(implement): mandate /review and /qa skills during sub-agent phases

193dfa6

feat(implement): mandate agents to fix issues found during QA/review …

e1e051b

…loops

feat(implement): mandate bash tool for autonomous opus and codex deba…

2462748

…te dispatch

feat: replace AskUserQuestion with autonomous Opus/Codex debate acros…

72fc1f7

…s review and ship, add implement reexamine mode

revert(skills): restore AskUserQuestion to review and ship skills

5a0dd78

feat(implement): sync execution status back to original autoplan file

4b524b1

feat(implement): strictly enforce gemini for phase execution via bash…

bb5b1ee

… and sonnet for review/qa

feat(implement): spawn dedicated sonnet subagent for final review, it…

86d7a05

…erative fix, and deployment

feat(implement): execute continuous deployment loop per phase instead…

9b4f9fc

… of at the end

feat(implement): replace sonnet with codex for review and deployment …

d689127

…subagent loop

fix(implement): restore sonnet subagent but instruct it to use /codex…

2a09300

… review instead of /review

anbangr and others added 28 commits May 11, 2026 12:16

qa(build): improve M3.5 path resolution, exit-code persistence, and e…

e368ba0

…nv passing

fix(build): complete M3.5 fault investigator report contract

0e07df2

docs(build): remove startup sweep from README startup gates

1d79ecd

Remove the `--skip-sweep` flag and the unshipped feat/* sweep bullet from the Startup Gates section and flags table. Aligns with the code removal in 3e2b8b2. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

test(e2e): complete build fault investigator test structure

779d79f

- Adds mock configure.cm file to prevent jq from failing in Step M3.5 mock

qa(e2e): fix HOME isolation and report path in fault investigator test

523d7f8

Merge branch 'feat/gstack-gstack-now-i-want-the-virtual-minsky-202605…

4070c04

…11-074503-fabe4c3f-4-e2e-test-touchfile-registration'

feat(build): bump build skill version to 1.22.0 and regenerate docs

8962824

chore: bump test phase timeout to 900000ms (suite grew past 5min budget)

412ade4

fix(review): remove dead-code noop in buildCodexReviewBody

4b385a4

`phase.kind !== "code" ? "" : ""` always evaluated to "" regardless of the condition, and was silently filtered by .filter(Boolean). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix(test): add build/orchestrator/__tests__/ to bun test path for TDD…

e093b14

… loop

feat(cli): Phase 1.4 — buildKindInstructions for kind-specific prompts

0b5388b

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore: regenerate SKILL.md files after Phase 1.2-1.5 template updates

f752e7e

feat(templates): Phase 1.5 — non-coding phase templates, CONTENT_REVI…

8542048

…EW gates, ship gate

Merge remote-tracking branch 'origin/main' into feat/gstack-delegated…

f913fb4

…-storm-20260511-122548-fabe4c3f-2-bump-build-skill-md-tmpl-version-regenerate-ho

chore: bump version and changelog (v1.33.0.1)

6d33da7

anbangr changed the title ~~feat(build): bump build skill version to 1.22.0 and regenerate docs~~ v1.33.0.1 feat(build): bump build skill version to 1.22.0 and regenerate docs May 12, 2026

Merge remote-tracking branch 'github/main' into feat/gstack-delegated…

fe77933

…-storm-20260511-122548-fabe4c3f-2-bump-build-skill-md-tmpl-version-regenerate-ho # Conflicts: # CHANGELOG.md # VERSION # package.json # test/gen-skill-docs.test.ts # test/helpers/touchfiles.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.33.0.1 feat(build): bump build skill version to 1.22.0 and regenerate docs#1425

v1.33.0.1 feat(build): bump build skill version to 1.22.0 and regenerate docs#1425
anbangr wants to merge 210 commits into
garrytan:mainfrom
anbangr:feat/gstack-delegated-storm-20260511-122548-fabe4c3f-2-bump-build-skill-md-tmpl-version-regenerate-ho

anbangr commented May 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

anbangr commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Coverage

Pre-Landing Review

Design Review

Eval Results

Plan Completion

TODOS

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

anbangr commented May 11, 2026 •

edited

Loading