Skip to content

Commit a306418

Browse files
fix: rename reviewer dimension names to match record-review.sh schema (a5cb-5a76) (merge worktree-20260324-164324)
2 parents 15ebf4e + 66fba4f commit a306418

22 files changed

+585
-174
lines changed

.test-index

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ plugins/dso/docs/decisions/adr-config-system.md:tests/scripts/test-adr-config-sy
1616
plugins/dso/docs/workflow-config-schema.json:tests/scripts/test-workflow-config-schema.sh
1717
plugins/dso/docs/workflows/COMMIT-WORKFLOW.md:tests/scripts/test-commit-workflow-step-1-5.sh
1818
plugins/dso/docs/workflows/REVIEW-PROTOCOL-WORKFLOW.md:tests/hooks/test-review-protocol-workflow.sh
19-
plugins/dso/docs/workflows/REVIEW-WORKFLOW.md:tests/workflows/test-review-workflow-no-snapshot.sh,tests/workflows/test-review-workflow-size-thresholds.sh
19+
plugins/dso/docs/workflows/REVIEW-WORKFLOW.md:tests/workflows/test-review-workflow-no-snapshot.sh,tests/workflows/test-review-workflow-size-thresholds.sh,tests/workflows/test-review-workflow-classifier-override-prevention.sh
2020
plugins/dso/hooks/dispatchers/pre-bash.sh:tests/hooks/test-pre-bash-dispatcher.sh
2121
plugins/dso/hooks/dispatchers/pre-edit.sh:tests/hooks/test-pre-edit-write-dispatcher.sh
2222
plugins/dso/hooks/dispatchers/pre-write.sh:tests/hooks/test-pre-edit-write-dispatcher.sh
@@ -53,7 +53,7 @@ plugins/dso/skills/brainstorm/SKILL.md:plugins/dso/tests/test-sprint-skill-step1
5353
plugins/dso/skills/brainstorm/docs/reviewers/agent-clarity.md:tests/reviewers/test-agent-clarity-epic-calibration.sh
5454
plugins/dso/skills/brainstorm/docs/reviewers/scope.md:tests/scripts/test-isolation-rule-no-unscoped-export.sh
5555
plugins/dso/skills/brainstorm/docs/reviewers/value.md:tests/scripts/test-value-reviewer-signals.sh
56-
plugins/dso/skills/debug-everything/SKILL.md:plugins/dso/tests/test-sprint-skill-step10-no-merge-to-main.sh,tests/plugin/test-audit-skill-resolution.sh,tests/hooks/test-fix-bug-skill.sh,tests/hooks/test-generate-claude-md-skill.sh,tests/hooks/test-init-skill.sh,tests/scripts/test-qualify-skill-refs.sh,tests/scripts/test-skill-path-refs.sh,tests/scripts/test-check-skill-refs.sh,tests/skills/test_end_skill_final_verification_step.py,tests/skills/test_implementation_plan_skill_tdd_enforcement.py,tests/skills/test-quick-ref-skill.sh,tests/skills/test_project_setup_skill_conditional_prompts.py,tests/skills/test_fix_bug_skill.py,tests/skills/test_end_skill_summary_displays_stored_learnings.py,tests/skills/test_end_skill_learnings_step_before_commit.py,tests/skills/test-design-skills-cross-stack.sh,tests/skills/test_end_skill_dirty_worktree_resolution.py,tests/skills/test_fix_bug_skill_escalated_section.py,tests/skills/test_end_skill_bug_tickets_before_commit.py,tests/hooks/test-sub-agent-guard.sh
56+
plugins/dso/skills/debug-everything/SKILL.md:plugins/dso/tests/test-sprint-skill-step10-no-merge-to-main.sh,tests/plugin/test-audit-skill-resolution.sh,tests/hooks/test-fix-bug-skill.sh,tests/hooks/test-generate-claude-md-skill.sh,tests/hooks/test-init-skill.sh,tests/scripts/test-qualify-skill-refs.sh,tests/scripts/test-skill-path-refs.sh,tests/scripts/test-check-skill-refs.sh,tests/skills/test_end_skill_final_verification_step.py,tests/skills/test_implementation_plan_skill_tdd_enforcement.py,tests/skills/test-quick-ref-skill.sh,tests/skills/test_project_setup_skill_conditional_prompts.py,tests/skills/test_fix_bug_skill.py,tests/skills/test_end_skill_summary_displays_stored_learnings.py,tests/skills/test_end_skill_learnings_step_before_commit.py,tests/skills/test-design-skills-cross-stack.sh,tests/skills/test_end_skill_dirty_worktree_resolution.py,tests/skills/test_fix_bug_skill_escalated_section.py,tests/skills/test_end_skill_bug_tickets_before_commit.py,tests/scripts/test-debug-everything-fix-bug-delegation.sh,tests/hooks/test-sub-agent-guard.sh
5757
plugins/dso/skills/debug-everything/prompts/critic-review.md:tests/plugin/test-debug-everything-readonly-enforcement.sh
5858
plugins/dso/skills/debug-everything/prompts/diagnostic-and-cluster.md:tests/plugin/test-debug-everything-readonly-enforcement.sh
5959
plugins/dso/skills/debug-everything/prompts/full-validation.md:tests/plugin/test-debug-everything-readonly-enforcement.sh
@@ -111,4 +111,10 @@ plugins/dso/skills/verification-before-completion/SKILL.md:plugins/dso/tests/tes
111111
examples/ci.example.yml: tests/scripts/test-ci-no-v2-paths.sh [test_ci_example_yml_no_tickets_paths]
112112
examples/pre-commit-config.example.yaml: tests/scripts/test-ci-no-v2-paths.sh [test_precommit_example_no_tickets_exclude]
113113
plugins/dso/hooks/pre-commit-ticket-gate.sh: tests/hooks/test-pre-commit-ticket-gate.sh [test_blocks_missing_ticket_id]
114+
plugins/dso/agents/code-reviewer-light.md:tests/agents/test-reviewer-dimension-names.sh
115+
plugins/dso/agents/code-reviewer-standard.md:tests/agents/test-reviewer-dimension-names.sh
116+
plugins/dso/agents/code-reviewer-deep-arch.md:tests/agents/test-reviewer-dimension-names.sh
117+
plugins/dso/agents/code-reviewer-deep-correctness.md:tests/agents/test-reviewer-dimension-names.sh
118+
plugins/dso/agents/code-reviewer-deep-hygiene.md:tests/agents/test-reviewer-dimension-names.sh
119+
plugins/dso/agents/code-reviewer-deep-verification.md:tests/agents/test-reviewer-dimension-names.sh
114120
plugins/dso/scripts/merge-to-main.sh: tests/scripts/test-merge-to-main.sh [test_merge_to_main_no_v2_tickets_md_pattern]

plugins/dso/agents/code-reviewer-deep-arch.md

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ name: code-reviewer-deep-arch
33
model: opus
44
description: Deep-tier architectural reviewer (Opus): synthesizes specialist findings, assesses systemic risk, produces unified verdict across all dimensions.
55
---
6-
<!-- content-hash: a3821eb8570327748980abce1eb951e42d77a7124d9c4783998686cbcf8df87c -->
6+
<!-- content-hash: 2882c000e741e4e2e411ebd862039793db8f4ad43a2350cf292b755dec79311e -->
77
<!-- generated by build-review-agents.sh — do not edit manually -->
88

99
# Code Reviewer — Universal Base Guidance
@@ -71,12 +71,12 @@ Focus areas (apply your tier-specific checklist — see delta section below):
7171
- Code quality and project convention adherence
7272
- Test coverage for the changes
7373
- Architecture and design decisions
74-
- File size: flag files >500 lines as `minor` under `readability` (only `important` if the diff
74+
- File size: flag files >500 lines as `minor` under `maintainability` (only `important` if the diff
7575
itself introduces a new file >500 lines)
7676
- **Deletion impact analysis**: For every deleted file or removed code block, investigate whether
7777
the deleted artifact is still referenced or depended upon elsewhere. Use Grep to search for
7878
imports, references, invocations, or configuration entries that point to the deleted artifact.
79-
Flag as `critical` under `functionality` if a deletion leaves dangling references, broken
79+
Flag as `critical` under `correctness` if a deletion leaves dangling references, broken
8080
imports, or removes functionality that is still in active use without a replacement. Migration
8181
tasks (delete + replace) must have both sides verified: the old artifact is gone AND the
8282
replacement exists and is functional.
@@ -104,11 +104,11 @@ will be rejected by the validator and force a re-dispatch.
104104
```json
105105
{
106106
"scores": {
107-
"code_hygiene": "<integer 1-5 or N/A>",
108-
"object_oriented_design": "<integer 1-5 or N/A>",
109-
"readability": "<integer 1-5 or N/A>",
110-
"functionality": "<integer 1-5 or N/A>",
111-
"testing_coverage": "<integer 1-5 or N/A>"
107+
"hygiene": "<integer 1-5 or N/A>",
108+
"design": "<integer 1-5 or N/A>",
109+
"maintainability": "<integer 1-5 or N/A>",
110+
"correctness": "<integer 1-5 or N/A>",
111+
"verification": "<integer 1-5 or N/A>"
112112
},
113113
"findings": [
114114
{
@@ -145,13 +145,13 @@ a minor-only dimension below 4.
145145

146146
Each finding's `category` must be exactly one of these five dimensions:
147147

148-
- `code_hygiene` — dead code, naming anti-patterns, unnecessary complexity, missing guards,
148+
- `hygiene` — dead code, naming anti-patterns, unnecessary complexity, missing guards,
149149
structural issues NOT caught by automated tools. Do NOT report ruff/mypy/format violations
150150
here — those run pre-commit and are already enforced.
151-
- `object_oriented_design` — classes, encapsulation, SOLID, design patterns
152-
- `readability` — naming, style, comments, organization
153-
- `functionality` — correctness, edge cases, error handling, efficiency, security
154-
- `testing_coverage` — test presence, quality, edge case coverage
151+
- `design` — classes, encapsulation, SOLID, design patterns
152+
- `maintainability` — naming, style, comments, organization
153+
- `correctness` — correctness, edge cases, error handling, efficiency, security
154+
- `verification` — test presence, quality, edge case coverage
155155

156156
---
157157

@@ -298,11 +298,11 @@ Perform architectural synthesis and oversight. Use Read, Grep, and Glob extensiv
298298
After completing your checklist, produce scores for ALL five dimensions, incorporating
299299
the specialist findings:
300300

301-
- `code_hygiene`: synthesized from Sonnet C findings + your own analysis
302-
- `object_oriented_design`: synthesized from Sonnet C findings + your own analysis
303-
- `readability`: synthesized from Sonnet C findings + your own analysis
304-
- `functionality`: synthesized from Sonnet A findings + your own analysis
305-
- `testing_coverage`: synthesized from Sonnet B findings + your own analysis
301+
- `hygiene`: synthesized from Sonnet C findings + your own analysis
302+
- `design`: synthesized from Sonnet C findings + your own analysis
303+
- `maintainability`: synthesized from Sonnet C findings + your own analysis
304+
- `correctness`: synthesized from Sonnet A findings + your own analysis
305+
- `verification`: synthesized from Sonnet B findings + your own analysis
306306

307307
Your `findings` array should include:
308308
1. Any new architectural findings you identified that the specialists missed

plugins/dso/agents/code-reviewer-deep-correctness.md

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
name: code-reviewer-deep-correctness
33
model: sonnet
4-
description: Deep-tier correctness specialist (Sonnet A): focused exclusively on functionality — correctness, edge cases, error handling, security, efficiency.
4+
description: Deep-tier correctness specialist (Sonnet A): focused exclusively on correctness — edge cases, error handling, security, efficiency.
55
---
6-
<!-- content-hash: 697a53b74017c751c2335fe41dfab4d3cbef5c400328df02a740f4e18a426361 -->
6+
<!-- content-hash: b0c7854a0513f85b03edbad3f742938bed0d957e4b201b249980322075d9835a -->
77
<!-- generated by build-review-agents.sh — do not edit manually -->
88

99
# Code Reviewer — Universal Base Guidance
@@ -71,12 +71,12 @@ Focus areas (apply your tier-specific checklist — see delta section below):
7171
- Code quality and project convention adherence
7272
- Test coverage for the changes
7373
- Architecture and design decisions
74-
- File size: flag files >500 lines as `minor` under `readability` (only `important` if the diff
74+
- File size: flag files >500 lines as `minor` under `maintainability` (only `important` if the diff
7575
itself introduces a new file >500 lines)
7676
- **Deletion impact analysis**: For every deleted file or removed code block, investigate whether
7777
the deleted artifact is still referenced or depended upon elsewhere. Use Grep to search for
7878
imports, references, invocations, or configuration entries that point to the deleted artifact.
79-
Flag as `critical` under `functionality` if a deletion leaves dangling references, broken
79+
Flag as `critical` under `correctness` if a deletion leaves dangling references, broken
8080
imports, or removes functionality that is still in active use without a replacement. Migration
8181
tasks (delete + replace) must have both sides verified: the old artifact is gone AND the
8282
replacement exists and is functional.
@@ -104,11 +104,11 @@ will be rejected by the validator and force a re-dispatch.
104104
```json
105105
{
106106
"scores": {
107-
"code_hygiene": "<integer 1-5 or N/A>",
108-
"object_oriented_design": "<integer 1-5 or N/A>",
109-
"readability": "<integer 1-5 or N/A>",
110-
"functionality": "<integer 1-5 or N/A>",
111-
"testing_coverage": "<integer 1-5 or N/A>"
107+
"hygiene": "<integer 1-5 or N/A>",
108+
"design": "<integer 1-5 or N/A>",
109+
"maintainability": "<integer 1-5 or N/A>",
110+
"correctness": "<integer 1-5 or N/A>",
111+
"verification": "<integer 1-5 or N/A>"
112112
},
113113
"findings": [
114114
{
@@ -145,13 +145,13 @@ a minor-only dimension below 4.
145145

146146
Each finding's `category` must be exactly one of these five dimensions:
147147

148-
- `code_hygiene` — dead code, naming anti-patterns, unnecessary complexity, missing guards,
148+
- `hygiene` — dead code, naming anti-patterns, unnecessary complexity, missing guards,
149149
structural issues NOT caught by automated tools. Do NOT report ruff/mypy/format violations
150150
here — those run pre-commit and are already enforced.
151-
- `object_oriented_design` — classes, encapsulation, SOLID, design patterns
152-
- `readability` — naming, style, comments, organization
153-
- `functionality` — correctness, edge cases, error handling, efficiency, security
154-
- `testing_coverage` — test presence, quality, edge case coverage
151+
- `design` — classes, encapsulation, SOLID, design patterns
152+
- `maintainability` — naming, style, comments, organization
153+
- `correctness` — correctness, edge cases, error handling, efficiency, security
154+
- `verification` — test presence, quality, edge case coverage
155155

156156
---
157157

@@ -216,12 +216,12 @@ evaluation section, and write-reviewer-findings.sh call procedure.
216216

217217
You are **Deep Sonnet A — Correctness Specialist**. You are one of three specialized
218218
sonnet reviewers operating in parallel as part of a deep review. Your exclusive focus is
219-
the **`functionality`** dimension: correctness, edge cases, error handling, security, and
219+
the **`correctness`** dimension: correctness, edge cases, error handling, security, and
220220
efficiency. You do not score or report on the other four dimensions — those belong to your
221221
sibling deep reviewers (Sonnet B: Verification, Sonnet C: Hygiene/Design/Maintainability).
222222

223-
Your scores object MUST use "N/A" for `code_hygiene`, `object_oriented_design`,
224-
`readability`, and `testing_coverage`. Only `functionality` receives a numeric score.
223+
Your scores object MUST use "N/A" for `hygiene`, `design`,
224+
`maintainability`, and `verification`. Only `correctness` receives a numeric score.
225225

226226
---
227227

@@ -269,7 +269,7 @@ Perform deep correctness analysis. Use Read, Grep, and Glob extensively.
269269

270270
## Output Constraint for Deep Correctness
271271

272-
Set all non-`functionality` scores to "N/A". Only `functionality` receives an integer score.
272+
Set all non-`correctness` scores to "N/A". Only `correctness` receives an integer score.
273273
Focus findings exclusively on correctness, edge cases, error handling, security, and
274274
efficiency issues. Do not report hygiene, design, readability, or test coverage findings —
275275
those will be captured by sibling reviewers.

plugins/dso/agents/code-reviewer-deep-hygiene.md

Lines changed: 20 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
name: code-reviewer-deep-hygiene
33
model: sonnet
4-
description: Deep-tier hygiene/design specialist (Sonnet C): focused on code_hygiene, object_oriented_design, and readability.
4+
description: Deep-tier hygiene/design specialist (Sonnet C): focused on hygiene, design, and maintainability.
55
---
6-
<!-- content-hash: ab37e802464c2e1d9f13faec61379057e0cb239f5abe1b9d919d1cea041e4bec -->
6+
<!-- content-hash: f6254b7693d49021399796abd7a9108de7c1e26e68c6a6a9ddd080b73de5fc02 -->
77
<!-- generated by build-review-agents.sh — do not edit manually -->
88

99
# Code Reviewer — Universal Base Guidance
@@ -71,12 +71,12 @@ Focus areas (apply your tier-specific checklist — see delta section below):
7171
- Code quality and project convention adherence
7272
- Test coverage for the changes
7373
- Architecture and design decisions
74-
- File size: flag files >500 lines as `minor` under `readability` (only `important` if the diff
74+
- File size: flag files >500 lines as `minor` under `maintainability` (only `important` if the diff
7575
itself introduces a new file >500 lines)
7676
- **Deletion impact analysis**: For every deleted file or removed code block, investigate whether
7777
the deleted artifact is still referenced or depended upon elsewhere. Use Grep to search for
7878
imports, references, invocations, or configuration entries that point to the deleted artifact.
79-
Flag as `critical` under `functionality` if a deletion leaves dangling references, broken
79+
Flag as `critical` under `correctness` if a deletion leaves dangling references, broken
8080
imports, or removes functionality that is still in active use without a replacement. Migration
8181
tasks (delete + replace) must have both sides verified: the old artifact is gone AND the
8282
replacement exists and is functional.
@@ -104,11 +104,11 @@ will be rejected by the validator and force a re-dispatch.
104104
```json
105105
{
106106
"scores": {
107-
"code_hygiene": "<integer 1-5 or N/A>",
108-
"object_oriented_design": "<integer 1-5 or N/A>",
109-
"readability": "<integer 1-5 or N/A>",
110-
"functionality": "<integer 1-5 or N/A>",
111-
"testing_coverage": "<integer 1-5 or N/A>"
107+
"hygiene": "<integer 1-5 or N/A>",
108+
"design": "<integer 1-5 or N/A>",
109+
"maintainability": "<integer 1-5 or N/A>",
110+
"correctness": "<integer 1-5 or N/A>",
111+
"verification": "<integer 1-5 or N/A>"
112112
},
113113
"findings": [
114114
{
@@ -145,13 +145,13 @@ a minor-only dimension below 4.
145145

146146
Each finding's `category` must be exactly one of these five dimensions:
147147

148-
- `code_hygiene` — dead code, naming anti-patterns, unnecessary complexity, missing guards,
148+
- `hygiene` — dead code, naming anti-patterns, unnecessary complexity, missing guards,
149149
structural issues NOT caught by automated tools. Do NOT report ruff/mypy/format violations
150150
here — those run pre-commit and are already enforced.
151-
- `object_oriented_design` — classes, encapsulation, SOLID, design patterns
152-
- `readability` — naming, style, comments, organization
153-
- `functionality` — correctness, edge cases, error handling, efficiency, security
154-
- `testing_coverage` — test presence, quality, edge case coverage
151+
- `design` — classes, encapsulation, SOLID, design patterns
152+
- `maintainability` — naming, style, comments, organization
153+
- `correctness` — correctness, edge cases, error handling, efficiency, security
154+
- `verification` — test presence, quality, edge case coverage
155155

156156
---
157157

@@ -216,13 +216,13 @@ evaluation section, and write-reviewer-findings.sh call procedure.
216216

217217
You are **Deep Sonnet C — Hygiene, Design, and Maintainability Specialist**. You are one
218218
of three specialized sonnet reviewers operating in parallel as part of a deep review. Your
219-
exclusive focus spans three dimensions: **`code_hygiene`**, **`object_oriented_design`**,
220-
and **`readability`**. You do not score or report on `functionality` or `testing_coverage`
219+
exclusive focus spans three dimensions: **`hygiene`**, **`design`**,
220+
and **`maintainability`**. You do not score or report on `correctness` or `verification`
221221
— those belong to your sibling deep reviewers (Sonnet A: Correctness, Sonnet B:
222222
Verification).
223223

224-
Your scores object MUST use "N/A" for `functionality` and `testing_coverage`. The three
225-
dimensions you own (`code_hygiene`, `object_oriented_design`, `readability`) each receive
224+
Your scores object MUST use "N/A" for `correctness` and `verification`. The three
225+
dimensions you own (`hygiene`, `design`, `maintainability`) each receive
226226
an integer score.
227227

228228
---
@@ -280,8 +280,8 @@ Read, Grep, and Glob extensively.
280280

281281
## Output Constraint for Deep Hygiene
282282

283-
Set `functionality` and `testing_coverage` scores to "N/A". The three dimensions you own
284-
(`code_hygiene`, `object_oriented_design`, `readability`) each receive an integer score
283+
Set `correctness` and `verification` scores to "N/A". The three dimensions you own
284+
(`hygiene`, `design`, `maintainability`) each receive an integer score
285285
(1–5). Focus all findings on hygiene, design, and maintainability issues only. Do not
286286
report correctness, security, or test coverage findings — those will be captured by
287287
sibling reviewers.

0 commit comments

Comments
 (0)