Skip to content

Commit 980462c

Browse files
authored
Merge pull request #17 from shobman/feature/aidos-v1.2.1-rubric-refinements
feat: AIDOS framework v1.2.1 — implementation neutrality + rubric simplification sweep
2 parents 145f799 + d5a276c commit 980462c

18 files changed

Lines changed: 107 additions & 100 deletions

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -133,12 +133,12 @@ docs/
133133
src/
134134
├── framework.md ← The full operating model — start here
135135
├── rubrics/
136-
│ ├── core.md ← Universal criteria (C1–C13) for every artifact
137-
│ ├── problem.md ← Problem criteria (P1–P10) — Product lens
138-
│ ├── solution.md ← Solution criteria (S1–S9) — Analysis lens
136+
│ ├── core.md ← Universal criteria (C1–C12) for every artifact
137+
│ ├── problem.md ← Problem criteria (P1–P11) — Product lens
138+
│ ├── solution.md ← Solution criteria (S1–S10) — Analysis lens
139139
│ ├── tech-design.md ← Tech Design criteria (A1–A10) — Architecture lens
140140
│ ├── testing.md ← Testing criteria (T1–T9) — Quality lens
141-
│ └── definition.md ← Definition criteria (F1–F8) — Maintenance lens
141+
│ └── definition.md ← Definition criteria (F1–F7) — Maintenance lens
142142
├── templates/
143143
│ ├── problem.md ← Problem artifact template
144144
│ ├── solution.md ← Solution artifact template

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
1.2.0
1+
1.2.1

docs/worked-example.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -161,7 +161,7 @@ Moving to Tech Design:
161161
> | Criterion | Assessment | Evidence | Classification |
162162
> |---|---|---|---|
163163
> | A2: Integration points | Partial | Rate limits not addressed for any channel API. The burst deploy scenario (twelve teams deploying within ten minutes) could hit Slack's rate limit of ~1 message per second per channel if multiple teams share an alerts channel. | Bug |
164-
> | C8: Operational impact | Partial | The document specifies monitoring (health endpoint, delivery latency metric) but does not identify who monitors the notification service itself. If notifications silently fail — e.g., the service is down during a deploy — who gets paged? The monitoring observes the service but nobody is named as the operator. | Bug |
164+
> | A7: Deployment and environment | Partial | The document specifies monitoring (health endpoint, delivery latency metric) but does not identify who monitors the notification service itself. If notifications silently fail — e.g., the service is down during a deploy — who gets paged? The monitoring observes the service but nobody is named as the operator. | Bug |
165165
166166
The auditor also raises two non-Bug findings:
167167

@@ -210,7 +210,7 @@ The auditor re-assesses only the criteria that were Partial or Fail in Pass 1.
210210
> |---|---|---|
211211
> | S6: Dependency identification | Pass | Rate limits addressed. One-team-one-channel constraint stated. Slack, Teams, and PagerDuty dependencies documented with their limitations. |
212212
> | A2: Integration points | Pass | Rate limit strategy documented. Pipeline retry mechanism covers notification service unavailability. |
213-
> | C8: Operational impact | Pass | Platform team named as owner. On-call paging specified for health endpoint failure. |
213+
> | A7: Deployment and environment | Pass | Platform team named as owner. On-call paging specified for health endpoint failure. |
214214
> | Coherence (Problem → Tech Design) | Pass | Pipeline retry + pipeline monitoring closes the five-minute notification gap for the service-down scenario. |
215215
216216
**Pass 2: All clear.** Artifact status moves to ACCEPTED.

skills/auditor/SKILL.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,12 +23,12 @@ When the user presents an artifact for review, you:
2323
|---|---|
2424
| `auditor-prompt.md` | **Your system prompt.** Read this first — it defines your behaviour, the three-pass rule, and output format. |
2525
| `framework.md` | The AIDOS operating model. Reference for scaling, coherence rules, and the artifact stack. |
26-
| `rubrics/core.md` | Core rubric (C1–C13). Universal criteria applied to every artifact at every scale. |
27-
| `rubrics/problem.md` | Problem rubric (P1–P10). Product lens criteria. |
28-
| `rubrics/solution.md` | Solution rubric (S1–S9). Analysis lens criteria. |
26+
| `rubrics/core.md` | Core rubric (C1–C12). Universal criteria applied to every artifact at every scale. |
27+
| `rubrics/problem.md` | Problem rubric (P1–P11). Product lens criteria. |
28+
| `rubrics/solution.md` | Solution rubric (S1–S10). Analysis lens criteria. |
2929
| `rubrics/tech-design.md` | Tech Design rubric (A1–A10). Architecture lens criteria. |
3030
| `rubrics/testing.md` | Testing rubric (T1–T9). Quality lens criteria. |
31-
| `rubrics/definition.md` | Definition rubric (F1–F8). Maintenance lens criteria for post-delivery Definitions. |
31+
| `rubrics/definition.md` | Definition rubric (F1–F7). Maintenance lens criteria for post-delivery Definitions. |
3232
| `templates/retrospective.md` | Retrospective template for rubric evolution. |
3333
| `CONTRIBUTING.md` | How to propose rubric changes — the contribution model for evolving the framework. |
3434
| `VERSION` | **Framework version.** Plain-text file containing the current AIDOS framework semver (e.g. `1.0.0`). Read on session start — used to compare against the audited file's `AIDOS Version` metadata. |

skills/builder/SKILL.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -22,15 +22,15 @@ When the user describes work they want to deliver, you:
2222
|---|---|
2323
| `builder-prompt.md` | **Your system prompt.** Read this first — it defines your behaviour, session flow, and constraints. |
2424
| `framework.md` | The AIDOS operating model. Reference for scaling, coherence rules, and the artifact stack. |
25-
| `rubrics/core.md` | Core rubric (C1–C13). Universal criteria applied to every artifact at every scale. |
25+
| `rubrics/core.md` | Core rubric (C1–C12). Universal criteria applied to every artifact at every scale. |
2626
| `templates/problem.md` | Problem artifact template with section-to-rubric mapping. |
2727
| `templates/solution.md` | Solution artifact template with section-to-rubric mapping. |
2828
| `templates/tech-design.md` | Tech Design artifact template with section-to-rubric mapping. |
2929
| `templates/testing.md` | Testing artifact template with section-to-rubric mapping. |
3030
| `templates/issues-log.md` | Issues Log template for tracking escalations across the project. |
3131
| `templates/overflow-log.md` | Overflow Log template for content that can't yet be placed in an artifact. |
3232
| `templates/meeting-minutes.md` | Lean meeting capture template. |
33-
| `rubrics/definition.md` | Definition rubric (F1–F8). Maintenance lens criteria for post-delivery Definitions. |
33+
| `rubrics/definition.md` | Definition rubric (F1–F7). Maintenance lens criteria for post-delivery Definitions. |
3434
| `templates/definition.md` | Definition artifact template with section-to-rubric mapping. |
3535
| `templates/retrospective.md` | Retrospective template for rubric evolution. |
3636
| `CONTRIBUTING.md` | How to propose rubric changes — the contribution model for evolving the framework. |

src/README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -52,12 +52,12 @@ prompts/
5252
├── builder-prompt.md ← Self-contained system prompt for building artifacts
5353
└── auditor-prompt.md ← Self-contained system prompt for auditing artifacts
5454
rubrics/
55-
├── core.md ← Universal criteria (C1–C13)
56-
├── problem.md ← Product lens (P1–P10)
57-
├── solution.md ← Analysis lens (S1–S9)
55+
├── core.md ← Universal criteria (C1–C12)
56+
├── problem.md ← Product lens (P1–P11)
57+
├── solution.md ← Analysis lens (S1–S10)
5858
├── tech-design.md ← Architecture lens (A1–A10)
5959
├── testing.md ← Quality lens (T1–T9)
60-
└── definition.md ← Maintenance lens (F1–F8)
60+
└── definition.md ← Maintenance lens (F1–F7)
6161
templates/
6262
├── problem.md ← Problem artifact template
6363
├── solution.md ← Solution artifact template

src/framework.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -99,9 +99,9 @@ AIDOS depends on separation between artifact creation and artifact audit. The sa
9999

100100
Every artifact is assessed against two rubric layers:
101101

102-
**Core Rubric** — universal criteria that apply to every artifact at every scale. Alignment to goals. Simplicity. Explicit trade-offs. Failure modes. Testability. Observability. Security. Operational impact. Reversibility. Future team readiness. Unit coherence. No duplication.
102+
**Core Rubric** — universal criteria that apply to every artifact at every scale. Alignment to goals. Simplicity. Explicit trade-offs. Failure modes. Testability. Observability. Security. Reversibility. Future team readiness. Unit coherence. No duplication.
103103

104-
**Discipline Rubric** — criteria specific to each artifact type. The Problem rubric (P1–P10) checks clarity, stakeholders, measurability, root cause confidence, scope, assumptions, constraints, impact, and alternatives. The Solution rubric (S1–S9) checks coherence, workflows, edge cases, alternatives, dependencies, migration, and minimum viable slice. The Tech Design rubric (A1–A10) checks components, integration, data model, error handling, technology choices, performance, deployment, and coding agent readiness. The Testing rubric (T1–T9) checks coverage, traceability, scenarios, exit criteria, expected results, test data, environments, regression, and prioritisation. The Definition rubric (F1–F8) checks outcome accuracy, key trade-offs, maintainer orientation, known limitations, operational context, domain placement, standalone comprehension, and currency.
104+
**Discipline Rubric** — criteria specific to each artifact type. The Problem rubric (P1–P11) checks clarity, stakeholders, measurability, root cause confidence, scope, assumptions, constraints, impact, alternatives, and implementation neutrality. The Solution rubric (S1–S10) checks coherence, workflows, edge cases, alternatives, dependencies, migration, minimum viable slice, and implementation neutrality. The Tech Design rubric (A1–A10) checks components, integration, data model, error handling, technology choices, performance, deployment, and coding agent readiness. The Testing rubric (T1–T9) checks coverage, traceability, scenarios, exit criteria, expected results, test data, environments, regression, and prioritisation. The Definition rubric (F1–F7) checks outcome accuracy, key trade-offs, maintainer orientation, known limitations, operational context, domain placement, and currency.
105105

106106
Each criterion has a defined "what pass looks like." The auditor assesses Pass, Partial, or Fail with cited evidence. The evidence requirement is what gives rubrics teeth — you can't hand-wave a Pass. Partials are accepted or rejected by the human directing the audit, not waved through. The artifact doesn't advance until bugs are fixed.
107107

@@ -355,7 +355,7 @@ The Definition lives in the Feature Repository (`definitions/`), organised by pr
355355

356356
**A project cannot close with delivery artifacts unarchived.** Just as a project cannot close with PARKED overflow items, it cannot close without distilling its outcome into Definitions and archiving the delivery stack. The distillation session is how a project ends.
357357

358-
The Definition rubric (F1–F8) is assessed through the Maintenance lens. Full criteria are in `src/rubrics/definition.md`.
358+
The Definition rubric (F1–F7) is assessed through the Maintenance lens. Full criteria are in `src/rubrics/definition.md`.
359359

360360
---
361361

src/prompts/auditor-prompt.md

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -82,25 +82,24 @@ Then proceed to the audit.
8282

8383
## Rubric Criteria
8484

85-
### Core Rubric (C1–C13) — Every Artifact, Every Scale
85+
### Core Rubric (C1–C12) — Every Artifact, Every Scale
8686

8787
| # | Criterion | What "Pass" Looks Like |
8888
|---|---|---|
8989
| C1 | Alignment to goals | Every element traces to a stated goal or requirement. Nothing is included that doesn't serve a declared purpose. |
90-
| C2 | Simplicity | The simplest approach that meets the requirements. Complexity is justified where it exists. A simpler alternative was considered and rejected for a stated reason. |
90+
| C2 | Simplicity | The simplest approach that meets the requirements. Complexity is justified where it exists. |
9191
| C3 | Explicit trade-offs | Trade-offs are named. Options considered, decision taken, and reasoning are documented. |
9292
| C4 | Failure modes | What can go wrong and how failures are detected or handled. Silence on failure is itself a failure. |
9393
| C5 | Testability | Every claim, requirement, or design choice can be verified by a specific action. |
9494
| C6 | Observability | How you would know — in practice — whether the thing is working or not. |
9595
| C7 | Security | Security implications considered proportionate to the risk. "Not applicable" is stated, not assumed. |
96-
| C8 | Operational impact | Who runs this, how it's deployed, what changes for operations. Ownership identified and accepted. |
97-
| C9 | Reversibility | What can be undone and what can't. Irreversible choices are acknowledged and justified. |
98-
| C10 | Future team readiness | Someone unfamiliar could pick this up and understand what was done, why, and what's left. |
99-
| C11 | Internal consistency | Terminology used consistently, sections don't contradict each other, reads as one coherent unit. |
100-
| C12 | No duplication | References rather than copies. Each fact lives in one place. |
101-
| C13 | Single unit of work | Addresses a single deliverable that can be independently understood, built, tested, and released. |
96+
| C8 | Reversibility | What can be undone and what can't. Irreversible choices are acknowledged and justified. |
97+
| C9 | Future team readiness | Someone unfamiliar could pick this up and understand what was done, why, and what's left. |
98+
| C10 | Internal consistency | Terminology used consistently, sections don't contradict each other, reads as one coherent unit. |
99+
| C11 | No duplication | References rather than copies. Each fact lives in one place. |
100+
| C12 | Single unit of work | Addresses a single deliverable that can be independently understood, built, tested, and released. |
102101

103-
### Problem Rubric (P1–P10) — Product Lens
102+
### Problem Rubric (P1–P11) — Product Lens
104103

105104
| # | Criterion | What "Pass" Looks Like |
106105
|---|---|---|
@@ -114,8 +113,9 @@ Then proceed to the audit.
114113
| P8 | Constraints identified | Regulatory, technical, organisational, timeline, budget constraints explicit. Solution author won't discover them later. |
115114
| P9 | Impact and urgency | Cost quantified where possible. Why now. What happens if not addressed. Evidence-based, not assertion-based. |
116115
| P10 | Existing alternatives | Whether the problem is already solved acknowledged. If alternatives exist, insufficiency is stated. Building is justified. |
116+
| P11 | Implementation neutrality | Problem describes what's wrong, for whom, why — not how it's solved. Tools, vendors, schemas, APIs absent unless pre-existing constraints (then in P8). Implementation language captured in Overflow tagged for Solution or Tech Design. |
117117

118-
### Solution Rubric (S1–S9) — Analysis Lens
118+
### Solution Rubric (S1–S10) — Analysis Lens
119119

120120
| # | Criterion | What "Pass" Looks Like |
121121
|---|---|---|
@@ -128,6 +128,7 @@ Then proceed to the audit.
128128
| S7 | Migration and transition | Path from current to proposed state described. Cutover, compatibility, rollback addressed. |
129129
| S8 | Actor identification | Every person, team, or system that interacts is identified with specific interactions described. |
130130
| S9 | Constraint compliance | Solution respects Problem constraints. Gaps acknowledged with explicit mitigation or trade-off. |
131+
| S10 | Implementation neutrality | Solution describes how the response works as a system — actors, workflows, edge cases, alternatives — not which technology executes it. Tables, columns, joins, data types, libraries, services, frameworks belong in Tech Design unless pre-existing constraints (then noted in S9). Implementation detail captured in Overflow tagged for Tech Design. |
131132

132133
### Tech Design Rubric (A1–A10) — Architecture Lens
133134

@@ -158,30 +159,29 @@ Then proceed to the audit.
158159
| T8 | Regression awareness | Existing functionality at risk identified with regression tests. Proportionate to blast radius. |
159160
| T9 | Risk-based prioritisation | Must-pass vs should-pass distinguished. Priority clear when time is short. |
160161

161-
### Definition Rubric (F1–F8) — Maintenance Lens
162+
### Definition Rubric (F1–F7) — Maintenance Lens
162163

163164
The Definition is the post-delivery artifact — the living description of what was built, maintained as the feature evolves. Its audience is someone who was never in the room.
164165

165166
| # | Criterion | What "Pass" Looks Like |
166167
|---|---|---|
167168
| F1 | Outcome accuracy | Describes what was actually built, not what was planned. Divergences stated with reason. |
168169
| F2 | Key trade-offs preserved | Significant decisions captured with context. Not every decision — the shaping ones. |
169-
| F3 | Maintainer orientation | Answers: what does this do, why this way, limitations, what to know to change it safely. No delivery-process language. |
170+
| F3 | Maintainer orientation | Self-contained. Answers: what does this do, why this way, limitations, what to know to change it safely. May link to delivery artifacts for forensic detail; reader using only the Definition has enough context. No delivery-process language. |
170171
| F4 | Known limitations and debt | Tech debt, accepted risks, deferred scope listed explicitly. BACKLOG items represented. |
171172
| F5 | Operational context | Who owns it, how monitored, failure modes, runbook. Enough for on-call without the full Tech Design. |
172173
| F6 | Domain placement | Filed by product domain, not project. Findable by domain browsing. |
173-
| F7 | Standalone comprehension | Self-contained. May link to archived delivery artifacts but doesn't require them. |
174-
| F8 | Currency | Reflects current system state. Updates visible via version history or "Last updated" summary. |
174+
| F7 | Currency | Reflects current system state. Updates visible via version history or "Last updated" summary. |
175175

176176
**Story scale exception:** Stories do not produce Definitions. They inherit from their parent Feature or Epic Definition. Do not audit a Definition at Story scale.
177177

178178
### Story-Scale Subset
179179

180180
At story scale, audit is lighter but the criteria still apply. Focus on these as the primary assessment:
181181

182-
**Core:** C1, C2, C3, C4, C5, C10, C13
183-
**Problem (Context):** P1, P5
184-
**Solution (User Story):** S1, S4
182+
**Core:** C1, C2, C3, C4, C5, C9, C12
183+
**Problem (Context):** P1, P5, P11
184+
**Solution (User Story):** S1, S4, S10
185185
**Tech Design (Technical Approach):** A1, A4, A9, A10
186186
**Testing (Acceptance Criteria):** T1, T2, T5
187187

@@ -243,7 +243,7 @@ Every finding is classified:
243243
- **Risk** — decision required. The human decides: accept, mitigate, or defer. Risks don't block the artifact, but they need explicit disposition.
244244
- **Idea** — noted, not actioned unless chosen. Ideas do not drive additional audit passes. Table them separately.
245245

246-
**C13 failures are always Bugs.** If the artifact is trying to cover too many concerns, recommend decomposition into sibling artifacts at the same scale level.
246+
**C12 failures are always Bugs.** If the artifact is trying to cover too many concerns, recommend decomposition into sibling artifacts at the same scale level.
247247

248248
---
249249

0 commit comments

Comments
 (0)