You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Persist local review feedback, suppress dismissed or snoozed review items across dashboard and Chickens, and record the corresponding feature-pack/design/config updates.
Copy file name to clipboardExpand all lines: .plans/features/agent-knowledge-sandbox/spec.md
+27-3Lines changed: 27 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,6 +19,9 @@ The "YouTube Kids for agents" model: sandbox the knowledge, not just the executi
19
19
- The agent has no way to learn from YouTube channels, GitHub repos, or RSS feeds that the coop follows
20
20
- Reasoning traces evaporate after skill execution — no institutional memory, no precedent system
21
21
- The Neo4j Context Graph talk validated that graph memory + reasoning traces + hybrid retrieval is production-ready and the patterns map directly to Coop's architecture
22
+
- The durable-agent runtime lesson is clear: if models can change, memory needs independent
23
+
provenance, confirmation status, and retrieval rules so Coop behaves continuously without
24
+
turning model output into unaccountable truth.
22
25
23
26
## Scope
24
27
@@ -57,7 +60,7 @@ The "YouTube Kids for agents" model: sandbox the knowledge, not just the executi
57
60
- Context assembly for skill prompts (token-budgeted)
58
61
- No LLM calls during retrieval (hard requirement)
-**E2E**: Member adds source → agent ingests → agent uses in recommendation → member sees provenance and can distinguish inferred from confirmed memory
207
231
-**A/B**: Baseline (flat memory) vs graph-enhanced (graph retrieval) quality comparison on eval corpus
208
232
-**Regression**: All existing skill eval cases + unit tests must pass at pre-implementation thresholds
Copy file name to clipboardExpand all lines: .plans/features/agent-knowledge-sandbox/status.json
+4-3Lines changed: 4 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -60,8 +60,9 @@
60
60
"No LLM during retrieval (hard perf requirement)",
61
61
"Vellum material language for all UI surfaces",
62
62
"7-phase dependency-ordered build with gates",
63
-
"Entity extraction as new skill using existing cascade"
63
+
"Entity extraction as new skill using existing cascade",
64
+
"Memory provenance and confirmation labels are part of the product contract: observed, inferred, user-confirmed, imported, and stale memories must behave differently in retrieval and UI."
64
65
],
65
-
"updated_at": "2026-04-19",
66
-
"notes": "UI + state lanes materially landed; graph backend still snapshot-persisted (Kuzu-WASM deferred). QA pass 1 is now ready to run."
66
+
"updated_at": "2026-05-07",
67
+
"notes": "UI + state lanes materially landed; graph backend still snapshot-persisted (Kuzu-WASM deferred). QA pass 1 is now ready to run, including provenance, confirmation, and retrieval-before-work checks."
Copy file name to clipboardExpand all lines: .plans/features/next-gen-model-readiness/lanes/api.claude.todo.md
+7-2Lines changed: 7 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,14 +21,17 @@ done_when:
21
21
skills:
22
22
- architecture
23
23
- testing
24
-
updated: 2026-04-02
24
+
updated: 2026-05-07
25
25
---
26
26
27
27
# Phase 3: Prepare the Agent Pipeline for Model Upgrade
28
28
29
29
Target: Introduce a "capable model" code path alongside the 0.5B legacy path. Define tools from deterministic skills. Collapse output handlers into generic tools. Feature-flagged via `VITE_COOP_AGENT_MODE`.
30
30
31
-
**Principle**: The legacy path (0.5B + heuristic fallbacks) continues working unchanged. The autonomous path is additive — same observation lifecycle, same approval gates, same memory system, different execution strategy.
31
+
**Principle**: The legacy path (0.5B + heuristic fallbacks) continues working unchanged. The
32
+
autonomous path is additive — same observation lifecycle, same approval gates, same memory system,
33
+
same trace evidence, same fallback semantics, different execution strategy. Model routing remains
34
+
internal runtime evidence; simple mode should not become a provider-management surface.
Before deleting or shortening any instruction, classify it using `../context.md`:
35
+
36
+
-`deterministic-gate`: keep or move to code, hooks, schemas, validators, tests, or validation scripts.
37
+
-`repo-constraint`: keep once in the canonical repo instruction/rule surface.
38
+
-`product-intent`: preserve in product context or another short pointer.
39
+
-`soft-guidance`: remove or replace with a source pointer.
40
+
41
+
Do not convert a deterministic gate into prompt-only guidance. If the old prose was the only place a
42
+
gate existed, stop and route it to an executable surface or record the gap.
43
+
32
44
## Step 1: Replace context code maps with pointer files
33
45
34
46
Replace `.claude/context/app.md` (163 lines), `extension.md` (466 lines), `shared.md` (448 lines) with ~15-line pointer files. Each file becomes:
@@ -53,6 +65,9 @@ Read the source files above for architecture details.
53
65
54
66
**Verify**: Each pointer file < 20 lines. No constraint lost (cross-reference against rules/ files).
55
67
68
+
**Guardrail audit**: Record any removed `deterministic-gate`, `repo-constraint`, or `product-intent`
69
+
in the implementation notes with its new canonical home.
70
+
56
71
## Step 2: Reduce skills to constraint cards
57
72
58
73
For each `.claude/skills/*/SKILL.md`, reduce to 30-50 lines:
@@ -93,6 +108,9 @@ For each `.claude/skills/*/SKILL.md`, reduce to 30-50 lines:
93
108
94
109
**Verify**: `wc -l .claude/skills/*/SKILL.md` shows each file 20-50 lines.
95
110
111
+
**Guardrail audit**: For each skill, keep Coop-specific constraints and anti-patterns; remove generic
112
+
methodology only after confirming enforcement-sensitive behavior lives in a rule, hook, schema, or test.
113
+
96
114
## Step 3: Delete meta-documentation
97
115
98
116
-[ ] Delete `.claude/skills/index.md` (238 lines) — meta-documentation about the prompt system itself. The model discovers skills via registry, not an index file.
@@ -139,6 +157,9 @@ Remove ~120 lines from CLAUDE.md:
0 commit comments