Skip to content

Commit 409e39d

Browse files
committed
feat(data): add nine dream-system validation scenarios
Add `data/dream-validation.yaml` covering the four roadmap items the new scorer kinds were built to grade: P-1.3 retract-vs-soft-delete (Snodgrass bi-temporal): - dream-demotion-retract-discipline (passes against compliant fixture) - dream-demotion-snodgrass-violation (negative: fixture sets both timestamps; scenario MUST fail on timestamp_discipline) P0.3a stale-fact deprecation: - dream-demotion-stale-fact (pricing edge correctly demoted to superseded) P0.3b scoped cascading expiry: - dream-demotion-cascade-bounded (single-hop discipline held) - dream-demotion-cascade-runaway (negative: fixture touches a 2+ hop edge; scenario MUST fail on cascade_bounded) P1 procedure synthesis (per `dreaming-procedures.md`): - dream-procedure-weekly-report (4 ordered steps + 2 parameters) - dream-procedure-client-onboarding (different workflow, exercises the scorer on a non-degenerate case) P2 memory dedup: - dream-dedup-near-duplicates (clean merge of Sarah-billing facts; HubSpot + fiscal-year stay singletons) - dream-dedup-false-positive (negative: over-merges Sarah-billing with Sarah-manager; scenario MUST fail on ARI + pairwise precision) Eight JSON fixtures under `data/fixtures/dream/` back the offline path; each scenario also accepts a `source.raw_exchange_key` swap when AutoGPT's dream pass starts emitting structured payloads inline. Pack-level pinning in `tests/unit/dream-validation.test.ts` asserts: - ship-shape counts (>=4 demotion, >=2 procedure, >=2 dedup) - every fixture exists on disk relative to the YAML - happy-path scenarios pass against their golden - negative scenarios fail in the expected way (timestamp_discipline, cascade_bounded, over-merge) Docs updated: `platform.md` gets a new spec scenario, `current-state.md` marks it implemented, `e2e-checklist.md` references the test files, generated workspace inventory + quality score refreshed.
1 parent d253e3b commit 409e39d

16 files changed

Lines changed: 676 additions & 2 deletions

data/dream-validation.yaml

Lines changed: 436 additions & 0 deletions
Large diffs are not rendered by default.
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
{
2+
"clusters": [
3+
[
4+
"fact-sarah-billing-1",
5+
"fact-sarah-billing-2",
6+
"fact-sarah-manager-1"
7+
],
8+
["fact-hubspot-crm-1"]
9+
]
10+
}
Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
{
2+
"clusters": [
3+
["fact-sarah-billing-1", "fact-sarah-billing-2"],
4+
["fact-hubspot-crm-1"],
5+
["fact-fiscal-year-1"]
6+
]
7+
}
Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
{
2+
"observed": ["edge-northstar-marcus", "edge-northstar-cto"],
3+
"cascade_touched": ["edge-northstar-marcus", "edge-northstar-cto"],
4+
"retract_actions": [
5+
{ "uuid": "edge-northstar-marcus", "expired_at_set": true, "invalid_at_set": false, "status": "superseded" },
6+
{ "uuid": "edge-northstar-cto", "expired_at_set": true, "invalid_at_set": false, "status": "superseded" }
7+
]
8+
}
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
{
2+
"observed": ["edge-northstar-marcus", "edge-northstar-cto", "edge-hubspot-crm"],
3+
"cascade_touched": [
4+
"edge-northstar-marcus",
5+
"edge-northstar-cto",
6+
"edge-hubspot-crm"
7+
],
8+
"retract_actions": [
9+
{ "uuid": "edge-northstar-marcus", "expired_at_set": true, "invalid_at_set": false, "status": "superseded" },
10+
{ "uuid": "edge-northstar-cto", "expired_at_set": true, "invalid_at_set": false, "status": "superseded" },
11+
{ "uuid": "edge-hubspot-crm", "expired_at_set": true, "invalid_at_set": false, "status": "superseded" }
12+
]
13+
}
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
{
2+
"observed": ["edge-budget-q2"],
3+
"retract_actions": [
4+
{ "uuid": "edge-budget-q2", "expired_at_set": true, "invalid_at_set": false, "status": "superseded" }
5+
]
6+
}
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
{
2+
"observed": ["edge-budget-q2"],
3+
"retract_actions": [
4+
{
5+
"uuid": "edge-budget-q2",
6+
"expired_at_set": true,
7+
"invalid_at_set": true,
8+
"status": "superseded"
9+
}
10+
]
11+
}
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
{
2+
"observed": ["edge-pricing-old"],
3+
"retract_actions": [
4+
{
5+
"uuid": "edge-pricing-old",
6+
"expired_at_set": true,
7+
"invalid_at_set": false,
8+
"status": "superseded"
9+
}
10+
]
11+
}
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
{
2+
"steps": [
3+
"create folder in workspace",
4+
"draft welcome email",
5+
"schedule kickoff call",
6+
"share onboarding doc"
7+
],
8+
"parameters": ["client_name", "kickoff_date"]
9+
}
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
{
2+
"steps": [
3+
"pull metrics from analytics",
4+
"summarize per channel",
5+
"draft email to stakeholders",
6+
"send"
7+
],
8+
"parameters": ["recipient_list", "week_window"]
9+
}

0 commit comments

Comments
 (0)