Close Phase 3D, open Phase 3E

Your Name · claude · Your Name · commit f2c55adcbedb · 2026-06-13T11:00:58.000+08:00
Phase 3D is complete:
- Hypothesis registry established (19 hypotheses, 8 categories)
- Tier 1 validation complete (3 supported, 1 inconclusive, 1 not supported)
- Tier 2 honestly deferred (failure samples genuinely rare: 1/100)
- Closure report at phase3d/closure_report_v0.2.5.md

Phase 3E is now active:
- Controlled Transition &amp; Intervention-aware Validation
- Lanes: direct_prompt_native, routed_prompt_intervention,
  superpowers_workflow_intervention, controlled_prompt_morphology
- Deferred Tier 2/3/4 hypotheses carried forward
- Phase 4 remains not open
- Background acquisition continues, not a phase gate

Research index updated to reflect current phase roadmap.

Co-Authored-By: Claude Opus 4.7 &lt;noreply@anthropic.com&gt;
diff --git a/docs/research/README.md b/docs/research/README.md
@@ -2,33 +2,43 @@
 
 This directory groups the research tracks and branch studies that sit alongside the main `causetrace` runtime-morphology work.
 
-## Active Research Tracks
-
-- [Phase 3A](phase3a/README.md)
-- [Phase 3B](phase3b/README.md)
-- [Phase 3C](phase3c/README.md)
-- [Phase 3D](phase3d/README.md)
-
-## Current Research Status
+## Research Phase Status
+
+| Phase | Status | Summary |
+|-------|--------|---------|
+| Phase 2.5 | complete | Baseline infrastructure |
+| Phase 3A | complete | Descriptive corpus |
+| Phase 3B | complete | Topology taxonomy |
+| Phase 3C | complete | Metadata & provenance |
+| [Phase 3D](phase3d/README.md) | **complete** | Hypothesis registry + Tier 1 validation |
+| [Phase 3E](phase3e/README.md) | **active** | Controlled transition & intervention-aware validation |
+| Phase 4 | **not open** | Theory finalization |
+
+## Current Corpus Snapshot
+
+- sessions: `1351`
+- events: `128,552`
+- strict research-grade sessions: `157`
+- native strict sessions: `100`
+- agent field coverage: `100%` (inline)
+- provider field coverage: `99.8%` (inline)
+- runtime breadth: `7`
+- task breadth: `9`
 
-`causetrace` now has enough corpus scale to support validation-oriented work, but not enough metadata density to support theory finalization or default automation policy.
+## Phase 3D Closure Summary
 
-Current snapshot:
+Phase 3D delivered the hypothesis registry (19 hypotheses, 8 categories), completed Tier 1 validation (3 supported, 1 inconclusive, 1 not supported), and honestly deferred Tier 2 (failure samples genuinely rare in real agent behavior: 1/100 native failure, 0/100 near-failure). See [closure report](phase3d/closure_report_v0.2.5.md).
 
-- sessions: `1315`
-- events: `64429`
-- strict research-grade sessions: `157`
-- dominant_chain: `1111`
-- mixed: `195`
-- retry-heavy: `541`
-- branchy sessions: `179`
-- long sessions >=100 events: `53`
+## Phase 3E Active Scope
 
-The next mainline stage is:
+Controlled transition and intervention-aware validation. Lanes kept separate:
 
-`Phase 3D-T2B: Intervention-aware Acquisition`
+- `direct_prompt_native`
+- `routed_prompt_intervention`
+- `superpowers_workflow_intervention`
+- `controlled_prompt_morphology`
 
-This stage continues Tier 2 acquisition while keeping workflow-intervention lanes separate from the native direct-prompt baseline.
+Deferred hypotheses from Phase 3D Tier 2/3/4 carried forward. Tier 2 validation is opportunistic (background acquisition), not a phase gate. See [Phase 3E README](phase3e/README.md).
 
 ## Cross-project Branch Studies
 
diff --git a/docs/research/phase3d/status.md b/docs/research/phase3d/status.md
@@ -1,6 +1,6 @@
-# Phase 3D Status (v0.2.5)
+# Phase 3D Status (v0.2.5) — CLOSED
 
-Phase 3D is recommended for graduation. See [closure report](closure_report_v0.2.5.md) for full assessment.
+Phase 3D is complete. See [closure report](closure_report_v0.2.5.md) for full assessment.
 
 It delivered the hypothesis registry layer for runtime morphology research. Tier 1 validation is complete. Tier 2 is deferred honestly (failure samples genuinely rare in real agent behavior, not an execution failure).
 
@@ -10,8 +10,9 @@ It delivered the hypothesis registry layer for runtime morphology research. Tier
 - Phase 3A: complete
 - Phase 3B: complete
 - Phase 3C: complete
-- Phase 3D: recommended for graduation
-- Phase 3E: preparing
+- Phase 3D: complete
+- Phase 3E: active
+- Phase 4: not open
 
 ## Current Corpus Baseline
 
diff --git a/docs/research/phase3e/README.md b/docs/research/phase3e/README.md
@@ -0,0 +1,122 @@
+# Phase 3E: Controlled Transition & Intervention-aware Validation
+
+Phase 3E validates selected runtime morphology hypotheses under controlled or intervention-aware conditions. It does not enter Phase 4 theory finalization.
+
+## Position
+
+- Phase 3D: complete
+- Phase 3E: active
+- Phase 4: not open
+
+## Mission
+
+Validate the relationship between events, observations, interventions, workflow conditions and topology transitions. Specifically:
+
+```
+event / observation / intervention / workflow condition
+→ topology transition
+```
+
+## Scope
+
+### Active lanes (all kept separate)
+
+| Lane | Description | Merge into native? |
+|------|-------------|-------------------|
+| `direct_prompt_native` | User gave the agent a task directly | Baseline |
+| `routed_prompt_intervention` | `prompt-routing-skill` selected posture first | No |
+| `superpowers_workflow_intervention` | Structured workflow plugin changed execution shape | No |
+| `controlled_prompt_morphology` | Controlled prompt comparison or pilot run | No |
+| `external_trajectory` | External data source | No |
+
+### Validation targets
+
+- controlled benchmark protocol activation
+- intervention lane comparison (routed vs superpowers vs controlled)
+- correction-trigger studies (test failure, tool error, human correction, explicit correction mark)
+- observation-triggered transition studies (contradictory outputs, test failures, shell errors)
+- prompt posture / routing impact on retry, branch, and convergence
+- workflow intervention impact on topology (superpowers, subagent dispatching, structured workflows)
+- Tier 2 sample natural accumulation (failure, near-failure, human-intervention) with opportunistic validation
+
+### Evidence gates
+
+Every Phase 3E claim must report:
+
+- lane
+- corpus snapshot
+- denominator
+- runtime distribution
+- task_type distribution
+- intervention type (if applicable)
+- whether the result is exploratory or validation-grade
+
+## Non-goals
+
+- Phase 4 theory finalization
+- Prediction of agent behavior
+- Anomaly scoring or detection
+- Automatic diagnosis
+- Universal prompt policy recommendations
+- Cross-lane aggregation without lane disclosure
+- Promoting Tier 2 hypotheses to conclusions without sufficient evidence
+- Changing topology taxonomy
+- Changing readiness gates without explicit justification
+
+## Deferred Hypotheses Carried Forward
+
+### From Phase 3D Tier 2 (failure / intervention morphology)
+
+Registry entries, not validated. Validation deferred until corpus naturally accumulates more samples.
+
+- H-FM-001: failure/near-failure sessions enriched for retry_heavy or branchy topology
+- H-FM-002: failed sessions less likely to show branch_collapse
+- H-IM-001: human intervention acts as external correction trigger
+- H-IM-002: post-intervention traces show topology regime shifts
+- H-EV-004: failure sessions may contain silent divergence-like patterns
+- H-EV-005: human intervention may produce topology regime shifts
+
+Target: opportunistic validation when native failure >= 10, near-failure >= 10, multi-runtime failure coverage >= 3.
+
+### From Phase 3D Tier 3 (controlled benchmark / external lane)
+
+Activate when controlled benchmark protocol is operational.
+
+- H-OT-001: test failures trigger corrective branch exploration
+- H-OT-002: contradictory tool observations precede branch_collapse
+- H-EG-001: controlled benchmark lanes show lower branch entropy after task normalization
+- H-EG-002: external trajectories over-represent retry-heavy and branchy morphologies
+- H-EV-002: external tool observations may substitute for epistemic verbalization as correction triggers
+- H-EV-003: branch collapse may occur after uncertainty resolution signals
+
+### From Phase 3D Tier 4 (literature-inspired, registry-only)
+
+Maintain in registry for future corpus expansion.
+
+- H-EV-001: uncertainty verbalization may precede exploratory topology
+- H-LH-001: long-horizon tasks produce more fan-in and branch-collapse
+- H-LH-002: multi-file tasks increase root spawning and transition entropy
+
+## Operating Rules
+
+- All claims must bind to a specific corpus snapshot and lane.
+- Every percentage must include its denominator.
+- Every runtime conclusion must disclose runtime distribution.
+- Negative results are first-class entries and must not be deleted.
+- Do not promote hypotheses to conclusions without corpus-backed validation.
+- Do not enter Phase 4.
+- Do not implement prediction, anomaly detection, or automatic diagnosis.
+- Do not merge intervention lanes into the native direct-prompt baseline.
+- Do not change topology taxonomy or readiness gates unless explicitly justified.
+- Cross-lane comparison may report trends only.
+- Intervention-lane findings do not become universal policy without additional validation.
+
+## Background Processes
+
+- Intervention-aware acquisition continues (formerly Phase 3D-T2B).
+- Native lane maintained as a living baseline.
+- Tier 2 failure/intervention opportunistic validation.
+
+## Current State
+
+Phase 3E is newly opened. The first action is to activate the controlled benchmark protocol and begin lane-separated intervention comparisons. No hypotheses in the carried-forward set are yet validation-ready.