docs(research): lock artifact-mechanics investigation PLAN (descriptive-only, no claim)

JohnCCarter · claude · JohnCCarter · commit 70174df63b41 · 2026-06-24T13:46:53.000+02:00
Door (i) from the campaign checkpoint: a small, low-risk plan to explain the
MECHANICS behind the artifact-probe (4H reached-legs less clean; 4H snapping lowers
cleanliness; snapping flips sign on 1D). Descriptive-only on frozen data — issues NO
verdict, NO claim, changes no lock, resolves nothing about the crux.

Feasibility: clean (deterministic from frozen data + locked detection; no new
universe, no matched-null, no arbitrary frame) but NOT answerable from existing
aggregates → needs a small descriptive pass recording per-leg {span_bars,
magnitude_atr, snap_span_delta}. Headline descriptive object = the 4H&lt;-&gt;1D
snap_span_delta asymmetry (detector granularity vs human-anchoring precision), NOT
the partly-arithmetic span&lt;-&gt;cleanliness correlation. Pre-locked stats (single
Spearman + single median split, fixed confound set) + population guard (M1
reached/unreached != Stage-2 lead) + marginal-gap caveat (surfacing CI upper
-0.00095).

No code/run/build here. No matched-null, no new candidate universe, no new model
feature, no Genesis, no 1H, no ETH, no data-refresh, no edge/behaviour claim.
Execution (Commit 2) needs a separate explicit GO.

Co-Authored-By: Claude Opus 4.8 &lt;noreply@anthropic.com&gt;
diff --git a/docs/research_wiki/handoff.md b/docs/research_wiki/handoff.md
@@ -152,6 +152,17 @@ legs/ranges* (labels = facit; **no edge/behaviour/backtest/PnL/Genesis/auto-fib
   behaviour/Genesis/1H/ETH. [Results](reviews/btc-fib-selection-learning-artifact-results-20260624.md);
   summary + `cells/*.json` gitignored/regenerable. Re-run (deterministic, frozen data, no `--refresh`):
   `PYTHONUNBUFFERED=1 uv run --no-sync python -u -m fibengine.research.selection_learning_artifact --artifact`.
+- **2026-06-24 Fib SELECTION-LEARNING artifact-MECHANICS investigation — PLAN locked (docs-only),
+  RUN PENDING separate GO.** Door (i) from the checkpoint: explain the *mechanics* behind the
+  artifact-probe (why 4H reached-legs less clean; why 4H snapping lowers cleanliness; why snapping
+  flips sign on 1D), **descriptive-only on frozen data — NO verdict, NO claim, no lock change**. Plan
+  [artifact-mechanics PLAN](reviews/btc-fib-selection-learning-artifact-mechanics-plan-20260624.md):
+  feasible cleanly (deterministic from frozen data + locked detection) but NOT answerable from existing
+  aggregates → needs a small descriptive pass recording per-leg {span_bars, magnitude_atr,
+  snap_span_delta}. Headline object = the **4H↔1D `snap_span_delta` asymmetry** (detector granularity
+  vs human-anchoring precision), not the partly-arithmetic span↔cleanliness correlation. Pre-locked
+  stats (P3) + population guard (M1 reached/unreached ≠ Stage-2 lead) + marginal-gap caveat. No
+  matched-null/new universe/Genesis/1H/ETH/refresh. Commit 2 needs a separate GO.
 
 **Next work requires a separate explicit GO. No W/gap, no Stage 1, no new sensitivity, and no Genesis
 may be started automatically.** Parked (test-only, separate GO): lock the facit-discipline refusal
diff --git a/docs/research_wiki/reviews/btc-fib-selection-learning-artifact-mechanics-plan-20260624.md b/docs/research_wiki/reviews/btc-fib-selection-learning-artifact-mechanics-plan-20260624.md
@@ -0,0 +1,94 @@
+# BTC Fib Selection-Learning — artifact-probe MECHANICS investigation PLAN (2026-06-24)
+
+**Lean Fib Research. Research-only. DESCRIPTIVE-ONLY — produces NO verdict, NO claim, no
+edge/behaviour/PnL/backtest/Genesis/auto-fib. Docs-only plan (no code/run authorised here).** This is
+the small, low-risk **investigate** plan named as door (i) in the
+[campaign checkpoint](btc-fib-selection-learning-checkpoint-20260624.md), to explain the *mechanics*
+behind the [artifact-probe result](btc-fib-selection-learning-artifact-results-20260624.md)
+(`1573b56`). It does **not** test the crux, change any lock, or add any positive claim. Execution
+needs a **separate explicit GO**.
+
+## P0. Question (mechanical, descriptive — three observations to explain)
+
+On the artifact-probe's existing frozen data (no refresh):
+
+1. **4H — reached legs are *less* clean than unreached** (0.743 vs 0.799; gap −0.0557).
+2. **4H — snapping anchors to detector pivots *lowers* cleanliness** (snapped − exact = −0.0219).
+3. **1D — snapping *flips* to inflation** (+0.0222), opposite sign to 4H.
+
+These are already locked as **"investigate, not a finding"** (artifact LOCK A7). This plan asks only
+**WHY, mechanically** — it issues **no verdict** and the artifact-probe's reading is unchanged.
+
+## P1. Can it be tested cleanly on existing data? (feasibility — YES, with one caveat)
+
+- **YES, methodically clean.** Every quantity needed is a **deterministic function of the already-frozen
+  data + the locked detection** (`anchor_b + k=3`, frozen config) — no new candidate universe, no
+  matched-null, no arbitrary frame, no new model feature. It is a **descriptive decomposition** of
+  per-leg quantities the artifact-probe already computes internally.
+- **Caveat:** it **cannot** be answered from the *existing artifacts* (summary.json holds only
+  aggregates — `clean_reached`, `clean_unreached`, the snap gaps). The per-leg quantities (span length,
+  magnitude, snapped-vs-exact span) are not persisted. So execution needs a **small descriptive pass**
+  that records them — a build+run on the **same frozen data**, low-risk, no new arbitrary choice.
+
+## P2. Mechanical hypotheses (pre-stated — to keep the decomposition honest)
+
+- **M1 (reached less clean = a size/length confound):** the detector reconstructs **larger / longer**
+  swings (it needs fractal structure + `min_prominence_atr=0.5`); longer swings carry more intermediate
+  retracement → lower `cleanliness`. Unreached legs (detector misses) are shorter/smaller → straighter →
+  higher `cleanliness`. Prediction: reached legs have **longer span / larger magnitude**, and the
+  (**marginal** — surfacing CI upper was −0.00095) cleanliness gap **attenuates** when conditioning on
+  span. *M1 explains a marginal gap; the note must not retroactively harden the surfacing result.*
+- **M2 (4H snapping lowers cleanliness):** detector pivots sit at the **fuller** swing extremes; the
+  human anchors **inside** them, so snapping **extends** the bar-index span and `cleanliness = net/path`
+  drops. **Important — the bare correlation `snap_span_delta ~ (snapped − exact)` is partly arithmetic
+  by construction (changing the span changes net/path mechanically), so it is NOT the descriptive
+  object.** The genuinely informative, non-trivial fact is **M3's cross-TF asymmetry** below.
+- **M3 (the real descriptive object — cross-TF asymmetry in `snap_span_delta`):** on **4H** detector
+  pivots systematically sit **outside** the human anchors (`snap_span_delta > 0`, extension lands in
+  retracement → cleanliness down); on **1D** they do **not** (`snap_span_delta ≤ 0` → cleanliness up).
+  That asymmetry — **detector granularity vs human-anchoring precision differing by TF** — is the real
+  content explaining the sign flip, reported as the headline, **not** "span extension lowers
+  cleanliness" dressed as a discovery.
+
+## P3. Pre-locked statistics (anti-forking-paths — fixed BEFORE any number)
+
+- **Per-leg quantities (descriptive):** `span_bars = |pos_b − pos_a|`, `magnitude_atr` (net move ÷
+  causal ATR at `anchor_b`), `snap_span_delta` (bars), all on the frozen data / locked detection.
+- **Reported as:** distributions (median + IQR) for reached vs unreached (M1); a **single Spearman
+  correlation** `cleanliness ~ span_bars` and a **single median split** on `span_bars` (M1 attenuation);
+  and — the headline — the **per-TF sign + distribution of `snap_span_delta`** to expose the 4H↔1D
+  **asymmetry** (M3). The bare `snap_span_delta ~ (snapped − exact)` correlation may be shown for
+  completeness but is flagged **partly arithmetic, not the finding**. **Fixed confound set =
+  {span_bars, magnitude_atr}** — no tuned bins, no cherry-picked controls.
+- **NO bootstrap verdict, NO CI-based decision, NO new locked verdict** — purely descriptive. The output
+  is a **mechanics note**, not a result with a verdict.
+
+## P4. Discipline / non-claims (binding)
+
+- **Descriptive-only.** Issues **no verdict**, adds **no positive claim**, and does **not** change the
+  artifact-probe reading (still "investigate, not a finding") or any lock.
+- Does **not** resolve the cleanliness crux (stays OPEN); a clean mechanical explanation of the gap is
+  **not** evidence either for or against "genuine human signal."
+- **Population guard (binding):** M1's reached-vs-unreached contrast is detector-**reached** vs
+  detector-**missed** human legs — a **different population** from the Stage-2 cleanliness lead
+  (human-matched vs non-human candidates, both *inside* the detector universe). "Span explains the
+  reached/unreached gap" must **NOT** bleed into "span explains/dissolves the Stage-2 lead" — different
+  populations, different (un-made) claim.
+- **No matched-null, no new candidate universe, no new model feature, no Genesis, no auto-fib-as-truth,
+  no 1H, no ETH, no label/corpus mutation, no `data.fetch --refresh`** (frozen-data parity).
+
+## P5. Execution sketch (Commit 2 — NOT executed here; needs a separate GO)
+
+- A small **descriptive pass** recording per-leg `{span_bars, magnitude_atr, snap_span_delta}` +
+  the P3 summaries — either a `--artifact-mechanics` descriptive mode on
+  `selection_learning_artifact.py` or a tiny sibling; **no code into byte-capped
+  `selection_learning.py`**. Reuses the existing rows / detection / ε / frozen data.
+- Tests for the new descriptive quantities; a **mechanics note**
+  (`btc-fib-selection-learning-artifact-mechanics-20260624.md`, descriptive, no verdict). Artifacts
+  under `experiments/review/fib_selection_learning/artifact/` (**gitignored**).
+
+## P6. What this plan does NOT do
+
+No code, no run, no build, no verdict, no claim, no lock change, no matched-null, no new universe, no
+push beyond this plan doc. Execution requires a **separate explicit GO**; **halt and report** if, at
+build time, any pre-locked statistic (P3) or the descriptive-only discipline (P4) is found unclear.