Commit d544cb0
authored
ops(e2e): flip IS-060 datamarking to ACTIVE mode for ASR-delta collection (#220)
Stage 2 of the IS-060 PR-2 validation chain. Shadow mode validated
2026-05-15 via workflow_dispatch run 25937040210: 85/91 scenarios
passed, zero errors, zero 4xx-rate shift vs prior nightly, transform
ran cleanly through the full e2e harness with zone_detection +
boundary_defense + datamarking shadow active.
This flip enables marker substitution against the real upstream
(Gemini 2.0 Flash via OpenRouter):
request → zone-detect → boundary-wrap → DATAMARK (active)
→ forward upstream WITH marker-substituted Data zones
The system-reminder addendum tells the upstream model the marked
text is data, not instructions.
Acceptance criteria for the 3-5 nightly cycles that follow this merge:
- Pass rate stays in the 84-88 / 91 range (matches the recent
per-day Gemini variance band; not a regression metric on its own)
- llmtrace_spotlighting_byte_delta_total > 0 (transform fires)
- llmtrace_spotlighting_failures_total == 0
- llmtrace_spotlighting_marker_collision_total stays bounded
- upstream_fell_for_it rate on indirect_injection family scenarios
shifts DOWNWARD relative to the pre-datamarking baseline (the
ASR delta — the headline metric)
Evidence will be committed to
docs/research/results/upstream_judge_datamarking_evidence_<date>.md
after 3-5 cycles of post-active-mode data.
Refs: #214 (PR-2 implementation), #213 (PR-3 corpus), #216
(shadow-mode enable), #219 (zone_detection enable), workflow_dispatch
run 25937040210 (shadow-mode validation evidence).1 parent 685e10b commit d544cb0
1 file changed
Lines changed: 10 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
69 | | - | |
70 | | - | |
71 | | - | |
72 | | - | |
73 | | - | |
74 | | - | |
75 | | - | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
76 | 78 | | |
77 | 79 | | |
78 | 80 | | |
79 | 81 | | |
80 | 82 | | |
81 | 83 | | |
82 | | - | |
| 84 | + | |
83 | 85 | | |
84 | 86 | | |
85 | 87 | | |
| |||
0 commit comments