AX: Agent Experience initiative — tracking issue

## Tracking issue — Agent Experience (AX) initiative

Following the v0.67.118–125 high-velocity sprint, we have a structured way to discover AX bottlenecks: `scripts/stall_log_mine.py` (74a6b6d6) mines Claude Code JSONL transcripts for confusion signals and produces a friction-ranked report.

This issue tracks the AX programme as a whole.

## Programme goals

1. **Reduce agent friction on the top-N hot zones** by ~50%, as measured by re-running `stall_log_mine.py` quarterly.
2. **Maintain the HTMX/Alpine/Fragment substrate** as the empirical baseline. Stack-comparison work is deferred; longitudinal "this codebase vs itself last quarter" tells us if AX work is moving the number.
3. **Promote successful patterns** into CLAUDE.md as durable guidance.

## Child issues

- [x] #1063 — Bump-cycle formatter race (shipped v0.67.126 + v0.67.127)
- [x] #1064 — `renderer.py` decomposition — **complete, 3,784 → 364 lines (-90%) across 9 PRs (v0.67.136–v0.67.144)**
- [x] #1065 — `region_adapter.py` decomposition — complete, 2,871 → 164 lines (-94%) across 8 PRs (v0.67.128–v0.67.135)
- [x] #1066 — `route_generator.py` section-map docstring (shipped v0.67.126)
- [x] #1067 — forward-pointer comments on transit-point files (shipped v0.67.126)

## All children closed ✓

5 of 5 closed across **27 versions** (v0.67.118 → v0.67.144). The two large decompositions (#1064 + #1065) established the canonical mixin-per-family pattern — two reference implementations now exist, one with dict-based dispatch (region_adapter) and one with match-based dispatch (renderer).

## Validation

The next monthly `scripts/stall_log_mine.py` sweep will quantify the friction-score impact. Expected changes vs. the April baseline:

- `pyproject.toml` / `CLAUDE.md` / `homebrew` / `CHANGELOG` (#1063) → out of top-5.
- `region_adapter.py` (#1065) → off the friction list (now 8 smaller files).
- `renderer.py` (#1064) → off the friction list (now 9 smaller files).
- `route_generator.py` (#1066) → ~50% drop in repeat-read count.

This issue can be closed once that sweep confirms the predictions. Until then, keep it open as the measurement anchor.

## Measurement protocol

- Re-run `python scripts/stall_log_mine.py --days 30` at end of each month.
- Diff the friction-ranked top-15 month-over-month.
- A row dropping out of the top-15 = confirmed win. A row climbing = regression or new touch-zone.
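The month-over-month diff could be automated along these lines. This is a minimal sketch, assuming each month's report can be reduced to a list of file paths ordered from most to least friction. The function name `diff_top_n` and that input shape are assumptions, not the actual output format of `stall_log_mine.py`.

```python
def diff_top_n(previous: list[str], current: list[str], n: int = 15) -> dict[str, list[str]]:
    """Compare two friction-ranked lists (most friction first) over their top n rows."""
    prev_top, curr_top = previous[:n], current[:n]
    prev_rank = {path: i for i, path in enumerate(prev_top)}
    curr_rank = {path: i for i, path in enumerate(curr_top)}

    return {
        # In last month's top n, absent this month: confirmed win.
        "dropped_out": [p for p in prev_top if p not in curr_rank],
        # Absent last month, present now: new touch-zone.
        "new_entries": [p for p in curr_top if p not in prev_rank],
        # Present both months but ranked higher now: possible regression.
        "climbed": [
            p for p in curr_top
            if p in prev_rank and curr_rank[p] < prev_rank[p]
        ],
    }
```

Note that a row can move up a place simply because a row above it dropped out, so entries under `climbed` are worth checking against their raw scores before being treated as regressions.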

## Limitations of the current data

1. **Sample bias**: the window is dominated by recent high-velocity work, so the patterns reflect what was being worked on at the time.
2. **Heuristic weights**: `repeat_reads + 3·edit_failures + 5·errors` is a guess.
3. **Doesn't catch all stalls**: agents giving up and asking the user, or stalling on a non-tool reasoning loop, leave no signal in the transcript tool stream.
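To make the second limitation concrete, the heuristic can be written with the weights exposed as parameters, so it is easy to see how sensitive the ranking is to them. This is a sketch under stated assumptions: only the formula `repeat_reads + 3·edit_failures + 5·errors` comes from this issue, while the `FileSignals` record and the function names are hypothetical and do not describe the internals of `stall_log_mine.py`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FileSignals:
    """Hypothetical per-file counts of the three signals in the heuristic."""
    path: str
    repeat_reads: int
    edit_failures: int
    errors: int


def friction_score(s: FileSignals, w_edit: int = 3, w_error: int = 5) -> int:
    """repeat_reads + 3*edit_failures + 5*errors, with the weights adjustable."""
    return s.repeat_reads + w_edit * s.edit_failures + w_error * s.errors


def rank(signals: list[FileSignals], w_edit: int = 3, w_error: int = 5) -> list[str]:
    """Return file paths ordered from highest to lowest friction score."""
    ordered = sorted(
        signals,
        key=lambda s: friction_score(s, w_edit, w_error),
        reverse=True,
    )
    return [s.path for s in ordered]


# Made-up numbers: one file that is re-read a lot, one that fails rarely but hard.
sample = [
    FileSignals("a.py", repeat_reads=10, edit_failures=0, errors=0),
    FileSignals("b.py", repeat_reads=0, edit_failures=1, errors=1),
]
print(rank(sample))              # ['a.py', 'b.py']  (scores 10 vs 8)
print(rank(sample, w_error=10))  # ['b.py', 'a.py']  (scores 10 vs 13)
```

Doubling the error weight flips the order of the two sample files, which is why a rank change on its own should be read alongside the raw counts.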
