feat(intent): run commit-intent debugger on real git history + doc accuracy pass (v1.1.89)
IntentDebugger was a fixture-only prototype. It now reads real commits:
- new Tauri command list_commit_intents(repo_path, limit) parses
`git log --numstat` with control-char separators, classifies each
file's surface + agent-vs-human authorship, and derives a test-evidence
signal per commit
- IntentDebugger.tsx gains a repo picker, shows the real commit subject as
the card title (previously dropped), and keeps fixtures as a browser
fallback
- gate the "agent-authored UI change" risk on uiFileCount>0 so non-UI
agent commits get a generic intent-check risk instead of a false UI flag
- reachable via links on the Roadmap page (no new top-nav tab)
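The parsing step in the first bullet can be sketched as below. This is a minimal TypeScript illustration, not the Rust command itself: the exact `--format` string, the choice of `0x1e`/`0x1f` as separators, and the `CommitRecord`/`FileStat` shapes are all assumptions, since the commit only says "control-char separators".

```typescript
// Hypothetical shapes; the real command's return type is not shown in this commit.
interface FileStat {
  path: string;
  added: number | null; // null when git reports "-" (binary file)
  deleted: number | null;
}

interface CommitRecord {
  hash: string;
  author: string;
  email: string;
  subject: string;
  files: FileStat[];
}

const RECORD_SEP = "\x1e"; // assumed: one per commit
const FIELD_SEP = "\x1f"; // assumed: between header fields

// Assumed invocation that produces `raw`:
//   git log --numstat -n <limit> --format=%x1e%H%x1f%an%x1f%ae%x1f%s
function parseNumstatLog(raw: string): CommitRecord[] {
  return raw
    .split(RECORD_SEP)
    .filter((chunk) => chunk.trim().length > 0)
    .map((chunk) => {
      const lines = chunk.split("\n");
      const header = lines[0] ?? "";
      const [hash = "", author = "", email = "", subject = ""] =
        header.split(FIELD_SEP);
      // numstat rows look like "<added>\t<deleted>\t<path>"
      const files = lines
        .slice(1)
        .filter((line) => line.includes("\t"))
        .map((line): FileStat => {
          const [added = "-", deleted = "-", ...pathParts] = line.split("\t");
          return {
            path: pathParts.join("\t"),
            added: added === "-" ? null : Number(added),
            deleted: deleted === "-" ? null : Number(deleted),
          };
        });
      return { hash, author, email, subject, files };
    });
}
```

Control characters are a convenient choice here because they cannot collide with commas, pipes, or quotes in a commit subject, so the header splits without any escaping logic.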
Docs accuracy pass:
- README gaps table reflects real state (synthetic-QA + intent debugger
are no longer "not implemented")
- landing page corrected to Astro (deployed dir is apps/landing-page-astro,
not the legacy Next.js one); CI section documents auto-release.yml
- dropped the orphaned "2022 Themesberg" license note (no such code in repo);
fixed the Tauri v1 -> v2 prerequisites link
- agents.md nav list updated to the 8 tabs + URL-only surfaces
Verification: cargo test (classify_surface/classify_author) + tsc + eslint
+ test:intent-debugger all green; report quality checked against this
repo's real commits.
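For orientation, the classification and risk gating exercised by those tests might look roughly like this in TypeScript. Everything here is an assumption made for illustration: the real `classify_surface`/`classify_author` live in Rust and their rules are not shown in this commit, the extension lists and trailer pattern are guesses, and the generic risk label is hypothetical. Only the "agent-authored UI change" label and the `uiFileCount > 0` gate come from the message above.

```typescript
type Surface = "ui" | "test" | "docs" | "backend" | "other";
type Author = "agent" | "human";

// Hypothetical path heuristics; test paths are checked first so that
// e.g. `Foo.test.tsx` is not counted as a UI file.
function classifySurface(path: string): Surface {
  const p = path.toLowerCase();
  if (/(\.test\.|\.spec\.|(^|\/)tests?\/)/.test(p)) return "test";
  if (/\.(md|mdx|txt)$/.test(p)) return "docs";
  if (/\.(tsx|jsx|css|scss|astro|html)$/.test(p)) return "ui";
  if (/\.(rs|ts|js|py|go|sql)$/.test(p)) return "backend";
  return "other";
}

// Hypothetical authorship heuristic: look for an agent co-author trailer.
function classifyAuthor(commitMessage: string): Author {
  const agentTrailer =
    /^co-authored-by:.*(claude|codex|copilot|noreply@anthropic\.com)/im;
  return agentTrailer.test(commitMessage) ? "agent" : "human";
}

// Assumed test-evidence signal: did the commit touch any test file?
function hasTestEvidence(paths: string[]): boolean {
  return paths.some((p) => classifySurface(p) === "test");
}

// The gate described in the commit: only raise the UI-specific risk when
// at least one UI file changed; otherwise fall back to a generic risk.
function authorshipRisk(author: Author, paths: string[]): string | null {
  if (author !== "agent") return null;
  const uiFileCount = paths.filter((p) => classifySurface(p) === "ui").length;
  return uiFileCount > 0
    ? "agent-authored UI change"
    : "agent-authored change: check intent"; // hypothetical generic label
}
```

With a gate like this, an agent commit that only touches `src-tauri/src/lib.rs` no longer picks up the UI flag, which is the false positive the commit describes.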
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
README.md: 16 additions & 15 deletions
@@ -23,10 +23,10 @@ The near-term wedge is not beating Claude, Codex, or hosted PR bots at generic r
 |---|---|---|
 | Code review | Review tab runs local diffs through CLI agents and persists findings. | Needs multi-pass specialist review, better AGENTS.md/project-context ingestion, and benchmarked catch-rate evidence. |
 | Bug finding | Findings, severity, code viewer, and re-review loop exist. | Needs runtime evidence from tests/browser sessions/logs, not only static diff judgment. |
-| Agent-written code verification | Product is aimed at agent output and can fix/re-review selected findings. | Needs agent provenance: which agent changed what, prompt/task context, and whether the fix actually resolved the original user goal. |
+| Agent-written code verification | Aimed at agent output; fixes/re-reviews selected findings and emits a full verification handoff proof (`review-proof` + `agent-fix-packet`: per-finding evidence, fixed/reproduced/unchecked tallies, and a copyable reviewer handoff). | Needs to close the intent loop: did the fix actually resolve the original user goal, and which agent/prompt produced the change. |
 | Debugging/replay | History indexes Claude/Codex sessions and can replay conversations. | Replay is not connected to files, diffs, failures, screenshots, tests, or review findings. |
-| Synthetic user QA | Not implemented as a first-class workflow. | Needs browser/app automation that performs user tasks, captures screenshots/traces, and converts failures into review findings. |
-| AI step-through debugger | Not implemented. | Needs an execution timeline across agent actions, file edits, commands, test failures, and UI observations. |
+| Synthetic user QA | Prototype — `QaReplay` (`/qa-replay`, linked from Roadmap) runs fixture-backed synthetic-QA loops with a live agent-runner track. | Needs real browser/app automation that drives the actual product, captures screenshots/traces, and converts failures into review findings. |
+| AI step-through debugger | Commit-intent debugger (`/intent-debugger`, linked from Roadmap) now runs over **real** recent commits — pick a repo, and it infers intent, risks, verification gaps, and agent-vs-human authorship per commit. | Still per-commit static analysis; needs a full execution timeline across agent actions, file edits, commands, test failures, and UI observations. |
 | Codebase history explainer | Repo Unpacked generates repo briefs; History indexes agent sessions. | Needs commit/decision mining tied to touched files so reviews can catch intent regressions. |
 
 The product should prefer narrow, evidence-backed loops over broad "code intelligence" surfaces. A feature is on-strategy when it helps answer: "What changed, why did the agent change it, what could break, can we reproduce it, and did the fix actually work?"
@@ -36,11 +36,11 @@ The product should prefer narrow, evidence-backed loops over broad "code intelli
-| CI/CD | GitHub Actions — `release.yml` builds Tauri binaries on GitHub release; `deploy-landing.yml` deploys the landing page to Cloudflare Pages on push to `main` |
+| CI/CD | GitHub Actions — `auto-release.yml` cuts a `v<version>` release when `apps/desktop/src-tauri/tauri.conf.json`'s version changes on `main`, which dispatches `release.yml` to build/sign/upload the Tauri binaries; `deploy-landing.yml` deploys the landing page to Cloudflare Pages on push to `main` |
 
 ## Installation
 
@@ -66,7 +66,7 @@ cd CodeVetter
 npm install
 ```
 
-> Requires [Rust + Tauri prerequisites](https://tauri.app/v1/guides/getting-started/prerequisites) for the desktop app.
+> Requires the [Rust + Tauri 2 prerequisites](https://v2.tauri.app/start/prerequisites/) for the desktop app.
 
 ## Quick Start
 
@@ -77,32 +77,33 @@ npm install
 ```
 3. Open the Review tab, pick a local repository, and run your first review through an installed CLI agent.
 
-## Usage Examples
+## Common Tasks
 
-**Run the desktop app (dev mode)**
+**Build a production desktop binary**
 ```bash
 cd apps/desktop
-npm run tauri:dev
+npm run tauri:build
 ```
 
-**Run Playwright end-to-end tests for the desktop app**
+**Run the Playwright end-to-end suite**
 ```bash
 cd apps/desktop
 npm test
 ```
 
 **Build the landing page**
 ```bash
-cd apps/landing-page
+cd apps/landing-page-astro
 npm run build
 ```
 
 ## Monorepo Structure
 
 ```
 apps/
-  desktop/        Tauri 2 + React 19 + Vite desktop app — the core product
-  landing-page/   Next.js marketing site (static export, deployed to Cloudflare Pages — codevetter.com)
+  desktop/             Tauri 2 + React 19 + Vite desktop app — the core product
+  landing-page-astro/  Astro marketing site (static export, deployed to Cloudflare Pages — codevetter.com)
+  landing-page/        Legacy Next.js marketing site — superseded by landing-page-astro, no longer deployed
 - **Tauri IPC**: all Rust commands called via typed wrappers in `src/lib/tauri-ipc.ts` → `invoke()` → `src-tauri/src/commands/`.
 - **`isTauriAvailable()` guard**: all IPC calls wrapped so React code also works in plain browser.
 - **FIXED**: Dead `@code-reviewer/*` workspace deps removed — `packages/` dir no longer exists and is no longer referenced. Build passes.
-- **Active screens**: Dashboard (usage/token analytics), History (session search), Review (`/review` — AI code review with diff + fix), Repo Unpacked (`/unpack` — whole-repo evidence-backed system brief, scanner in `src-tauri/src/commands/unpack.rs`, page in `apps/desktop/src/pages/RepoUnpacked.tsx`, persisted to `repo_unpacked_reports` table). Other tabs (Board, Workspaces) are legacy — do not invest in them.
-- **GH Actions**: `ci.yml` runs lint + Playwright; `release.yml` builds platform binaries and uploads to GitHub Releases.
+- **Nav (8 tabs)**: Home (`/` — usage/token analytics + session history), Review (`/review` — AI code review with diff + fix), Roadmap (`/roadmap` — shipped/verification telemetry dashboard), Unpack (`/unpack` — whole-repo evidence-backed system brief; scanner in `src-tauri/src/commands/unpack.rs`, page in `apps/desktop/src/pages/RepoUnpacked.tsx`, persisted to `repo_unpacked_reports` table), Intel (`/intel`), Fleet (`/fleet` — SaaS Maker fleet projects + repo↔project linking), T-Rex (`/trex`), Settings (`/settings` — also hosts Ops, Memories, Rubrics, usage, about).
+- **URL-only surfaces** (reachable but intentionally off the top nav after the v1.1.86 declutter): Rubrics (`/rubrics`, linked from Review), IntentDebugger (`/intent-debugger` — commit-intent analysis over real git commits), QaReplay (`/qa-replay` — synthetic-QA fixture/live runner). The old Ask/Personas tabs and their Rust backend were removed in v1.1.87.
+- **GH Actions**: `ci.yml` runs lint + Playwright; `auto-release.yml` cuts a `v<version>` release on `tauri.conf.json` version bump → dispatches `release.yml` to build/sign/upload binaries; `deploy-landing.yml` deploys `apps/landing-page-astro` to Cloudflare Pages.
 - Husky pre-commit runs lint-staged on `apps/desktop/src/**/*.{ts,tsx}`; pre-push hook also configured.
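The typed-wrapper plus `isTauriAvailable()` pattern named in the context lines above, combined with the fixture fallback from the commit message, could be sketched as follows. This is not the contents of `src/lib/tauri-ipc.ts`: the `InvokeFn` type, the `CommitIntent` shape, the fixture data, and detection via `__TAURI_INTERNALS__` are assumptions. `invoke` is passed in as a parameter so the sketch runs without the Tauri runtime; in the app it would presumably be the `invoke` exported by `@tauri-apps/api/core`.

```typescript
// Stand-in for Tauri's `invoke`; injected so the sketch is self-contained.
type InvokeFn = <T>(cmd: string, args?: Record<string, unknown>) => Promise<T>;

// Hypothetical result shape; the real report carries more fields.
interface CommitIntent {
  hash: string;
  subject: string;
}

// Assumption: the Tauri 2 webview exposes `__TAURI_INTERNALS__` on its global
// object, so its absence means the code is running in a plain browser.
function isTauriAvailable(scope: object = globalThis): boolean {
  return "__TAURI_INTERNALS__" in scope;
}

// Hypothetical fixture used as the browser fallback.
const FIXTURE_INTENTS: CommitIntent[] = [
  { hash: "fixture-1", subject: "fixture commit" },
];

// Typed wrapper: callers never touch `invoke` or command names directly.
async function listCommitIntents(
  invoke: InvokeFn,
  repoPath: string,
  limit: number,
  available: boolean = isTauriAvailable(),
): Promise<CommitIntent[]> {
  if (!available) return FIXTURE_INTENTS;
  // Assumes Tauri's default camelCase -> snake_case argument mapping
  // (repoPath -> repo_path on the Rust side).
  return invoke<CommitIntent[]>("list_commit_intents", { repoPath, limit });
}
```

Keeping the guard inside the wrapper means page components such as `IntentDebugger.tsx` can call one function in both environments and never branch on the runtime themselves.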