An interactive demo showing that AI agents in enterprise audit can be controlled, transparent, and measurably improvable, without model retraining or black-box infrastructure.
Built for C-Suite, Audit Leaders, and Compliance Officers.
| Step | Page | What the audience sees |
|---|---|---|
| 1 | Overview | Four pillars of reliable AI + the improvement loop |
| 2 | Run Agent | Real audit case — agent returns Medium Risk at 68% confidence |
| 3 | Trace Viewer | Every step the agent took, with tool names, inputs, outputs, confidence |
| 4 | Evidence Map | Every claim grounded to a source — missing evidence flagged explicitly |
| 5 | Human Review | Expert overrides to High Risk, proposes a rule |
| 6 | Learning Log | Rule captured instantly — no retraining |
| 7 | Agent Rerun | Same case → High Risk at 91% confidence, rule cited |
| 8 | Evaluation Dashboard | Before/after: Accuracy +28pp, Time −66% |
| 9 | Business Impact + Architecture | ₹19L+ savings per 1,000 cases, operating model |
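Steps 5 to 7 of the walkthrough (override, rule capture, rerun) can be pictured as a small rule layer applied on top of the agent's baseline output. The sketch below is illustrative only and is not taken from the demo's source: the `AuditCase` fields, the `Rule` shape, the rule id, and the `applyRules` helper are all hypothetical, and the "escalate only, never downgrade" behaviour is an assumption made for the example.

```typescript
// Hypothetical types for illustration; the demo's real data model may differ.
type RiskLevel = "Low" | "Medium" | "High";

interface AuditCase {
  id: string;
  flags: string[]; // assumed: free-form markers attached to a case
}

interface Assessment {
  risk: RiskLevel;
  citedRules: string[]; // ids of the rules that influenced the result
}

interface Rule {
  id: string;
  description: string;
  when: (c: AuditCase) => boolean; // the IF part
  then: RiskLevel;                 // the THEN part
}

const severity: Record<RiskLevel, number> = { Low: 0, Medium: 1, High: 2 };

// Apply every matching rule to a baseline assessment.
// Matching rules are cited; the risk level is only ever escalated.
function applyRules(c: AuditCase, base: Assessment, rules: Rule[]): Assessment {
  const result: Assessment = { risk: base.risk, citedRules: [...base.citedRules] };
  for (const rule of rules) {
    if (!rule.when(c)) continue;
    result.citedRules.push(rule.id);
    if (severity[rule.then] > severity[result.risk]) {
      result.risk = rule.then;
    }
  }
  return result;
}

// Usage: the same case, before and after a reviewer adds a rule.
const learningLog: Rule[] = [];
const auditCase: AuditCase = { id: "case-001", flags: ["missing-evidence"] };
const baseline: Assessment = { risk: "Medium", citedRules: [] };

const firstRun = applyRules(auditCase, baseline, learningLog);

learningLog.push({
  id: "rule-001",
  description: "IF evidence is missing THEN treat the case as High Risk",
  when: (c) => c.flags.includes("missing-evidence"),
  then: "High",
});

const secondRun = applyRules(auditCase, baseline, learningLog);
console.log(firstRun.risk, secondRun.risk, secondRun.citedRules);
```

Because the rule lives in a plain list that is consulted at run time, adding it changes the next run immediately, which is the "no retraining" property the walkthrough highlights.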
The demo is fully stateful — running the agent, submitting feedback, and rerunning all flow through shared React context. "Reset Demo" restores the initial state for repeat runs.
- Frontend: React 18 + Vite + TypeScript + Tailwind CSS v4
- Charts: Recharts
- Routing: Wouter
- Monorepo: pnpm workspaces (Node.js 24)
- Backend: Express 5 (API server — not used by the demo itself, included as workspace scaffolding)
.
├── artifacts/
│ ├── cockpit/ # The demo app (React + Vite)
│ │ └── src/
│ │ ├── pages/ # One file per demo page (9 pages)
│ │ ├── context/ # DemoContext — shared stateful demo flow
│ │ └── components/ # Layout, sidebar
│ ├── api-server/ # Express API server (workspace scaffolding)
│ └── mockup-sandbox/ # Component preview server (workspace tooling)
├── lib/ # Shared TypeScript libraries
├── scripts/ # Utility scripts
├── pnpm-workspace.yaml # Workspace config + catalog pins
├── tsconfig.base.json # Shared strict TS config
└── tsconfig.json # Solution file (libs only)
- Node.js 24+
- pnpm 9+
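Install dependencies and start the demo app:

- `pnpm install`
- `pnpm --filter @workspace/cockpit run dev`

Open http://localhost:5173 in your browser.

Other commands:

- Run the API server (optional): `pnpm --filter @workspace/api-server run dev`
- Typecheck the workspace: `pnpm run typecheck`

The demo uses a shared DemoContext with four states: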
initial → ran_initial → feedback_submitted → reran
- Run Agent page: click "Run Initial Agent" → transitions to `ran_initial`
- Human Review page: click "Submit Feedback" → transitions to `feedback_submitted`
- Run Agent page: the "Rerun After Feedback" button unlocks; click it → transitions to `reran`
- All other pages (Trace Viewer, Evidence Map, Learning Log, Evaluation, etc.) react to the current state
Click Reset Demo on the Run Agent page to return to initial.
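The flow above can be sketched as a small pure transition function. This is a minimal illustration, not the actual DemoContext code: the state names come from the list above, while the action names (`RUN_INITIAL`, `SUBMIT_FEEDBACK`, `RERUN`, `RESET`) and the helpers `nextState` and `canRerun` are assumptions introduced for the example.

```typescript
// State names are taken from the README; action names are hypothetical.
type DemoState = "initial" | "ran_initial" | "feedback_submitted" | "reran";
type DemoAction = "RUN_INITIAL" | "SUBMIT_FEEDBACK" | "RERUN" | "RESET";

// Return the next state for an action. Actions that are not valid in the
// current state leave the state unchanged; RESET always returns to "initial".
function nextState(state: DemoState, action: DemoAction): DemoState {
  if (action === "RESET") return "initial";
  switch (state) {
    case "initial":
      return action === "RUN_INITIAL" ? "ran_initial" : state;
    case "ran_initial":
      return action === "SUBMIT_FEEDBACK" ? "feedback_submitted" : state;
    case "feedback_submitted":
      return action === "RERUN" ? "reran" : state;
    case "reran":
      return state;
  }
}

// The "Rerun After Feedback" button is only unlocked once feedback exists.
function canRerun(state: DemoState): boolean {
  return state === "feedback_submitted";
}

// Usage: walk the full demo flow, then reset.
let current: DemoState = "initial";
for (const action of ["RUN_INITIAL", "SUBMIT_FEEDBACK", "RERUN"] as DemoAction[]) {
  current = nextState(current, action);
}
console.log(current);                     // state after the full flow
console.log(nextState(current, "RESET")); // state after Reset Demo
```

A function of this shape could be passed to React's `useReducer` inside a context provider, so every page reads the same state; whether the demo is wired that way is not confirmed by this README.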
| Audience | Focus |
|---|---|
| C-Suite / CFO | Business Impact → Evaluation Dashboard → skip Architecture |
| Audit Leaders | Evidence Map → Learning Log → Audit Trail metric |
| Compliance Officers | Trace Viewer → Architecture governance layers |
| Technical | Trace Viewer (tool calls, confidence scores, IF-THEN rule syntax) |
| 5-minute version | Run Agent → Trace Viewer → Human Review → Rerun → Metrics |
- "Traceability is the foundation." — Trace Viewer shows what it did. Evidence Map shows where it got information.
- "Explainability is proof." — We don't just say High Risk. We explain why.
- "Review is control." — The human decides. The agent suggests.
- "Learning is improvement." — Each correction becomes a rule for future cases.
- "Metrics are accountability." — We measure. We improve. We report.
Internal demo — not for public distribution.