vicarious11
diff --git a/README.md b/README.md
Lines changed: 203 additions & 115 deletions
@@ -1,161 +1,249 @@
-# agenttop
+<p align="center">
+  <img src="assets/logo.png" alt="agenttop" width="120">
+</p>
 
-`htop` for AI coding agents.
+<h1 align="center">agenttop</h1>
 
-```bash
-git clone https://github.com/vicarious11/agenttop && cd agenttop && ./setup.sh
-./run.sh    # localhost:8420
-```
+<p align="center">
+  <b>See where your AI coding tokens and money actually go.</b>
+  <br>
+  <sub>htop for Claude Code, Cursor, Kiro, Codex, and Copilot.</sub>
+</p>
 
-![agenttop dashboard](assets/screenshots/optimizer.png)
+<p align="center">
+  <a href="#install">Install</a> ·
+  <a href="#what-you-see">Screenshots</a> ·
+  <a href="#features">Features</a> ·
+  <a href="#how-it-works">How it works</a> ·
+  <a href="#ai-analysis">AI Analysis</a> ·
+  <a href="#architecture">Architecture</a>
+</p>
 
-Monitors **Claude Code**, **Cursor**, **Kiro**, **Codex**, **Copilot**. Reads the local files they already write (`~/.claude/`, `~/.cursor/`, etc). Read-only. Nothing leaves your machine.
+<p align="center">
+  <img src="https://img.shields.io/badge/python-3.10%2B-blue" alt="Python">
+  <img src="https://img.shields.io/github/license/vicarious11/agenttop" alt="License">
+  <img src="https://img.shields.io/badge/tools-5%20supported-green" alt="Tools">
+  <img src="https://img.shields.io/badge/telemetry-zero-brightgreen" alt="No telemetry">
+</p>
 
-## what it does
+---
 
-- unified dashboard across all your AI coding tools
-- every session, every prompt, every token, every dollar — one place
-- search sessions by project, sort by cost, view full prompt history
-- AI analysis: scores you 0-100 on session hygiene, prompt quality, cost efficiency, cache usage, tool utilization
-- cost forensics: spend by project, by model, estimated waste from marathon sessions
-- detects anti-patterns: correction spirals, context blowup, repeated prompts, model overkill
+![agenttop web dashboard](assets/screenshots/optimizer.png)
 
-## install
+> Every AI coding tool stores usage data locally — JSONL logs, SQLite databases, workspace state — but none of them show you the full picture. agenttop reads all of it, normalizes it, and gives you a real-time dashboard with AI-powered analysis. 5 tools. One view. Nothing leaves your machine.
+
+---
+
+## Install
 
 ```bash
 git clone https://github.com/vicarious11/agenttop && cd agenttop && ./setup.sh
 ```
 
-or `pip install agenttop`
-
-## run
+That's it. The setup script handles Python, the venv, and dependencies. Then:
 
 ```bash
-./run.sh              # web dashboard
-.venv/bin/agenttop    # terminal dashboard
-agenttop init         # set up LLM for analysis (ollama/anthropic/openai)
+source .venv/bin/activate
+agenttop                # terminal dashboard
+agenttop web            # web dashboard at localhost:8420
+agenttop stats          # quick CLI summary
+agenttop init           # configure LLM for AI analysis
 ```
 
-## data sources
+Requirements: Python 3.10+. No Docker. No API keys needed. macOS, Linux, Windows.
+
+Keyboard: `d` dashboard · `s` sessions · `e` explorer · `a` analysis · `k` graph · `1-4` time range · `q` quit
+
+## What You See
+
+### Terminal Dashboard
 
 ```
-~/.claude/projects/**/*.jsonl        exact token counts per message
-~/.cursor/ai-tracking/*.db           conversations, models, AI vs human ratio
-~/.codex/.codex-global-state.json    prompts, automations
-~/.config/github-copilot/            session state
-~/Library/.../Kiro/state.vscdb       workspace data
+All time  17.8M tok  $687 cost  265 sess  5.6K msgs  5 tools  87% cache
+
+COST BY PROJECT                    COST BY MODEL
+apex-trading-engine ████████ $284  opus-4-6    ████████████ $412
+vaultkeeper         █████    $148  sonnet-4-6  █████        $198
+phantom-search      ████      $97  haiku-4-5   ██            $76
+neon-ui             ██        $63
+dataweave           ██        $51
+
+DAILY COST (30d)                   ACTIVITY BREAKDOWN
+▁▃▅▇█▇▅▃▁▂▄▆█▇▅▃▁▂▅▇█▇▅▃▁▃▅▇    coding       ████████  42%
+total $687  avg $23/day  peak $45  debugging    ████      21%
+                                   testing      ███       15%
+TOOLS                              exploration  ██         9%
+● Claude Code  180 sess  $469
+● Cursor        45 sess  $107     ONE-SHOT RATE
+● Kiro          20 sess   $53     87%  ██████████████████░░
+● Codex         12 sess   $53     edits that pass first try
+● Copilot        8 sess    $5     higher = better prompting
 ```
 
-## architecture
+Six panels. No plotext. Pure Rich text rendering. Metrics are computed from actual tool calls where the tool records them (Claude Code), with a prompt-keyword fallback for the other tools.
 
+### Web Dashboard
+
+Three tabs: **Overview** · **Sessions** · **Analyze**
+
+- **Overview** — force-directed knowledge graph (D3), model usage (input/output/cache), hourly activity, cost breakdown, workflow intelligence
+- **Sessions** — full-page browser with Google-style pagination. Search by project or prompt. Sort by cost, time, tokens. Click any session to see complete prompt history
+- **Analyze** — select sessions (All / Last 10 / Top Cost), run LLM analysis, get a deep-dive report with score, grades, cost forensics by project and model, anti-patterns, recommendations with estimated savings
+
+Keyboard: `o` overview · `s` sessions · `a` analyze. URL hash routing (`#sessions`, `#analyze`) for deep links.
+
+## Features
+
+### Data Extraction
+
+| Tool | Data Source | What agenttop extracts |
+|------|------------|----------------------|
+| **Claude Code** | `~/.claude/projects/**/*.jsonl` | Exact per-message token counts (input, output, cache read, cache create). Per-message model ID. **Every tool call name** (Edit, Bash, Read, Grep, Agent, Write — extracted from `tool_use` content blocks). Up to 50 user prompts per session. Project path from `cwd` field. Cost from per-model pricing. |
+| **Cursor** | `~/.cursor/ai-tracking/ai-code-tracking.db` | Conversations from SQLite. Source type (tab/composer/chat). AI vs human code ratio from `scored_commits`. Model per code hash. Project resolution via `ide_state.json` workspace mapping. |
+| **Kiro** | `~/Library/.../Kiro/User/globalStorage/state.vscdb` | Session data from VS Code state DB. Keys matching `kiro%`, `chat%`, `session%` patterns. Message counts and timestamps. |
+| **Codex** | `~/.codex/` | Prompt history from `.codex-global-state.json`. Session files from `sessions/` rollouts. Automation data from SQLite. Config (model, reasoning effort). |
+| **Copilot** | `~/.config/github-copilot/session-state/` | Per-session JSON with message content. Model extraction. Custom agent detection. Token estimation from content length. |
+
+All read-only. agenttop never modifies your tool data.
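As a rough illustration of the Claude Code row above, tool names could be counted from `tool_use` content blocks along these lines. This is a minimal sketch, not agenttop's actual collector: the function name `count_tool_calls` is hypothetical, and the record shape (`message.content` as a list of typed blocks) is an assumption about the JSONL layout.

```python
import json
from collections import Counter


def count_tool_calls(jsonl_lines):
    """Count tool names found in `tool_use` content blocks.

    Assumed record shape (not confirmed by the README):
    {"message": {"content": [{"type": "tool_use", "name": "Edit"}, ...]}}
    """
    counts = Counter()
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed or partially written lines
        if not isinstance(record, dict):
            continue
        message = record.get("message")
        content = message.get("content") if isinstance(message, dict) else None
        if not isinstance(content, list):
            continue  # plain-string content carries no tool calls
        for block in content:
            if isinstance(block, dict) and block.get("type") == "tool_use":
                counts[block.get("name", "unknown")] += 1
    return dict(counts)


sample = [
    '{"message": {"content": [{"type": "tool_use", "name": "Edit"}]}}',
    '{"message": {"content": "plain text prompt"}}',
    '{"message": {"content": [{"type": "tool_use", "name": "Read"}, {"type": "tool_use", "name": "Edit"}]}}',
    "not json",
]
tool_breakdown = count_tool_calls(sample)
```

The function only reads the lines it is given and never writes anything back, in keeping with the read-only guarantee.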
+
+### Activity Classification
+
+Deterministic. No LLM. Classified from **actual tool call data** when available (Claude Code); falls back to prompt keywords for other tools.
+
+| Activity | How it's detected |
+|----------|------------------|
+| **coding** | Edit, Write, MultiEdit tool calls |
+| **debugging** | Bug/error/fix keywords in prompts + Edit/Bash patterns |
+| **testing** | Bash calls with pytest/jest/vitest/cargo test |
+| **exploration** | Read, Grep, Glob calls without edits |
+| **refactoring** | Refactor/rename/extract keywords + Edit patterns |
+| **git ops** | Bash calls with git commands |
+| **planning** | EnterPlanMode, TaskCreate, Agent tool calls |
+| **other** | Everything else |
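The table above can be read as a small rule-based classifier. The sketch below is one possible reading, not the project's implementation: the function name `classify_turn`, its parameters, and the order in which rules are checked are all assumptions, since the README does not say which rule wins when several match.

```python
import re

EDIT_TOOLS = {"Edit", "Write", "MultiEdit"}
READ_TOOLS = {"Read", "Grep", "Glob"}
PLAN_TOOLS = {"EnterPlanMode", "TaskCreate", "Agent"}

TEST_RE = re.compile(r"\b(pytest|jest|vitest|cargo test)\b")
GIT_RE = re.compile(r"^\s*git\s")
DEBUG_RE = re.compile(r"\b(bug|error|fix)\b", re.IGNORECASE)
REFACTOR_RE = re.compile(r"\b(refactor|rename|extract)\b", re.IGNORECASE)


def classify_turn(tools, bash_commands=(), prompt=""):
    """Return one activity label for a turn.

    The precedence below (testing, git ops, debugging, refactoring,
    coding, planning, exploration, other) is an assumed ordering.
    """
    tools = set(tools)
    has_edit = bool(tools & EDIT_TOOLS)
    if any(TEST_RE.search(cmd) for cmd in bash_commands):
        return "testing"
    if any(GIT_RE.match(cmd) for cmd in bash_commands):
        return "git ops"
    if has_edit and DEBUG_RE.search(prompt):
        return "debugging"
    if has_edit and REFACTOR_RE.search(prompt):
        return "refactoring"
    if has_edit:
        return "coding"
    if tools & PLAN_TOOLS:
        return "planning"
    if tools & READ_TOOLS:
        return "exploration"  # reads and searches with no edits
    return "other"
```

Because every rule is a set lookup or a regex match, the same input always produces the same label, which is what makes the classification deterministic.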
+
+### One-Shot Success Rate
+
+Percentage of edit turns that pass without retry. Detects `Edit -> correction prompt -> Edit` retry cycles in your prompt history. Higher percentage = better prompting, fewer wasted tokens.
+
+When `tool_breakdown` is available (Claude Code), uses actual Edit/Write call counts. Falls back to prompt analysis for other tools.
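A minimal sketch of the retry-cycle idea, assuming each turn is reduced to a `(prompt, made_edit)` pair. The function name `one_shot_rate` and the correction keywords are hypothetical; the real detector may use different signals.

```python
import re

# Hypothetical correction markers, chosen only for illustration.
CORRECTION_RE = re.compile(
    r"\b(still|again|wrong|broken|didn't work|not working|revert)\b",
    re.IGNORECASE,
)


def one_shot_rate(turns):
    """Percentage of edit turns not followed by a corrective edit turn.

    turns: list of (prompt, made_edit) tuples in chronological order.
    Returns None when there are no edit turns to measure.
    """
    edit_turns = 0
    retried = 0
    for i, (_prompt, made_edit) in enumerate(turns):
        if not made_edit:
            continue
        edit_turns += 1
        if i + 1 < len(turns):
            next_prompt, next_made_edit = turns[i + 1]
            # Edit -> correction prompt -> Edit counts as one retry.
            if next_made_edit and CORRECTION_RE.search(next_prompt):
                retried += 1
    if edit_turns == 0:
        return None
    return 100.0 * (edit_turns - retried) / edit_turns


turns = [
    ("add retry logic", True),
    ("that is still failing", True),
    ("now add tests", True),
    ("explain the module", False),
]
rate = one_shot_rate(turns)
```

Here the first edit is followed by a correction that triggers another edit, so two of the three edit turns count as one-shot.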
+
+### Cost Analysis
+
+- **Cost by project** — which project burns the most money, with session count
+- **Cost by model** — opus vs sonnet vs haiku spend, computed from actual per-model pricing (input/output/cache rates)
+- **Daily cost sparkline** — 30-day unicode trend with total, average, and peak
+- **Cache hit rate** — from actual `cacheReadInputTokens` vs `inputTokens` in Claude Code data
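The cost and cache figures above could be derived roughly as follows. This is a sketch under stated assumptions: the rates are placeholder numbers rather than real prices, the names `Rates`, `message_cost_usd`, and `cache_hit_rate` are hypothetical, and the hit rate is taken to be cache reads divided by cache reads plus uncached input, which the README does not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """USD per one million tokens for each token class."""
    input: float
    output: float
    cache_read: float
    cache_create: float


# Placeholder rates for illustration only, not real pricing.
EXAMPLE_RATES = {
    "example-model": Rates(input=10.0, output=50.0, cache_read=1.0, cache_create=12.5),
}


def message_cost_usd(model, input_tokens, output_tokens,
                     cache_read_tokens, cache_create_tokens,
                     rates=EXAMPLE_RATES):
    """Price one message from its four token counts."""
    r = rates[model]
    total = (
        input_tokens * r.input
        + output_tokens * r.output
        + cache_read_tokens * r.cache_read
        + cache_create_tokens * r.cache_create
    )
    return total / 1_000_000


def cache_hit_rate(cache_read_tokens, input_tokens):
    """Assumed definition: cache reads as a share of all input-side tokens."""
    denominator = cache_read_tokens + input_tokens
    if denominator == 0:
        return 0.0
    return 100.0 * cache_read_tokens / denominator
```

Summing `message_cost_usd` over a session's messages, grouped by model or project, would give the per-model and per-project breakdowns.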
+
+### Session Data Model
+
+Each session stores:
+
+```python
+Session(
+    tool_breakdown={"Edit": 5, "Bash": 3, "Read": 12, "Grep": 4},  # actual tool calls
+    models_used={"claude-opus-4-6": 8, "claude-sonnet-4-6": 12},    # per-message model
+    prompts=["fix the race condition in...", ...],                    # up to 50
+    total_tokens=48291,          # exact for Claude, estimated for others
+    estimated_cost_usd=12.47,    # per-model pricing
+    message_count=23,
+    tool_call_count=24,
+    # + id, tool, project, start_time, end_time
+)
 ```
-                    ┌──────────────────────────────────────────────┐
-                    │             YOUR MACHINE (read-only)         │
-                    │                                              │
-                    │  ~/.claude/   ~/.cursor/   ~/.codex/   ...   │
-                    └──────┬───────────┬────────────┬──────────────┘
-                           │           │            │
-                           ▼           ▼            ▼
-                    ┌──────────────────────────────────────────────┐
-                    │              COLLECTORS                       │
-                    │                                              │
-                    │  ClaudeCodeCollector    → JSONL parser        │
-                    │  CursorCollector        → SQLite + workspace  │
-                    │  KiroCollector          → VS Code state DB    │
-                    │  CodexCollector         → JSON + SQLite       │
-                    │  CopilotCollector       → session JSON        │
-                    │                                              │
-                    │  Each: collect_sessions() → list[Session]     │
-                    │        get_stats(days)    → ToolStats         │
-                    └──────────────────┬───────────────────────────┘
-                                       │
-                          ┌────────────┴────────────┐
-                          ▼                         ▼
-                   ┌─────────────┐          ┌─────────────┐
-                   │  WEB (D3)   │          │  TUI (term)  │
-                   │  port 8420  │          │  textual     │
-                   │             │          │              │
-                   │  FastAPI    │          │  5 tabs:     │
-                   │  WebSocket  │          │  dashboard   │
-                   │  3 tabs:    │          │  sessions    │
-                   │  overview   │          │  explorer    │
-                   │  sessions   │          │  analysis    │
-                   │  analyze    │          │  graph       │
-                   └──────┬──────┘          └──────────────┘
-                          │
-                          ▼
-                   ┌──────────────────────────────────────┐
-                   │      OPTIMIZER (map-reduce-generate)  │
-                   │                                      │
-                   │  MAP:     per-session LLM calls      │
-                   │           (cached, concurrent)        │
-                   │           intent, spirals, quality    │
-                   │                                      │
-                   │  REDUCE:  pure python, deterministic  │
-                   │           score 0-100, 5 dimensions   │
-                   │           cost forensics, anti-pats   │
-                   │                                      │
-                   │  GENERATE: single LLM call            │
-                   │           profile, recs, insights     │
-                   │                                      │
-                   │  LLM: ollama / anthropic / openai     │
-                   └──────────────────────────────────────┘
-```
 
-**collectors** parse tool-specific local files into a unified `Session` model (id, tool, project, messages, tokens, cost, prompts, timestamps). each collector handles one tool's quirks — JSONL for Claude, SQLite for Cursor, JSON blobs for Codex.
+## AI Analysis
+
+Optional. Select sessions, run LLM analysis, get a report.
 
-**web dashboard** is vanilla JS + D3, no frameworks. FastAPI serves the API and static files. WebSocket for live updates. three tabs: overview (knowledge graph + panels), sessions (paginated browser with detail pane), analyze (select sessions → LLM analysis → score + cost forensics + recommendations).
+**Three-phase pipeline (Map-Reduce-Generate):**
 
-**TUI** is built on textual. plotext for charts. five tabs: dashboard (stats + charts), sessions (project aggregates + history), explorer (interactive search/select/analyze), analysis (model usage + intent distribution), graph (tree view).
+1. **MAP** — batches selected sessions into a single LLM call with full prompt history. Classifies each: intent, correction spirals, prompt quality, wasted effort. Results cached per session ID — sessions are immutable, never re-analyzed.
 
-**optimizer** is the interesting part. three phases:
+2. **REDUCE** — pure Python, no LLM. Deterministic score from 5 dimensions (0-20 points each):
 
-1. **MAP** — takes your top 30 sessions (by cost), sends each to an LLM with full prompt history. classifies: intent (debugging/greenfield/exploration/...), had correction spirals?, prompt quality, wasted effort. results cached per session ID at `~/.agenttop/session_cache.json` — sessions are immutable so they're never re-analyzed. max 10 new sessions per run. concurrent: 1 worker for ollama, 4 for cloud.
+   | Dimension | Source | Formula |
+   |-----------|--------|---------|
+   | Session hygiene | MAP classifications | `spiral_free_sessions / total × 20` |
+   | Prompt quality | MAP classifications | `no_waste_sessions / total × 20` |
+   | Cost efficiency | Python cost forensics | `(1 - waste_pct / 100) × 20` |
+   | Cache efficiency | Claude model_usage | `cache_hit_rate / 100 × 20` |
+   | Tool utilization | Feature detection | `features_used / available × 20` |
 
-2. **REDUCE** — pure python. no LLM. computes a deterministic score from 5 dimensions (0-20 points each):
-   - session hygiene: `sessions_without_spirals / total × 20`
-   - prompt quality: `sessions_without_waste / total × 20`
-   - cost efficiency: `(1 - waste_pct/100) × 20`
-   - cache efficiency: `cache_hit_rate/100 × 20`
-   - tool utilization: `features_used/features_available × 20`
+3. **GENERATE** — single LLM call with ~2K tokens of pre-computed metrics. LLM writes prose (developer profile, recommendations, project insights). Does NOT compute any numbers — those come from REDUCE.
 
-   also computes cost forensics (spend by project, by model, waste estimation from marathon sessions) and anti-pattern counts.
+Score is fully traceable. "Session hygiene: 15/20 (23 of 30 sessions had no correction spirals)."
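The five REDUCE formulas can be sketched in a few lines of plain Python. The function names, the clamping to the 0-20 range, and the final rounding are assumptions; only the per-dimension formulas come from the table.

```python
def dimension_points(ratio):
    """Map a 0..1 ratio to 0..20 points, clamped (clamping is an assumption)."""
    return 20.0 * max(0.0, min(1.0, ratio))


def safe_div(numerator, denominator):
    """Avoid division by zero when no sessions or features are present."""
    return numerator / denominator if denominator else 0.0


def reduce_score(total_sessions, spiral_free_sessions, no_waste_sessions,
                 waste_pct, cache_hit_rate, features_used, features_available):
    """Return (score out of 100, per-dimension breakdown)."""
    breakdown = {
        "session_hygiene": dimension_points(safe_div(spiral_free_sessions, total_sessions)),
        "prompt_quality": dimension_points(safe_div(no_waste_sessions, total_sessions)),
        "cost_efficiency": dimension_points(1 - waste_pct / 100),
        "cache_efficiency": dimension_points(cache_hit_rate / 100),
        "tool_utilization": dimension_points(safe_div(features_used, features_available)),
    }
    return round(sum(breakdown.values())), breakdown


score, breakdown = reduce_score(
    total_sessions=30,
    spiral_free_sessions=23,
    no_waste_sessions=24,
    waste_pct=10,
    cache_hit_rate=87,
    features_used=3,
    features_available=5,
)
```

Since no LLM output feeds into the arithmetic, every point in the total can be traced back to one of the five inputs.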
 
-3. **GENERATE** — single LLM call with ~2K tokens of pre-computed metrics. LLM writes prose (developer profile, recommendations, project insights). it does NOT compute any numbers — those come from REDUCE.
+**LLM providers:** Ollama (free, local — nothing leaves your machine), Anthropic, OpenAI, OpenRouter.
 
-the score is fully traceable. "session hygiene: 14/20 — 23/30 sessions had no correction spirals." not a vibe check.
+```bash
+agenttop init  # interactive setup wizard
+```
+
+## Demo Mode
 
-## API
+Safe for recordings and screenshots. Generates realistic fake data — 10 projects, 265 sessions across 5 tools, with handwritten prompts that read like real engineering work.
 
-| endpoint | what |
-|----------|------|
-| `GET /api/stats?days=N` | aggregated stats from all collectors |
-| `GET /api/sessions?days=N` | all sessions (paginated client-side) |
-| `GET /api/sessions/{id}` | full session detail with prompts |
-| `GET /api/models` | claude model usage (input/output/cache) |
-| `GET /api/hours` | hourly token distribution |
-| `GET /api/graph` | D3-compatible knowledge graph |
-| `POST /api/analyze-sessions` | LLM analysis on selected sessions |
-| `POST /api/optimize` | full optimizer pipeline |
-| `GET /api/optimize-stream` | SSE streaming progress + result |
-| `WS /ws` | real-time stat updates |
+```bash
+agenttop --demo        # terminal with fake data
+agenttop web --demo    # web dashboard with fake data
+```
 
-## config
+Deterministic. Same screenshots every time.
 
-zero config by default. `agenttop init` for interactive setup, or:
+## How It Works
+
+```
+~/.claude/  ~/.cursor/  ~/.codex/  ~/.config/github-copilot/  ~/Library/.../Kiro/
+     |           |          |              |                        |
+     v           v          v              v                        v
+  COLLECTORS — parse tool-specific local files
+  │  Claude: JSONL → exact tokens, tool names, model per message
+  │  Cursor: SQLite → conversations, AI vs human ratio, models
+  │  Codex:  JSON + SQLite → prompts, automations, rollouts
+  │  Copilot: JSON → session messages, model, agents
+  │  Kiro:   SQLite → VS Code state keys
+  │
+  └──> unified Session model (tool_breakdown, models_used, prompts, tokens, cost)
+          │
+          ├──> WEB DASHBOARD (FastAPI + D3 + vanilla JS, port 8420)
+          │    overview (knowledge graph) | sessions (paginated) | analyze
+          │
+          ├──> TERMINAL DASHBOARD (Textual + Rich)
+          │    dashboard | sessions | explorer | analysis | graph
+          │
+          └──> OPTIMIZER (Map-Reduce-Generate, optional)
+               MAP: batch LLM call, cached per session
+               REDUCE: deterministic score 0-100
+               GENERATE: prose recommendations
+```
+
+## Configuration
+
+Zero config by default. For AI analysis:
+
+```bash
+agenttop init
+```
+
+or manually:
 
 ```toml
 # ~/.agenttop/config.toml
 [llm]
 provider = "ollama"           # ollama | anthropic | openai | openrouter
-model = "ollama/gemma3:4b"
-map_concurrency = 0           # 0 = auto
+model = "ollama/gemma3:4b"    # any litellm-compatible model
 ```
 
-## no telemetry
+Environment variable overrides: `AGENTTOP_LLM_PROVIDER`, `AGENTTOP_LLM_MODEL`, `ANTHROPIC_API_KEY`.
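One way such overrides could be resolved is sketched below. The function name `resolve_llm_config`, the precedence (environment over file over defaults), and the default values are assumptions made for illustration.

```python
import os

# Assumed defaults, mirroring the example config above.
DEFAULTS = {"provider": "ollama", "model": "ollama/gemma3:4b"}
ENV_KEYS = {"provider": "AGENTTOP_LLM_PROVIDER", "model": "AGENTTOP_LLM_MODEL"}


def resolve_llm_config(file_config=None, env=None):
    """Merge defaults, the [llm] table from config.toml, and env overrides."""
    env = os.environ if env is None else env
    resolved = dict(DEFAULTS)
    resolved.update(file_config or {})
    for key, variable in ENV_KEYS.items():
        value = env.get(variable)
        if value:  # ignore unset or empty variables
            resolved[key] = value
    return resolved


config = resolve_llm_config(
    file_config={"model": "custom-model"},
    env={"AGENTTOP_LLM_PROVIDER": "anthropic"},
)
```

Passing `env` explicitly keeps the function easy to test without touching the real process environment.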
+
+## No Telemetry
 
-zero. local only. ollama = nothing leaves your machine.
+Zero. No data collection. No cloud uploads. No analytics. Everything runs locally. With Ollama, nothing leaves your machine at all.
 
-## license
+## License
 
 Apache 2.0
+
+## Contributors
+
+Built with [@AbhilashSri](https://github.com/AbhilashSri) (workflow intelligence, code reviews), @Mohit and @Akshit (testing, UX).