You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add What's New in V2 section highlighting bring-your-own-data,
interactive onboarding, knowledge system, self-learning, brand theming,
and 606 tests. Fix stale references to deleted scripts and setup docs.
Update counts (18 agents, 39 skills, 20 commands).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Copy file name to clipboardExpand all lines: README.md
+83-46Lines changed: 83 additions & 46 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,19 +1,33 @@
1
-
# AI Analyst
1
+
# AI Analyst v2
2
2
3
-
> [!IMPORTANT]
4
-
> **Note from Shane — February 22, 2026**
5
-
>
6
-
> This repo was last pushed to on February 20. Since then, the system has grown significantly through testing on more complex, real-world datasets. Everything below works, but the version we're running locally is substantially ahead of what's here. Major update coming in the next couple of days.
A complete AI analyst system powered by Claude Code. You ask a business question. Claude frames it, explores your data, finds the root cause, builds a story, and hands you a branded slide deck with speaker notes. The whole thing takes minutes, not days.
13
9
14
-
Connect your own data with `/connect-data` or use the included example datasets.
10
+
**Bring your own data.** No bundled datasets to configure — connect your CSVs, DuckDB, or warehouse with `/connect-data` and start analyzing immediately.
11
+
12
+
**18** specialized agents | **39** auto-applied skills | **20** slash commands | DAG-based parallel execution | PDF + HTML export
13
+
14
+
---
15
+
16
+
## What's New in V2
15
17
16
-
**17** specialized agents | **30** auto-applied skills | **14** slash commands | DAG-based parallel execution | PDF + HTML export
18
+
V2 is a ground-up rebuild of the intelligence layer. The pipeline and agents from V1 still work the same way — you won't notice a difference in how you use it. What changed is everything underneath.
19
+
20
+
| Area | V1 | V2 |
21
+
|------|----|----|
22
+
|**Data**| Bundled NovaMart e-commerce dataset | Bring your own — CSV, DuckDB, Postgres, BigQuery, Snowflake |
23
+
|**Onboarding**| Manual setup, read the docs |`/setup` interview learns your role, data, and business context |
24
+
|**Memory**| Stateless across sessions | Knowledge system persists corrections, learnings, query patterns, business glossary |
25
+
|**Self-learning**| None | Captures feedback, logs corrections, retrieves proven SQL patterns — never repeats the same mistake |
26
+
|**Theming**| Hardcoded chart style | YAML-based theme system with brand colors, WCAG-compliant palettes |
Or skip the wizard and just ask a question with your data in a directory:
66
79
67
80
```
68
-
/run-pipeline data_path=data/your_dataset/ question="Why is conversion dropping?"
81
+
/run-pipeline data_path=data/my_csvs/ question="Why is conversion dropping?"
69
82
```
70
83
71
-
For full setup (MotherDuck, MCP connections, troubleshooting): [setup/prerequisites.md](setup/prerequisites.md)
84
+
For full setup details: [docs/setup-guide.md](docs/setup-guide.md)
72
85
73
86
---
74
87
@@ -88,7 +101,7 @@ Claude queries the data and returns an answer with a chart. Simple questions get
88
101
/run-pipeline data_path=data/your_dataset/ question="What's driving the decline in conversion?"
89
102
```
90
103
91
-
The pipeline runs 17 agents across 4 phases: Frame the question, Analyze the data, Build the story, Create the deck. You get a validated analysis, branded charts, a narrative, and a slide deck with speaker notes. Exports to PDF and HTML.
104
+
The pipeline runs 18 agents across 4 phases: Frame the question, Analyze the data, Build the story, Create the deck. You get a validated analysis, branded charts, a narrative, and a slide deck with speaker notes. Exports to PDF and HTML.
92
105
93
106
### 3. Explore a dataset
94
107
@@ -118,7 +131,7 @@ Claude generates a chart following Storytelling with Data methodology: warm off-
118
131
119
132
## How It Works: The Pipeline
120
133
121
-
When you run `/run-pipeline`, Claude orchestrates 17 agents across 4 phases:
134
+
When you run `/run-pipeline`, Claude orchestrates 18 agents across 4 phases:
122
135
123
136
```
124
137
1. FRAME 2. ANALYZE 3. STORY 4. DECK
@@ -144,7 +157,7 @@ You don't have to run the whole thing. Five execution plans let you run just the
144
157
145
158
| Plan | Use When | What Runs |
146
159
|------|----------|-----------|
147
-
|`full_presentation`| Complete analysis to slide deck | All 17 agents |
160
+
|`full_presentation`| Complete analysis to slide deck | All 18 agents |
148
161
|`deep_dive`| Analysis without presentation | Phases 1-2 only |
149
162
|`quick_chart`| Just need one chart | Chart Maker + Design Critic |
|`/notion-ingest`| Import business context from Notion |`/notion-ingest`|
233
+
|`/compare-datasets`| Compare metrics across datasets |`/compare-datasets`|
234
+
|`/setup-dev-context`| Add codebase context for dev teams |`/setup-dev-context`|
214
235
215
236
Or just ask in plain English. "Show me conversion by device" works as well as any command.
216
237
@@ -241,32 +262,17 @@ Your Data --> chart_helpers.py --> Base Chart (150 DPI)
241
262
- Collision detection checks for overlapping text with 3 auto-fix strategies: offset the label, reduce font size, or drop the least important label. Charts with unresolved collisions halt the pipeline.
242
263
- The deck uses branded HTML components: KPI cards, finding cards, recommendation rows, so-what callouts, before/after panels, timelines, and more
243
264
- A lint gate validates every deck before export: checks frontmatter completeness, HTML component usage (minimum 3 types), valid slide classes, slide count, and pacing
| Add to the agent DAG | Edit `agents/registry.yaml` (dependencies, execution order) |
333
343
334
344
---
335
345
336
346
<details>
337
-
<summary><strong>All 17 Agents</strong> (click to expand)</summary>
347
+
<summary><strong>All 18 Agents</strong> (click to expand)</summary>
338
348
339
349
Agents are markdown prompt templates in the `agents/` directory. Each defines a multi-step workflow with `{{VARIABLES}}` that get filled in at runtime. To invoke one, ask Claude to run it or use `/run-pipeline` to orchestrate all of them.
340
350
@@ -378,6 +388,7 @@ Agents are markdown prompt templates in the `agents/` directory. Each defines a
378
388
|-------|-------------|---------------|
379
389
| storytelling | Converts findings into a stakeholder-ready narrative with executive summary, findings, insight, and recommendations | 15 |
380
390
| deck-creator | Builds a branded Marp slide deck with HTML components, speaker notes, and correct theme styling | 16 |
@@ -390,7 +401,7 @@ Agents are markdown prompt templates in the `agents/` directory. Each defines a
390
401
---
391
402
392
403
<details>
393
-
<summary><strong>All 30 Skills</strong> (click to expand)</summary>
404
+
<summary><strong>All 39 Skills</strong> (click to expand)</summary>
394
405
395
406
Skills are instruction files in `.claude/skills/` that Claude follows automatically when a trigger condition matches. You don't invoke them manually. When you ask for a chart, the Visualization Patterns skill activates. When you start an analysis, the Data Quality Check skill runs.
396
407
@@ -404,6 +415,7 @@ These skills shape every interaction:
404
415
| close-the-loop | Every recommendation gets a decision owner, success metric, follow-up date, and fallback plan |
405
416
| data-quality-check | Validates data completeness and consistency before analysis begins |
0 commit comments