RyanAlberts
diff --git a/‎.secrets.baseline‎
Lines changed: 10 additions & 1 deletion b/‎.secrets.baseline‎
Lines changed: 10 additions & 1 deletion
diff --git a/‎CHANGELOG.md‎
Lines changed: 9 additions & 1 deletion b/‎CHANGELOG.md‎
Lines changed: 9 additions & 1 deletion
diff --git a/‎README.md‎
Lines changed: 4 additions & 0 deletions b/‎README.md‎
Lines changed: 4 additions & 0 deletions
diff --git a/‎docs/MEMO_STRUCTURE.md‎
Lines changed: 86 additions & 0 deletions b/‎docs/MEMO_STRUCTURE.md‎
Lines changed: 86 additions & 0 deletions
diff --git a/‎docs/decisions/0003-three-povs-memo-frame.md‎
Lines changed: 88 additions & 0 deletions b/‎docs/decisions/0003-three-povs-memo-frame.md‎
Lines changed: 88 additions & 0 deletions
diff --git a/‎docs/decisions/README.md‎
Lines changed: 1 addition & 0 deletions b/‎docs/decisions/README.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/screenshots/dashboard-hero.png‎
226 KB b/‎docs/screenshots/dashboard-hero.png‎
226 KB
diff --git a/‎docs/screenshots/dashboard-w26.jpg‎
369 KB b/‎docs/screenshots/dashboard-w26.jpg‎
369 KB
diff --git a/‎examples/README.md‎
Lines changed: 6 additions & 2 deletions b/‎examples/README.md‎
Lines changed: 6 additions & 2 deletions
@@ -136,6 +136,15 @@
         "line_number": 102
       }
     ],
+    "examples/output/dashboard-w26-pr17-2026-05-03.html": [
+      {
+        "type": "Base64 High Entropy String",
+        "filename": "examples/output/dashboard-w26-pr17-2026-05-03.html",
+        "hashed_secret": "3c09e03744a49c6020501c9b7ef6218ad440976e",
+        "is_verified": false,
+        "line_number": 102
+      }
+    ],
     "src/ycai/dashboard.py": [
       {
         "type": "Base64 High Entropy String",
@@ -146,5 +155,5 @@
       }
     ]
   },
-  "generated_at": "2026-05-02T02:15:53Z"
+  "generated_at": "2026-05-03T19:16:51Z"
 }
@@ -7,7 +7,15 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ## [Unreleased]
 
-_(no changes since 0.2.0)_
+### Added — PR #17 (memo polish)
+- **Executive summary** at the top of the memo, citing the Nobel laureate's framing of what the headline finding implies for capital allocation.
+- **Three-POV introduction** — single paragraph that pits Marc Andreessen, Ray Dalio, and Daron Acemoglu (2024 Nobel laureate in Economics) against each other on what the batch findings imply. The memo deliberately does not pick a winner. Codified in [`docs/MEMO_STRUCTURE.md`](docs/MEMO_STRUCTURE.md) and [ADR 0003](docs/decisions/0003-three-povs-memo-frame.md).
+- **Inside B2B SaaS** sub-industry table — one-layer-deeper breakdown using YC's `subindustry` passthrough (not LLM-derived, so it can't drift). Renders only when B2B SaaS rows exist.
+- **Tech stack chart now excludes the `unknown` bucket**; the unknown count is rendered as a footnote / asterisk under the chart instead of as the largest bar.
+- **Traction signals section** — companies that advertise verifiable traction (GitHub stars, named customers, funding rounds, revenue, user counts, press, partnerships). New `TractionSignal` schema, model populates them with verbatim spans, source-URL guard rejects fabricated citations. W26 dogfood: **73 of 105 high-confidence companies surfaced 212 traction signals across 8 kinds**.
+- **3-POV slide** in the deck for parity with the memo; named figures live in a single dict so memo and deck can never disagree.
+- **README hero screenshot** of the dashboard (auto-generated via Playwright at PR time).
+- **Bug fix**: dashboard chart-options JSON was being HTML-escaped before injection into a `<script type="application/json">` block, which broke `JSON.parse` on the client. Real charts in the v0.2.0 example HTML now render in browsers as well as in Playwright.
 
 ## [0.2.0] — 2026-05-01
 
 
@@ -6,6 +6,10 @@
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
 [![Python 3.11+](https://img.shields.io/badge/python-3.11+-blue.svg)](https://www.python.org/downloads/)
 
+![yc-ai-pulse W26 dashboard hero](docs/screenshots/dashboard-hero.png)
+
+_The dashboard auto-refuses to ship if any cited URL is dead. Full-page screenshot at [`docs/screenshots/dashboard-w26.jpg`](docs/screenshots/dashboard-w26.jpg)._
+
 ## What it does
 
 `yc-ai-pulse` answers: *what does the most recent YC batch tell us about the state of AI?*
 
@@ -0,0 +1,86 @@
+# Memo structure (the codified instruction)
+
+This file is load-bearing. Anyone editing `src/ycai/reports/docx.py` should
+keep the structure below intact unless the README, this file, ADR 0003, and
+the memo all change together.
+
+The memo (`report.docx`) is a narrative document, 3-6 pages, with embedded
+chart PNGs and verbatim citations. Per [USER.md document-format
+discipline](#) it is the canonical *strategic* surface. The deck mirrors it
+visually but the memo is the load-bearing prose.
+
+## Required sections, in order
+
+1. **Title + dateline** — batch label, generation timestamp, repo URL.
+2. **Executive summary** — 3-4 sentences. Headline coverage %, headline
+   capability finding (e.g. "X of Y companies build agents"), one Nobel
+   laureate's framing of what that means for capital allocation.
+3. **Introduction (three POVs)** — single paragraph that pits three named
+   public voices against each other on what to do with this batch:
+     - **Marc Andreessen** (a16z) — techno-optimist concentrate-and-bet
+     - **Ray Dalio** (Bridgewater) — diversify, weight macro cycles
+     - **Daron Acemoglu** (2024 Nobel laureate, MIT) — productivity claims
+       are inflated; weight labor-displacement and redistribution risk
+   The paragraph should not pick a winner. It frames the batch findings as
+   an empirical input that all three would interpret differently.
+4. **Coverage and methodology** — Tier A/B/C breakdown, Layer 1+2 disclosure.
+5. **The agentic batch** — capability heatmap + analysis paragraph.
+6. **Industry distribution** — top-level industry chart.
+7. **Inside B2B SaaS** — one-layer-deeper breakdown of the largest industry
+   bucket using YC's `subindustry` field. Pure passthrough math (not
+   LLM-derived) so the breakdown can't drift.
+8. **Tech stack and OSS posture** — chart of *known* tech-stack mentions
+   only; the unknown count is rendered as a footnote/asterisk under the
+   chart, not as a chart bar.
+9. **Traction signals** — companies that advertise verifiable traction
+   (GitHub stars, named customers, funding rounds, revenue, user counts,
+   press, partnerships). One section per signal kind, capped at 5
+   companies per kind for legibility, with verbatim detail and a citable
+   source URL. Companies without any traction signal are not listed.
+10. **Six company spotlights** — diverse-capability + non-B2B-SaaS picks.
+    Each spotlight includes its traction signals as bullets when present.
+11. **What we still cannot answer** — three open questions framed against
+    the introduction's three POVs.
+12. **Reproduce this memo** — install + run instructions.
+
+## Why this structure
+
+- **Executive summary first** because most readers don't read past page one.
+  The Nobel POV in section 2 is what makes the memo *useful as input to a
+  capital allocation decision* — without it, this is just classification.
+- **Three-POV introduction** because no single voice on AI is dispositive
+  in 2026. Pitting Andreessen, Dalio, and Acemoglu against each other forces
+  the reader to make a judgment rather than absorb a pre-cooked answer.
+- **Sub-industry breakdown** because "B2B SaaS" is the laziest taxonomy
+  bucket in venture and tells you nothing useful. One layer deeper
+  ("DevTools", "GTM/Sales", "Compliance") differentiates a bet.
+- **Tech-stack-known-only chart** because rendering "unknown" as the largest
+  bar is misleading even when honest. The footnote keeps the honesty.
+- **Traction section before spotlights** because traction is a stronger
+  signal than capability. A B2B SaaS with 5,000 GitHub stars is more
+  interesting than 50 nameless agents companies.
+
+## Citation rules (Layer 2 enforced)
+
+Every number in aggregate prose must trace back to `analytics.headline_numbers`,
+a chart counter, or `extra_allowed` (derived sums and infrastructure facts).
+Per-company verbatim quotes (taglines, rationales, traction details) are
+exempt from drift check but still scanned for forbidden phrases.
+
+The named-figures allowlist (Andreessen, Dalio, Acemoglu) is **explicitly
+not** sanitized — these are real public figures whose published views are
+being summarized, not anonymous "industry insiders". Per Layer 2 invariants,
+the prose around their names must paraphrase rather than fabricate quotes.
+
+## Vetting
+
+The repo's `tests/test_docx.py` exercises:
+- The structure renders end-to-end on a synthetic 8-company cohort.
+- Layer 2 audit aborts the build on a forbidden-phrase injection.
+- Layer 2 audit aborts on a fabricated number injection.
+- Sub-industry table appears when B2B SaaS rows exist.
+- Tech-stack footnote appears when unknown count > 0.
+- Traction section appears when at least one company has signals.
+- Three-POV introduction includes the three named figures.
+
+Run via `make validate-p0`.
@@ -0,0 +1,88 @@
+# ADR 0003 — Three-POV introduction frame in the memo
+
+**Date:** 2026-05-03
+**Status:** Accepted
+
+## Context
+
+The memo's purpose is to inform capital-allocation decisions, not to certify
+that a YC batch is "good" or "bad". An LLM-generated narrative without a
+named editorial frame defaults to bland techno-optimism — agents are real,
+software is eating the world, etc. — which is true but useless to a reader
+making allocation calls in 2026 against debt cycles, AI-productivity
+skepticism, and sectoral concentration.
+
+The reader needs a frame to argue *with*, not absorb. The cheapest,
+sharpest frame is to name three public voices whose views materially
+disagree and let the data set sit between them.
+
+## Decision
+
+Every memo opens with a single-paragraph "introduction" that pits three
+named public voices against each other on what the batch findings imply
+for capital allocation:
+
+- **Marc Andreessen** (a16z) — techno-optimist; AI is the dominant
+  industrial transition of our generation; concentrate capital in the
+  winners; regulation is the existential threat. Source: ["The
+  Techno-Optimist Manifesto"](https://a16z.com/the-techno-optimist-manifesto/), 2023.
+- **Ray Dalio** (Bridgewater) — paradigm-shift macro; AI is real but is
+  one factor among many; debt cycles, monetary regimes, and geopolitical
+  realignment dominate; diversify. Source: Dalio's "Principles for Dealing
+  with the Changing World Order" + ongoing LinkedIn essays.
+- **Daron Acemoglu** (2024 Nobel laureate in Economics, MIT) — AI's
+  productivity claims are likely overstated. His [2024 NBER working paper
+  10.3386/w32487](https://www.nber.org/papers/w32487) estimates total-factor productivity gains from AI
+  over the next decade at <0.66%, with the largest distributional risk
+  being labor-displacement-without-reabsorption. Investment frame: weight
+  redistributive risk and policy response, not just productivity TAM.
+
+The memo does **not** pick a winner. The job of the introduction is to
+force the reader to make a judgment, not to absorb a pre-cooked answer.
+
+## Consequences
+
+**Positive**
+- The memo becomes useful *as input to a real capital decision* rather
+  than as a tour of YC's classification taxonomy.
+- The three names provide an editorial spine that survives a YC batch
+  whose findings could be cherry-picked to support any one of them.
+- Anti-hallucination Layer 2's forbidden-phrase scan is unaffected — the
+  three figures are named real public people whose published views are
+  being summarized, not anonymous "industry insiders".
+
+**Negative**
+- We become responsible for representing each figure's view fairly.
+  Mitigated by linking to their actual published statements, paraphrasing
+  rather than fabricating quotes, and updating the doc when the figures'
+  positions evolve materially.
+- The frame is opinionated. A maintainer who wants a "neutral" memo
+  should fork or override.
+
+## Alternatives rejected
+
+- **No frame** — leaves the memo prose to defaults, i.e. techno-optimist
+  haze. Tested in PR #15; the result reads like a press release.
+- **One frame** (e.g. just Acemoglu) — replaces one bias with another.
+  Three voices disagreeing forces the reader's own synthesis.
+- **Five frames** — three is the minimum number of distinct positions
+  that span the optimist / hedger / skeptic axis. Five over-flattens.
+
+## Verification
+
+- `tests/test_docx.py::test_memo_introduction_includes_three_named_figures`
+  asserts the three names appear in the introduction paragraph.
+- The named-figures list is **not** in `FORBIDDEN_PHRASES`; appearance
+  passes Layer 2.
+- `docs/MEMO_STRUCTURE.md` is the maintainer-facing instruction; this ADR
+  is the rationale.
+
+## Updating the figures
+
+If one of the three names becomes inappropriate (e.g. retracted views,
+defamation risk, factual error), the change requires:
+1. A successor ADR (this one moves to "Superseded").
+2. A test update to the new names.
+3. A README + memo regeneration.
+
+The list is intentionally short to make this churn cheap.
@@ -12,3 +12,4 @@ The point: future-me reads three paragraphs and understands *why*, not just *wha
 
 - [0001 — Use yc-oss/api as the primary YC data source](0001-yc-data-source.md)
 - [0002 — Chrome extension talks to a local FastAPI, not Native Messaging](0002-localhost-vs-native-messaging.md)
+- [0003 — Three-POV introduction frame in the memo](0003-three-povs-memo-frame.md)
@@ -4,8 +4,12 @@ Sanitized sample artifacts. Every commit goes through `make publish-check` so PI
 
 | File | What |
 |---|---|
-| [`output/deck-w26-pr14-2026-05-01.pptx`](output/deck-w26-pr14-2026-05-01.pptx) | **PR #14 VC-style deck.** 16 slides, a16z-feel palette, matplotlib chart PNGs anchored to the same data the dashboard uses. Anti-hallucination Layer 2 ran before write. |
-| [`output/report-w26-pr15-2026-05-01.docx`](output/report-w26-pr15-2026-05-01.docx) | **PR #15 narrative memo.** 9 sections, ~47 paragraphs, 4 embedded chart PNGs. Headline finding, coverage methodology, capability heatmap with analysis, industry distribution, tech-stack/OSS-posture caveat, six company spotlights, unanswered questions. Layer 2 audit clean. |
+| [`output/dashboard-w26-pr17-2026-05-03.html`](output/dashboard-w26-pr17-2026-05-03.html) | **PR #17 dashboard — current best.** ECharts canvases now render correctly (the v0.2.0 release had a JSON-escape bug that left them blank in browsers). |
+| [`output/deck-w26-pr17-2026-05-03.pptx`](output/deck-w26-pr17-2026-05-03.pptx) | **PR #17 deck.** 17 slides; adds the three-POV slide (Andreessen / Dalio / Acemoglu) right after the TL;DR. |
+| [`output/report-w26-pr17-2026-05-03.docx`](output/report-w26-pr17-2026-05-03.docx) | **PR #17 narrative memo — current best.** Adds executive summary with Acemoglu's framing, three-POV introduction, "Inside B2B SaaS" sub-industry table, tech-stack-known-only chart with unknown footnote, full traction-signals section (73 of 105 companies surface verifiable traction). |
+| [`output/analyses-w26-pr17-2026-05-03.json`](output/analyses-w26-pr17-2026-05-03.json) | **PR #17 enrichment.** 105 high-confidence rows out of 124. 212 traction signals total across 8 kinds. |
+| [`output/deck-w26-pr14-2026-05-01.pptx`](output/deck-w26-pr14-2026-05-01.pptx) | PR #14 deck (16 slides). Kept as before-3-POV reference. |
+| [`output/report-w26-pr15-2026-05-01.docx`](output/report-w26-pr15-2026-05-01.docx) | PR #15 narrative memo. Kept as before-PR-#17 reference. |
 | [`output/dashboard-w26-pr12-2026-05-01.html`](output/dashboard-w26-pr12-2026-05-01.html) | **PR #12 dashboard — current best HTML.** Same W26 data, ECharts canvases (real heatmap, pies, bars). |
 | [`output/dashboard-w26-pr11-2026-05-01.html`](output/dashboard-w26-pr11-2026-05-01.html) | PR #11 dashboard with the depth=1 crawl but static CSS bars. Useful for comparing visual fidelity vs. PR #12. |
 | [`output/analyses-w26-pr11-2026-05-01.json`](output/analyses-w26-pr11-2026-05-01.json) | Source data for both PR #11 and PR #12 dashboards. 113/124 high-confidence. |
Original file line number	Diff line number	Diff line change
`@@ -136,6 +136,15 @@`
`136`	`136`	`"line_number": 102`
`137`	`137`	`}`
`138`	`138`	`],`
	`139`	`+ "examples/output/dashboard-w26-pr17-2026-05-03.html": [`
	`140`	`+ {`
	`141`	`+ "type": "Base64 High Entropy String",`
	`142`	`+ "filename": "examples/output/dashboard-w26-pr17-2026-05-03.html",`
	`143`	`+ "hashed_secret": "3c09e03744a49c6020501c9b7ef6218ad440976e",`
	`144`	`+ "is_verified": false,`
	`145`	`+ "line_number": 102`
	`146`	`+ }`
	`147`	`+ ],`
`139`	`148`	`"src/ycai/dashboard.py": [`
`140`	`149`	`{`
`141`	`150`	`"type": "Base64 High Entropy String",`
`@@ -146,5 +155,5 @@`
`146`	`155`	`}`
`147`	`156`	`]`
`148`	`157`	`},`
`149`		`- "generated_at": "2026-05-02T02:15:53Z"`
	`158`	`+ "generated_at": "2026-05-03T19:16:51Z"`
`150`	`159`	`}`
Original file line number	Diff line number	Diff line change
`@@ -12,3 +12,4 @@ The point: future-me reads three paragraphs and understands why, not just *wha`
`12`	`12`
`13`	`13`	`- [0001 — Use yc-oss/api as the primary YC data source](0001-yc-data-source.md)`
`14`	`14`	`- [0002 — Chrome extension talks to a local FastAPI, not Native Messaging](0002-localhost-vs-native-messaging.md)`
	`15`	`+- [0003 — Three-POV introduction frame in the memo](0003-three-povs-memo-frame.md)`