Skip to content

Commit 8ff0630

Browse files
nammnjoshiiclaude
andcommitted
docs: add sensitive data protection rule + tool evaluation draft
- CLAUDE.md: add Sensitive Data Protection section documenting the hook-based private tag behavior for .env reads - drafts/tool-evaluation/linkedin.md: LinkedIn post draft from claude-mem evaluation session (stochastic consensus + council methodology) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent 96625b2 commit 8ff0630

2 files changed

Lines changed: 70 additions & 0 deletions

File tree

CLAUDE.md

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,17 @@ Your job is to implement, maintain, and extend this pipeline correctly. Every ac
88

99
---
1010

11+
## Sensitive Data Protection (Automatic)
12+
13+
A Claude Code hook fires automatically before any `.env` file read and emits a `SENSITIVE FILE DETECTED` warning. When that warning appears:
14+
15+
- Wrap ALL displayed content in `<private>...</private>` tags immediately.
16+
- Do not echo raw API key values in responses, diffs, or logs.
17+
- Keys in scope: `ANTHROPIC_API_KEY`, `RESEND_API_KEY`, `SERPAPI_KEY`, `HUNTER_API_KEY`.
18+
- This applies even when no warning fires — treat `.env` content as private by default.
19+
20+
---
21+
1122
## Non-Negotiable Rules
1223

1324
**Read `docs/operation/antipatterns.md` before any code change. It overrides all other files on conflict.**

drafts/tool-evaluation/linkedin.md

Lines changed: 59 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,59 @@
1+
# Draft: Tool Evaluation via Stochastic Multi-Agent Consensus + Consciousness Council
2+
3+
**Platform:** LinkedIn
4+
**Target:** Technical operators, AI engineers, senior PMs running AI-assisted workflows
5+
**Hook:** Most people install dev tools in 5 minutes. I used 11 AI agents first.
6+
7+
---
8+
9+
## Post Draft
10+
11+
Most engineers evaluate a new dev tool the same way: read the README, skim the issues, run the install command.
12+
13+
That works fine for low-stakes tools. Not for tools that touch every session, capture everything you do, and inject context into every future conversation.
14+
15+
For claude-mem — a persistent memory compression system for Claude Code — I ran two rounds of structured AI deliberation before installing anything.
16+
17+
**Round 1: Stochastic Multi-Agent Consensus**
18+
19+
Five independent agents. Same question. Different temperature framings. Zero shared context between them.
20+
21+
Operational Efficiency Analyst. Systems Architect. Developer Experience Expert. Risk Analyst. Creative Explorer.
22+
23+
Each generated its own take on the value of the tool. Then I aggregated by semantic clustering — which ideas appeared across multiple agents independently?
24+
25+
Two use cases hit 0.80 confidence (4 of 5 agents agreed without seeing each other):
26+
- Cold-start elimination: every session currently wastes 10-15 minutes re-establishing context that died at the last session boundary
27+
- Automatic incident context capture: when a test breaks in session N+1, the observation log from session N is recoverable
28+
29+
One insight hit 0.60: the tool creates a two-layer memory architecture (episodic + semantic) that didn't exist before.
30+
31+
**Round 2: Consciousness Council**
32+
33+
Six roles. Tech and business. Genuine conflict required.
34+
35+
Security Engineer. Platform Lead. DevOps Engineer. Token Budget Analyst. Program Manager. Staff Engineer/Skeptic.
36+
37+
The consensus had missed something the Security Engineer caught immediately: the default posture of the tool is capture-everything unless you opt out. Nobody had asked "what's the sensitive perimeter and how do you exclude it?"
38+
39+
That's the Blind Spot the council is designed to surface — the question behind the question.
40+
41+
The council produced a 7-step install path with a core tension named: do you trust third-party AI summaries of your work enough to let them influence your session context automatically, or do you use the search tools explicitly and keep automatic injection off?
42+
43+
**What this changes:**
44+
45+
Tool evaluation isn't a README read. It's a structured decision with technical, security, and business dimensions that don't all show up in the same place.
46+
47+
The stochastic consensus tells you what the tool is actually good for — with a confidence score.
48+
49+
The council tells you how to implement it safely — with a named tension and a blind spot.
50+
51+
Neither alone is sufficient. Together they cost about 20 minutes and surface everything a solo assessment misses.
52+
53+
The install took 3 minutes. The evaluation took 20. The evaluation was worth it.
54+
55+
---
56+
57+
**Title:** How I evaluated a new dev tool with 11 AI agents before installing it
58+
**Hook:** Most people install dev tools in 5 minutes. I used 11 AI agents first.
59+
**Why it matters:** Reusable methodology for any tool evaluation with multi-dimensional trade-offs — applicable well beyond AI tooling

0 commit comments

Comments
 (0)