You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Tagline updated to mention LLM observability
- New 'LLM Observability' section after Quick Start with real demo
console output, per-field breakdown table, and four unique selling
points (cost-per-call, PII exposure, injection detection,
llm-dominates-request latency correlation)
- App Type Presets table: add 'llm' row and common combo examples
- Events Reference: add 'llm' event row; expand 'anomaly' to list all
four LLM anomaly types including llm-dominates-request
- Production Safety Reference: add .withLLMTracing() row
- withLLMTracing section: fix console output format to match real
output; add llm-dominates-request to anomaly events table
17.[Self-Host Your OTLP Endpoint](#self-host-your-otlp-endpoint)
51
+
18.[Roadmap](#roadmap)
52
+
19.[License](#license)
52
53
53
54
---
54
55
@@ -60,6 +61,7 @@ Standard APM products either require heavy agents, compile steps, or sacrifice d
60
61
-**AST-first privacy** — SQL/NoSQL query values are shredded at the AST layer before they ever touch a metric
61
62
-**Entropy-checked logs** — Shannon entropy scanning strips JWT tokens, API keys, and any other high-entropy string from `console` payloads automatically
62
63
-**Zero prototype pollution** — all DB interception goes through `node:diagnostics_channel`, the official Node.js observability primitive
64
+
-**LLM-aware** — intercepts OpenAI and Anthropic SDK calls to surface cost, token usage, PII exposure, and prompt injection attempts with zero code changes
Add `appType: 'llm'` and Argus intercepts every OpenAI and Anthropic call — cost per request, token counts, PII exposure, and prompt injection attempts, all in a single console line with zero code changes:
1.**Your real LLM bill, per request.** Not an estimate — computed from the actual token counts the model reports. Cost spike detection fires automatically when a single call runs 10× over your rolling average.
113
+
114
+
2.**Your users' emails are in those prompts.** Argus redacts PII (emails, phone numbers, SSNs, card numbers, IPs) from the telemetry record before export. The raw prompt reaches the model unchanged — your observability data never sees it.
115
+
116
+
3.**Prompt injection attempts, logged before damage is done.** Six regex patterns covering `ignore previous instructions`, role-override, and data-exfil attempts. Wire one listener to your security log.
117
+
118
+
4.**When your LLM owns your latency budget.** The `llm-dominates-request` rule fires an `'anomaly'` event when LLM time exceeds 80% of the HTTP request duration — the exact signal you need to decide whether to cache, stream, or move the call off the hot path.
119
+
120
+
```typescript
121
+
const agent =awaitArgusAgent.createProfile({
122
+
environment: "prod",
123
+
appType: ["web", "llm"], // or just "llm"
124
+
}).start();
125
+
126
+
// That's it. All OpenAI / Anthropic calls are traced from this point.
127
+
128
+
// Optional: react to anomalies
129
+
agent.on("anomaly", (event) => {
130
+
if (event.type==="llm-dominates-request") {
131
+
// LLM took >80% of the HTTP request budget — consider caching or streaming
132
+
}
133
+
if (event.type==="llm-cost-spike") {
134
+
// Single call cost spiked 10× — worth investigating
0 commit comments