Skip to content

Commit 753ddfe

Browse files
authored
Merge pull request anthropics#526 from anthropics/sre-incident-response-agent-fixes
fix(managed_agents): Fix console image, missing fixture, and summary blurb
2 parents b96dc04 + 355d201 commit 753ddfe

3 files changed

Lines changed: 46 additions & 11 deletions

File tree

Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
2026-04-07T03:08:11Z INFO checkout-svc starting build=2026.04.06-r3 pod=checkout-svc-7d9f6b8c4-xk2pq
2+
2026-04-07T03:08:11Z INFO config loaded env=prod region=us-east
3+
2026-04-07T03:08:12Z INFO http listening on :8080
4+
2026-04-07T03:08:12Z INFO connected to redis cart-cache:6379
5+
2026-04-07T03:08:13Z INFO connected to postgres orders-db:5432 pool=20
6+
2026-04-07T03:08:14Z INFO GET /healthz 200 1ms
7+
2026-04-07T03:08:31Z INFO POST /v1/checkout 200 84ms order_id=ord_88121
8+
2026-04-07T03:08:33Z INFO POST /v1/checkout 200 91ms order_id=ord_88122
9+
2026-04-07T03:08:47Z INFO pricing cache warmed entries=14092
10+
2026-04-07T03:09:02Z INFO POST /v1/checkout 200 77ms order_id=ord_88123
11+
2026-04-07T03:09:18Z WARN heap 101MiB / 128MiB (79%)
12+
2026-04-07T03:09:41Z INFO POST /v1/checkout 200 112ms order_id=ord_88124
13+
2026-04-07T03:09:55Z WARN heap 118MiB / 128MiB (92%)
14+
2026-04-07T03:10:03Z WARN gc pause 412ms
15+
2026-04-07T03:10:09Z ERROR allocation failed: cannot allocate 8MiB (heap exhausted)
16+
2026-04-07T03:10:09Z FATAL OutOfMemoryError in pricing.recompute
17+
2026-04-07T03:10:10Z INFO k8s: container checkout-svc OOMKilled (exit 137)
18+
2026-04-07T03:10:10Z INFO k8s: restarting container checkout-svc (restartCount=1)
19+
2026-04-07T03:10:14Z INFO checkout-svc starting build=2026.04.06-r3 pod=checkout-svc-7d9f6b8c4-xk2pq
20+
2026-04-07T03:10:14Z INFO http listening on :8080
21+
2026-04-07T03:10:15Z INFO connected to redis cart-cache:6379
22+
2026-04-07T03:10:16Z INFO connected to postgres orders-db:5432 pool=20
23+
2026-04-07T03:10:44Z INFO pricing cache warmed entries=14092
24+
2026-04-07T03:11:01Z WARN heap 104MiB / 128MiB (81%)
25+
2026-04-07T03:11:29Z WARN heap 121MiB / 128MiB (94%)
26+
2026-04-07T03:11:36Z ERROR allocation failed: cannot allocate 8MiB (heap exhausted)
27+
2026-04-07T03:11:36Z FATAL OutOfMemoryError in pricing.recompute
28+
2026-04-07T03:11:37Z INFO k8s: container checkout-svc OOMKilled (exit 137)
29+
2026-04-07T03:11:37Z INFO k8s: restarting container checkout-svc (restartCount=2)
30+
2026-04-07T03:11:41Z INFO checkout-svc starting build=2026.04.06-r3 pod=checkout-svc-7d9f6b8c4-xk2pq
31+
2026-04-07T03:11:41Z INFO http listening on :8080
32+
2026-04-07T03:12:08Z INFO pricing cache warmed entries=14092
33+
2026-04-07T03:12:39Z WARN heap 119MiB / 128MiB (93%)
34+
2026-04-07T03:12:51Z FATAL OutOfMemoryError in pricing.recompute
35+
2026-04-07T03:12:52Z INFO k8s: container checkout-svc OOMKilled (exit 137)
36+
2026-04-07T03:12:52Z INFO k8s: restarting container checkout-svc (restartCount=3)
37+
2026-04-07T03:12:52Z WARN k8s: back-off restarting failed container (CrashLoopBackOff)
38+
2026-04-07T03:13:22Z INFO k8s: restarting container checkout-svc (restartCount=4)
39+
2026-04-07T03:14:01Z INFO k8s: container checkout-svc OOMKilled (exit 137)
40+
2026-04-07T03:14:01Z WARN k8s: back-off restarting failed container (CrashLoopBackOff)

managed_agents/sre_incident_responder.ipynb

Lines changed: 2 additions & 7 deletions
Large diffs are not rendered by default.

registry.yaml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -43,13 +43,13 @@
4343
- Agent Patterns
4444
- Integrations
4545
- title: Build an SRE incident response agent with Claude Managed Agents
46-
description: A webhook-triggered responder that investigates logs and runbooks
47-
with a custom Skill, fixes infrastructure code, and gates the PR behind a
48-
human-approval custom tool — with the full audit trail in the Console.
46+
description: 'Wire Claude into your on-call flow: when an alert fires, the agent
47+
reads logs and runbooks, pinpoints the root cause, opens a fix PR, and waits
48+
for your approval before merging.'
4949
path: managed_agents/sre_incident_responder.ipynb
5050
authors:
5151
- gaganb-ant
52-
date: '2026-04-08'
52+
date: '2026-04-10'
5353
categories:
5454
- Agent Patterns
5555
- Observability

0 commit comments

Comments
 (0)