# Exercise 4: Metrics Watcher

A finished Go worker that joins the tailnet via `tsnet`, scrapes
`node_exporter` metrics from a pre-provisioned `metrics-server` VM, and
asks Claude (via Aperture) for a plain-English health summary on a
schedule.

## Goal

Run the Exercise 2 `tsnet` pattern against real services: pull
`node_exporter` metrics off the tailnet, summarize them with Claude
via Aperture, and schedule the whole thing with Temporal. The code is
complete. Run it, tune the cadence, and watch the runs in the Temporal UI.

## Background

### Topology

```mermaid
flowchart LR
    VM[Your Instruqt VM<br/>Go worker]
    TS[Temporal Dev Server<br/>temporal-dev:7233 / :8233]
    MS[metrics-server<br/>node_exporter :9100]
    AP[Aperture<br/>API Gateway]
    VM <-. Tailnet .-> TS
    VM <-. Tailnet .-> MS
    VM <-. Tailnet .-> AP
```

### What's different from Exercise 2

- **Temporal Schedule** with `TriggerImmediately`: fires once on start, then every `HEALTH_CHECK_INTERVAL` (default `10m`). The schedule lives durably on the server.
- Data comes from another tailnet node (`metrics-server:9100`), not the public internet.
- The LLM call goes to **Claude via Aperture**: the same gateway as Exercise 3, a different vendor.
- The workflow returns a structured `HealthReport` instead of a string, so the Temporal UI renders each field cleanly.

### What's already built for you

- `main.go`: joins the tailnet via `tsnet`, dials Temporal, and creates the Schedule.
- `activities.go`: `FetchMetrics` and `AnalyzeMetrics` (which returns a `HealthReport`).
- `workflow.go`: `HealthCheckWorkflow` chains the two activities.
- Tests: run offline with `go test ./...`.

## Run it

### Step 1: Go to the practice directory

```bash
cd exercises/04_go_agent/practice
go mod download
```

### Step 2: Start the worker

```bash
WORKSHOP_USER_ID=$WORKSHOP_USER_ID \
TS_AUTHKEY=tskey-auth-<your-key> \
METRICS_URL=http://metrics-server:9100/metrics \
go run .
```

First run takes 10-30 seconds while `tsnet` registers the node. After that:

```
level=INFO msg="joined tailnet" hostname=<you>-metrics-worker userID=<you>
level=INFO msg="connected to temporal" host=temporal-dev:7233
level=INFO msg="metrics reachable" url=http://metrics-server:9100/metrics
level=INFO msg="created schedule" id=<you>-health-check-schedule interval=10m0s workflow=<you>-health-check
```

The schedule fires immediately. You'll see a completed workflow in the Temporal UI within seconds.

### Step 3: Watch it in the Temporal UI

Open `http://temporal-dev:8233`. There are two places to look:

- **Schedules**: click `<you>-health-check-schedule` to see the interval, the next fire time, and recent fires.
- **Workflows**: search for `<you>-health-check`. Each completed row (the ID is suffixed with the schedule fire time) has the `HealthReport` in its Result panel.

### Step 4: Tune the cadence

10m is too slow to watch during the workshop. Press `Ctrl+C` and restart with a shorter interval:

```bash
HEALTH_CHECK_INTERVAL=2m \
WORKSHOP_USER_ID=$WORKSHOP_USER_ID \
TS_AUTHKEY=tskey-auth-<your-key> \
METRICS_URL=http://metrics-server:9100/metrics \
go run .
```

Any Go duration string works (`30s`, `5m`, `1h`). The worker recreates the schedule on startup, so the new interval takes effect as soon as you restart.

### Step 5: Customize the Claude prompt

Open `activities.go` and find `AnalyzeMetrics`. Change the prompt: request a different field, flag anything unusual, whatever you like. Restart the worker and watch the `HealthReport` change on the next fire.

## Environment variables

| Variable | Required | Default | Description |
|----------|----------|---------|-------------|
| `TS_AUTHKEY` | yes | (none) | Tailscale auth key. Required on the first run; `tsnet` reuses its stored state afterward. |
| `METRICS_URL` | yes | (none) | `node_exporter` endpoint on the tailnet. |
| `WORKSHOP_USER_ID` | no | `lab` | Prefixes the hostname, task queue, and workflow ID. |
| `HEALTH_CHECK_INTERVAL` | no | `10m` | Cadence as a Go duration (`30s`, `5m`, `1h`). |
| `TEMPORAL_HOST` | no | `temporal-dev:7233` | Temporal server address. |
| `AI_URL` | no | `http://ai` | Aperture endpoint. |
| `AI_MODEL` | no | `claude-haiku-4-5` | Claude model. |

## Run the tests

```bash
go test ./...
```

Tests mock `node_exporter` and Aperture with `httptest.Server`, so no tailnet is needed.

## What you've learned

- `tsnet.Dial` works for both tailnet-internal HTTP and gRPC.
- Aperture is model-agnostic: Anthropic here, OpenAI in Exercise 3.
- Temporal Schedules created with `TriggerImmediately` fire once right away, then on every interval, with the next fire time visible in the UI.
- All three backing services are tailnet-only; Tailscale identity is the auth layer.