Skip to content

Commit aa62a46

Browse files
committed
docs: fix voice, deduplicate results, link deployment files, clarify OTel scope
1 parent 590b276 commit aa62a46

1 file changed

Lines changed: 5 additions & 5 deletions

File tree

agents/claude-code/mlflow-tracing.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ Deploy Claude Code as a containerized agent on Red Hat OpenShift AI and wire it
88

99
### Summary
1010

11-
Agent-level instrumentation via `mlflow autolog claude` works out of the box with any backend. Swapping Vertex AI for vLLM or OGX produces the same trace schema with no changes to the tracing setup. If server-side OGX OTel spans are needed in future, they would be added to the Claude Code stop hook.
11+
Agent-level instrumentation via `mlflow autolog claude` works out of the box with any backend. Swapping Vertex AI for vLLM or OGX produces the same trace schema with no changes to the tracing setup. If server-side metrics are needed in future (e.g. per-hop latency, routing decisions), those would come from OGX or vLLM emitting their own OTel spans — the Claude Code hook only captures local agent-side data.
1212

1313
### OGX Telemetry Capabilities
1414

@@ -47,7 +47,7 @@ This works the same whether the backend is Vertex AI, vLLM directly, or OGX →
4747

4848
### Integration Path
4949

50-
The Claude Code stop hook is the right integration path. It already captures everything out of the box — tool calls, token usage, latency, session ID — and works the same across Vertex AI, vLLM, and OGX without any changes. If additional server-side metrics are needed (e.g. per-hop vLLM latency, OGX routing decisions), they can be added directly to the same hook since the infrastructure is already there.
50+
The Claude Code stop hook is the right integration path for agent-level tracing. It captures tool calls, token usage, latency, and session ID out of the box — and works the same across Vertex AI, vLLM, and OGX without any changes. If additional server-side metrics are needed (e.g. per-hop vLLM latency, OGX routing decisions), those would require OGX or vLLM to emit their own OTel spans separately.
5151

5252
---
5353

@@ -146,11 +146,11 @@ MLflow integration works. Follow this guide to hook Claude Code, OGX, and MLflow
146146

147147
The following must already be running on the cluster:
148148

149-
- Claude Code container deployed (see [agents/claude/claude_agent](https://github.com/red-hat-data-services/agentic-starter-kits/tree/main/agents/claude/claude_agent))
149+
- Claude Code container deployed (see [deployment/deployment.yaml](deployment/deployment.yaml))
150150
- OGX deployed and serving a model
151151
- MLflow instance running via the ODH/RHOAI operator with a workspace matching your namespace
152152

153-
### Step-by-Step Setup
153+
### Step-by-Step Setup (following the [deployment guide](deployment/README.md), adding MLflow-specific steps below)
154154

155155
#### 1. Add Python + MLflow to the Containerfile
156156

@@ -198,7 +198,7 @@ oc adm policy add-role-to-user edit -z default -n <your-namespace>
198198
value: "vllm/<your-model-name>"
199199
```
200200
201-
#### 5. Wire up autolog in the entrypoint
201+
#### 5. Wire up autolog in the [entrypoint](deployment/entrypoint.sh)
202202
203203
The entrypoint runs `mlflow autolog claude` at startup and injects auth into the generated `.claude/settings.json`:
204204

0 commit comments

Comments
 (0)