agents/claude-code/mlflow-tracing.md (5 additions & 5 deletions)
@@ -8,7 +8,7 @@ Deploy Claude Code as a containerized agent on Red Hat OpenShift AI and wire it
### Summary
-Agent-level instrumentation via `mlflow autolog claude` works out of the box with any backend. Swapping Vertex AI for vLLM or OGX produces the same trace schema with no changes to the tracing setup. If server-side OGX OTel spans are needed in future, they would be added to the Claude Code stop hook.
+Agent-level instrumentation via `mlflow autolog claude` works out of the box with any backend. Swapping Vertex AI for vLLM or OGX produces the same trace schema with no changes to the tracing setup. If server-side metrics are needed in future (e.g. per-hop latency, routing decisions), those would come from OGX or vLLM emitting their own OTel spans; the Claude Code hook only captures local agent-side data.
### OGX Telemetry Capabilities
@@ -47,7 +47,7 @@ This works the same whether the backend is Vertex AI, vLLM directly, or OGX →
### Integration Path
-The Claude Code stop hook is the right integration path. It already captures everything out of the box (tool calls, token usage, latency, session ID) and works the same across Vertex AI, vLLM, and OGX without any changes. If additional server-side metrics are needed (e.g. per-hop vLLM latency, OGX routing decisions), they can be added directly to the same hook since the infrastructure is already there.
+The Claude Code stop hook is the right integration path for agent-level tracing. It captures tool calls, token usage, latency, and session ID out of the box, and works the same across Vertex AI, vLLM, and OGX without any changes. If additional server-side metrics are needed (e.g. per-hop vLLM latency, OGX routing decisions), those would require OGX or vLLM to emit their own OTel spans separately.
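
The split between agent-side and server-side data described in this change can be illustrated with a small, self-contained sketch. Everything here is an assumption made for illustration: the `Span` record, the `source` field, the span names, and the `summarize` helper are hypothetical and do not reflect the actual MLflow or OpenTelemetry schema. The point is only that a trace built from the stop hook alone has no server-side timing until OGX or vLLM contributes spans of its own.

```python
from dataclasses import dataclass, field


@dataclass
class Span:
    """Hypothetical, simplified span record (not the MLflow or OTel schema)."""
    trace_id: str
    name: str
    source: str  # "agent" = Claude Code stop hook, "server" = OGX or vLLM
    duration_ms: float
    attributes: dict = field(default_factory=dict)


def summarize(spans):
    """Aggregate spans per trace, keeping agent-side and server-side time apart."""
    summary = {}
    for span in spans:
        entry = summary.setdefault(
            span.trace_id,
            {"agent_ms": 0.0, "server_ms": 0.0, "tool_calls": 0, "total_tokens": 0},
        )
        entry[f"{span.source}_ms"] += span.duration_ms
        if span.name.startswith("tool:"):
            entry["tool_calls"] += 1
        entry["total_tokens"] += span.attributes.get("input_tokens", 0)
        entry["total_tokens"] += span.attributes.get("output_tokens", 0)
    return summary


# What the stop hook alone can see: tool calls, token usage, latency, session ID.
agent_spans = [
    Span("t1", "tool:Bash", "agent", 120.0, {"session_id": "s1"}),
    Span("t1", "llm_call", "agent", 900.0,
         {"session_id": "s1", "input_tokens": 50, "output_tokens": 20}),
]

# Server-side detail only appears if OGX or vLLM emits its own span.
server_spans = [Span("t1", "vllm.generate", "server", 850.0)]

agent_only = summarize(agent_spans)
combined = summarize(agent_spans + server_spans)
```

In this sketch `agent_only` reports zero server-side time, while `combined` gains it only once a span with `source="server"` is present, which mirrors the reasoning in the revised paragraph.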
---
@@ -146,11 +146,11 @@ MLflow integration works. Follow this guide to hook Claude Code, OGX, and MLflow
The following must already be running on the cluster:
-- Claude Code container deployed (see [agents/claude/claude_agent](https://github.com/red-hat-data-services/agentic-starter-kits/tree/main/agents/claude/claude_agent))
+- Claude Code container deployed (see [deployment/deployment.yaml](deployment/deployment.yaml))
- OGX deployed and serving a model
- MLflow instance running via the ODH/RHOAI operator with a workspace matching your namespace