Distributed Tracing: Java + Python with Galileo via OTEL

A runnable example showing how to trace a request across a Java Spring Boot gateway and a Python FastAPI + LangGraph RAG service — with a single trace ID stitched together end-to-end in Galileo.

User ──POST /ask──▶ Java Gateway ──POST /process──▶ Python RAG Service
                      │ Spring AI (classify)           │ LangGraph (route → retrieve → generate)
                      │ OTel Java Agent                │ LangChainInstrumentor + RetrieverSpan
                      └──────────────────┬─────────────┘
                                         │ OTLP HTTP
                                         ▼
                                 OpenTelemetry Collector
                                         │ OTLP HTTP (with Galileo auth)
                                         ▼
                                      Galileo

What this demonstrates

Cross-language trace propagation — the OTel Java agent injects a W3C traceparent header on every outbound HTTP call; the Python service continues the same trace automatically
Auto-instrumentation + targeted manual spans — LangChainInstrumentor handles LangGraph nodes automatically; a Galileo RetrieverSpan surfaces retrieved source documents in the trace UI; the Java service adds gen_ai.* attributes for proper agent hierarchy rendering
Collector-agnostic — both services export plain OTLP/HTTP to OTEL_EXPORTER_OTLP_ENDPOINT. Point them at the bundled demo collector, or at whatever OTel Collector you already run.
Financial services domain — ChromaDB is pre-seeded with realistic financial policy documents (wire transfers, KYC/AML, mortgage guidelines, etc.)
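
To make the propagation mechanism concrete, the sketch below shows what a W3C traceparent header carries and how a downstream service keeps the trace ID while minting a new span ID. In this repo the OTel Java agent and the Python instrumentation handle this automatically; the helper names parse_traceparent and child_traceparent are hypothetical, use only the standard library, and cover only the version 00 header layout.

```python
import re
import secrets
from typing import NamedTuple

# Version 00 layout: version-traceid-parentid-flags, all lowercase hex.
_TRACEPARENT_RE = re.compile(
    r"(?P<version>[0-9a-f]{2})-(?P<trace_id>[0-9a-f]{32})-"
    r"(?P<parent_id>[0-9a-f]{16})-(?P<flags>[0-9a-f]{2})"
)


class TraceContext(NamedTuple):
    version: str
    trace_id: str
    parent_id: str
    flags: str


def parse_traceparent(header: str) -> TraceContext:
    """Split a traceparent header into its four fields (hypothetical helper)."""
    match = _TRACEPARENT_RE.fullmatch(header.strip())
    if match is None:
        raise ValueError(f"malformed traceparent: {header!r}")
    ctx = TraceContext(**match.groupdict())
    # The spec forbids version ff and all-zero trace or parent IDs.
    if ctx.version == "ff" or set(ctx.trace_id) == {"0"} or set(ctx.parent_id) == {"0"}:
        raise ValueError(f"invalid traceparent: {header!r}")
    return ctx


def child_traceparent(incoming: str) -> str:
    """Build the header a service would send downstream: same trace, new span."""
    ctx = parse_traceparent(incoming)
    new_span_id = secrets.token_hex(8)  # 8 random bytes = 16 hex characters
    return f"{ctx.version}-{ctx.trace_id}-{new_span_id}-{ctx.flags}"
```

Because the trace ID is copied unchanged and only the span ID is regenerated, every hop lands in the same trace, which is why the Java and Python spans appear as one tree in Galileo.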

Expected trace in Galileo

invoke_agent financial-assistant          ← Java: outer agent span (manual)
  chat classify_question                  ← Java: LLM classification (manual span)
  POST /process                           ← Java agent: auto-instrumented HTTP call
    rag_pipeline                          ← Python: WorkflowSpan (Galileo)
      LangGraph.invoke                    ← Python: auto-instrumented by LangChainInstrumentor
        route                             ← Python: LangGraph node (auto)
        retrieve                          ← Python: LangGraph node (auto)
          chromadb_search                 ← Python: RetrieverSpan with source docs (Galileo)
        generate                          ← Python: LangGraph node (auto)
          openai.chat                     ← Python: LLM span (auto, OpenAIInstrumentor)

Prerequisites

Requirement	Version / where to get it
Docker + Docker Compose	24+
OpenAI API key	—
Galileo account + API key	console.galileo.ai

Quick start

1. Clone and configure

git clone https://github.com/rungalileo/sdk-examples.git
cd sdk-examples/python/logging-samples/distributed-tracing-otel-python-java
cp .env.example .env

Edit .env and fill in your keys:

OPENAI_API_KEY=sk-...
GALILEO_API_KEY=...
GALILEO_PROJECT=distributed-tracing-demo
GALILEO_LOG_STREAM=main
GALILEO_API_URL=https://api.galileo.ai

2. Start all services

docker compose up --build

On first run, this builds the Java and Python images (~2–3 min) and seeds ChromaDB. Once Started GatewayApplication appears in the logs, everything is ready.

3. Send a question

curl -s -X POST http://localhost:8089/ask \
  -H "Content-Type: application/json" \
  -d '{"question": "What are the wire transfer limits?"}' | jq

Expected response:

{
  "answer": "Domestic wire transfers are limited to $50,000 per business day...",
  "category": "FINANCIAL",
  "sources": ["wire-transfer-policy"]
}
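
If you prefer to call the gateway from Python rather than curl, the following standard-library sketch sends the same request. The function names build_ask_request and ask are hypothetical, and the default base URL assumes the host port listed in this README.

```python
import json
import urllib.request


def build_ask_request(
    question: str, base_url: str = "http://localhost:8089"
) -> urllib.request.Request:
    """Build the same POST /ask request that the curl command sends."""
    body = json.dumps({"question": question}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/ask",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(
    question: str, base_url: str = "http://localhost:8089", timeout: float = 60.0
) -> dict:
    """Send the question to the Java gateway and return the decoded JSON reply."""
    request = build_ask_request(question, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With the stack running, ask("What are the wire transfer limits?") should return a dictionary with the answer, category, and sources keys shown above.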

4. View traces in Galileo

Open your Galileo project and navigate to the log stream you configured in .env. You should see a trace with the full span hierarchy shown above.

Service ports

Service	Host port	Purpose
Java gateway	8089	`POST /ask` — entry point
Python RAG service	8099	`POST /process` — internal
ChromaDB	8299	Vector store

Architecture

Java gateway (`java-service/`)

Spring Boot 3.4.1 + Spring AI 1.0.0 — handles routing and LLM classification
OTel Java agent (bundled in Docker image) — auto-instruments all HTTP calls and injects traceparent headers
Manual spans — two spans with gen_ai.* attributes tell Galileo to render this as an agent with an LLM sub-call; Spring AI doesn't yet emit these automatically
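
As an illustration of what gen_ai.* attributes look like, here is a small sketch that builds attribute sets for the two manual spans. The attribute keys come from the OpenTelemetry GenAI semantic conventions; which keys the Java service actually sets is an assumption, and the helper names and the model parameter are hypothetical. The sketch is in Python for brevity even though the gateway itself is Java.

```python
from typing import Dict


def agent_span_attributes(agent_name: str) -> Dict[str, str]:
    """Attributes marking a span as an agent invocation (assumed key set)."""
    return {
        "gen_ai.operation.name": "invoke_agent",
        "gen_ai.agent.name": agent_name,
    }


def chat_span_attributes(model: str) -> Dict[str, str]:
    """Attributes marking a span as an LLM chat call (assumed key set)."""
    return {
        "gen_ai.operation.name": "chat",
        "gen_ai.request.model": model,
    }


# Span names below mirror the expected trace shown earlier in this README.
outer = ("invoke_agent financial-assistant", agent_span_attributes("financial-assistant"))
```

The operation name is what lets a backend distinguish the outer agent span from the nested LLM call, so the two spans render as a parent and child rather than as two generic spans.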

Python RAG service (`python-service/`)

FastAPI + LangGraph — three-node graph: route → retrieve → generate
LangChainInstrumentor + OpenAIInstrumentor — auto-create spans for each LangGraph node and every OpenAI call
RetrieverSpan (Galileo) — wraps the ChromaDB query so retrieved documents appear as source context in Galileo
WorkflowSpan (Galileo) — top-level span for the /process endpoint

How spans reach Galileo

Both services export OTLP/HTTP to whatever endpoint OTEL_EXPORTER_OTLP_ENDPOINT points at. In this setup that endpoint is always an OpenTelemetry Collector, which batches spans, retries on failure, and forwards them to Galileo with the required auth headers. Galileo credentials live in the collector config, not in the application code.
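
For reference, this is roughly how an OTLP/HTTP exporter turns that variable into the URL it posts spans to, following the OpenTelemetry exporter specification: a signal-specific variable is used as-is, otherwise /v1/traces is appended to the base endpoint. The function name resolve_traces_endpoint is hypothetical; the real SDKs do this internally.

```python
import os
from typing import Mapping, Optional

# Default base endpoint for OTLP/HTTP per the OpenTelemetry specification.
DEFAULT_OTLP_HTTP_BASE = "http://localhost:4318"


def resolve_traces_endpoint(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the URL an OTLP/HTTP trace exporter would post to (sketch)."""
    env = os.environ if env is None else env

    # A signal-specific endpoint wins and is used exactly as given.
    explicit = env.get("OTEL_EXPORTER_OTLP_TRACES_ENDPOINT")
    if explicit:
        return explicit

    # Otherwise the base endpoint gets the per-signal path appended.
    base = env.get("OTEL_EXPORTER_OTLP_ENDPOINT") or DEFAULT_OTLP_HTTP_BASE
    return base.rstrip("/") + "/v1/traces"
```

With the bundled demo collector, for example, a base endpoint of http://otel-collector:4318 resolves to http://otel-collector:4318/v1/traces.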

Using the bundled demo collector

For running this repo end-to-end on a laptop, docker compose up starts a minimal collector alongside the two services. Its config is in otel-collector/config.yaml. Services automatically point at it:

OTEL_EXPORTER_OTLP_ENDPOINT=http://otel-collector:4318

The bundled collector is a demo convenience, not something to deploy in production.

Using your own existing collector

In any real environment you almost certainly already run an OTel Collector. To point the services at it instead:

1. Add a Galileo exporter to your existing collector config (see otel-collector/config.yaml for the otlphttp/galileo block to copy).
2. Set OTEL_EXPORTER_OTLP_ENDPOINT in .env (or your deployment config) to your collector's OTLP/HTTP endpoint, e.g. https://otel-collector.internal.example.com:4318.
3. Remove the otel-collector service from docker-compose.yml (or just stop pointing at it).

No application code changes are needed; the instrumentation stays identical.

Troubleshooting

Java service fails to start

Check that OPENAI_API_KEY is set in .env
Maven downloads dependencies on first build; allow 2–3 min

No traces in Galileo

Verify GALILEO_API_KEY and GALILEO_API_URL in .env
Check that GALILEO_PROJECT and GALILEO_LOG_STREAM match the project and log stream you're viewing in the UI

ChromaDB connection refused

The Python service waits up to 60 s for ChromaDB to become reachable; if the error persists, check docker compose logs python-service
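
A startup wait like this is typically a bounded retry loop. The sketch below is one way to write it and is not taken from the service's code: wait_until_ready and its parameters are hypothetical, and the check callable stands in for whatever probe the service uses against ChromaDB.

```python
import time
from typing import Callable


def wait_until_ready(
    check: Callable[[], bool],
    timeout_s: float = 60.0,
    interval_s: float = 1.0,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll check() until it returns True or timeout_s elapses."""
    deadline = clock() + timeout_s
    while True:
        try:
            if check():
                return True
        except OSError:
            # Covers ConnectionRefusedError while the dependency is still starting.
            pass
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

Returning False instead of raising lets the caller decide whether to exit with a clear log message, which is the kind of output to look for in the python-service logs when this symptom appears.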

Empty answers

Confirm OPENAI_API_KEY is valid and has quota
Try a question in the FINANCIAL category (e.g., "What is the minimum credit score for a mortgage?")
