Fix llm_request trace context propagation by will-deines · Pull Request #38678 · vllm-project/vllm

will-deines · 2026-04-01T02:34:57Z

Summary

preserve request-scoped trace context for llm_request instead of falling back to ambient process environment state during final span emission
normalize propagated trace headers case-insensitively and carry them explicitly through serving and engine request processing
add regression coverage asserting llm_request stays in the parent distributed trace while receiving its own unique span id

Duplicate-work check

The only related open PR I found is [Frontend][Tracing] Add support for tracing aborted requests #32162, which adds tracing for aborted requests. It does not address stale trace header reuse, split traces, or llm_request span identity.
This PR supersedes closed PR Fix llm_request trace context propagation #38672, which was closed because it was based on the wrong fork base. This replacement is based directly on vllm-project/vllm:main.

Testing

./.venv/bin/python -m pytest tests/tracing/test_otel.py -v -s (5 passed)
PRE_COMMIT_HOME=/tmp/pre-commit-vllm-upstream-tracing XDG_CACHE_HOME=/tmp/xdg-cache-vllm-upstream-tracing NPM_CONFIG_CACHE=/tmp/npm-cache-vllm-upstream-tracing ./.venv/bin/pre-commit run --files vllm/tracing/utils.py vllm/tracing/otel.py vllm/tracing/__init__.py vllm/entrypoints/openai/engine/serving.py vllm/entrypoints/pooling/base/serving.py vllm/v1/engine/input_processor.py vllm/v1/engine/output_processor.py tests/tracing/test_otel.py (passed)

AI assistance

AI assistance was used to draft and implement this change. The submitter reviewed the final diff and validated the tests above.

Assisted-by: OpenAI Codex Signed-off-by: Will Deines <will@garr.io>

Co-authored-by: OpenAI Codex Signed-off-by: Will Deines <will@garr.io>

gemini-code-assist

Code Review

This pull request enhances OpenTelemetry tracing by improving trace context propagation and header handling. Key changes include making trace header extraction case-insensitive, adding a mechanism to retrieve trace headers from the current OTel context when request headers are missing, and introducing a use_environment_context flag to control fallback behavior to environment variables. Additionally, trace headers are now normalized in the V1 engine input processor, and comprehensive tests have been added to verify these improvements. I have no feedback to provide.

garrio-1 added 3 commits March 31, 2026 22:30

Fix llm_request trace context propagation

d4e635b

Assisted-by: OpenAI Codex Signed-off-by: Will Deines <will@garr.io>

Add llm_request parent trace regression test

58bc84a

Assisted-by: OpenAI Codex Signed-off-by: Will Deines <will@garr.io>

test: reset tracer provider in tracing tests

9343556

Co-authored-by: OpenAI Codex Signed-off-by: Will Deines <will@garr.io>

will-deines requested review from DarkLight1337, aarnphm, chaunceyjiang, njhill, noooop and russellb as code owners April 1, 2026 02:34

mergify bot added frontend v1 labels Apr 1, 2026

gemini-code-assist bot reviewed Apr 1, 2026

View reviewed changes

DarkLight1337 requested review from markmc and robertgshaw2-redhat April 1, 2026 03:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix llm_request trace context propagation#38678

Fix llm_request trace context propagation#38678
will-deines wants to merge 3 commits intovllm-project:mainfrom
will-deines:fix/llm-request-tracing-context

will-deines commented Apr 1, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

will-deines commented Apr 1, 2026

Summary

Duplicate-work check

Testing

AI assistance

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants