
@abhay-sheshadri

Added support for passing LoRA requests through to `vllm.trace`:

```python
# Test with a LoRA adapter
with vllm.trace(test_prompts, temperature=0.0, max_tokens=500, lora_request=lora_request) as tracer:
    lora_results = vllm.generator.output.save()
```
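
For context, `lora_request` above would be a vLLM `LoRARequest`. A minimal construction sketch, assuming a locally saved adapter (the adapter name, integer ID, and path are placeholders):

```python
from vllm.lora.request import LoRARequest

# Placeholder adapter: (name, unique integer id, local path to the adapter weights)
lora_request = LoRARequest("my_adapter", 1, "/path/to/lora_adapter")
```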

abhay-sheshadri and others added 2 commits September 2, 2025 23:29
This commit adds the ability to pass custom metadata (like request_id) to
tracer.invoke() calls and access it within the trace context. This solves
issues with request/response alignment in batched inference scenarios where
lazy evaluation can cause mismatched completions.

Changes:
- Mediator: Add custom_data parameter to store arbitrary metadata
- Invoker: Extract custom parameters (like request_id) from kwargs before
  passing to batcher, store in mediator's custom_data
- Access pattern: Use vllm._interleaver.current.custom_data.get('request_id')
  within the trace context (see the sketch after this list)
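
A minimal sketch of how these pieces fit together. Here `vllm` is the traced model wrapper and `prompts` is a list of strings (both assumed for illustration); the `request_id` kwarg on `invoke()` and the `custom_data` access path come from the commit notes above, everything else is hypothetical:

```python
# Hedged sketch: tag each invoke with a request_id and read it back inside
# the trace context to realign completions after batched, lazy evaluation.
results = {}

with vllm.trace(temperature=0.0, max_tokens=500) as tracer:
    for i, prompt in enumerate(prompts):
        # request_id is not a generation kwarg: per this commit, the Invoker
        # extracts it from kwargs before passing the rest to the batcher and
        # stores it in the mediator's custom_data.
        with tracer.invoke(prompt, request_id=f"req-{i}"):
            # Read the metadata back inside the trace context.
            rid = vllm._interleaver.current.custom_data.get('request_id')
            results[rid] = vllm.generator.output.save()
```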

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>