Refactor of vLLM bridge #1914

keshavb96 · 2026-01-23T06:56:02Z

The purpose of this PR is to refactor the existing JAX <> vLLM bridge to expose an interface that is easier to use for users who want to avoid Tunix. The main change is the addition of a framework-agnostic interface, VLLMRolloutEngine, which an external RL framework can inherit from or wrap to use as the entry point to the bridge's functionality.

jreiffers · 2026-01-26T10:26:57Z

jax-inference-offloading/jax_inference_offloading/api/types.py

+        """
+
+        def pad_left(seq: List[int], length: int, pad_value: int) -> List[int]:
+            seq = seq[:length]  # Truncate if too long


Hmm, silently truncating feels wrong.

Updated so that it no longer truncates silently.
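
For illustration, a non-truncating variant could look like the sketch below. It is not the code in the PR: the name `pad_left_strict` is hypothetical, and it assumes that padding is prepended and that an over-long sequence should raise `ValueError` rather than be cut.

```python
from typing import List


def pad_left_strict(seq: List[int], length: int, pad_value: int) -> List[int]:
    """Left-pad `seq` to `length`; refuse to truncate (hypothetical helper)."""
    if len(seq) > length:
        # Fail loudly so the caller notices the length mismatch.
        raise ValueError(f"Sequence too long: {len(seq)} > {length}")
    # Prepend padding so the original tokens stay at the right edge.
    return [pad_value] * (length - len(seq)) + list(seq)
```

A caller that really wants truncation can then slice explicitly before padding, which keeps the data loss visible at the call site.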

jreiffers · 2026-01-26T10:28:03Z

jax-inference-offloading/jax_inference_offloading/api/types.py

+        """Get all generated token ID sequences."""
+        return [c.token_ids for c in self.completions]
+
+    def to_arrays(


This seems to be unused? Am I missing something?

I have a simpler debug example of an RL loop that uses it without Tunix; I just pushed that example as well.

jreiffers · 2026-01-26T10:28:36Z

jax-inference-offloading/jax_inference_offloading/engines/vllm_engine.py

+                - Dict[str, jax.Array]: Direct flattened params
+                - flax.nnx.State: Flax state object
+                - flax.nnx.Module: Flax module (state extracted automatically)
+            block: If True, wait for transfer completion (always True currently).


Why is this flag there if it doesn't do anything?

jreiffers · 2026-01-26T10:30:19Z

jax-inference-offloading/jax_inference_offloading/integrations/tunix/rollout.py

+                    assert len(original) <= length, f"Sequence too long: {len(original)} > {length}"
+                    return original + [pad_value] * (length - len(original))
+
+                for i, completion in enumerate(output.completions):


Leftover debug output?

jreiffers · 2026-01-26T10:31:57Z

jax-inference-offloading/jax_inference_offloading/integrations/tunix/rollout.py

+                input_tokens = []
+                output_tokens = []
+
+                def pad_to_left(original: List[int], length: int, pad_value: int) -> List[int]:


There's another implementation of this in api/types.py.
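
One way to remove the duplication would be a single shared helper that both call sites import. The sketch below is an assumption, not code from the PR: `pad_to_length` and its `side` parameter are hypothetical names, and where such a helper would live in the package is left open.

```python
from typing import List, Literal


def pad_to_length(
    seq: List[int],
    length: int,
    pad_value: int,
    side: Literal["left", "right"] = "left",
) -> List[int]:
    """Pad `seq` to exactly `length` on the given side (hypothetical helper)."""
    if side not in ("left", "right"):
        raise ValueError(f"Unknown side: {side!r}")
    if len(seq) > length:
        # Never truncate implicitly; the caller decides how to handle overflow.
        raise ValueError(f"Sequence too long: {len(seq)} > {length}")
    padding = [pad_value] * (length - len(seq))
    # side="left" prepends the padding, side="right" appends it.
    return padding + list(seq) if side == "left" else list(seq) + padding
```

With one helper, the side on which padding is added becomes an explicit argument instead of being implied by two similarly named functions.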

keshavb96 added 3 commits January 20, 2026 16:28

bug fixes and upgrade vLLM to 0.12.0

8eb6db3

Merge branch 'main' into vllm_bridge

12d2a99

Jax <> vLLM bridge refactor

d39c2a8

jreiffers reviewed Jan 26, 2026

cleanup and standalone_example

f3c599d

keshavb96 marked this pull request as ready for review January 26, 2026 20:37

update standalone example and demonstrate specifying param mapping as…

a0a3a65

… config
