
Conversation

@zhxchen17 (Contributor) commented on Dec 24, 2025

Summary:

In vllm-project/vllm#26315 and vllm-project/vllm#30704, vLLM deprecated the
VLLM_ATTENTION_BACKEND environment variable, and init_batch_invariance() now takes a
required attention-backend argument. This PR updates the call sites to match the
latest vLLM API.
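
For reference, a minimal sketch of what the new API expects; the init_batch_invariance import path is taken from the tracebacks further down in this thread, while the AttentionBackendEnum import path is an assumption:

import os

from vllm.attention.backends.registry import AttentionBackendEnum  # import path is an assumption
from vllm.model_executor.layers.batch_invariant import init_batch_invariance

# Previously the backend was picked up implicitly from VLLM_ATTENTION_BACKEND:
#     init_batch_invariance()
# Now one of FLASH_ATTN, FLASHINFER, FLASH_ATTN_MLA, TRITON_MLA must be passed
# explicitly (see the RuntimeError quoted in the review discussion below).
init_batch_invariance(
    getattr(AttentionBackendEnum, os.getenv("VLLM_ATTENTION_BACKEND", "FLASH_ATTN"))
)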

Test Plan:

VLLM_BATCH_INVARIANT=1 VLLM_ATTENTION_BACKEND=FLASH_ATTN python3 torchtitan/experiments/rl/unified/simple_rl_multiprocess.py

Reviewers:

Subscribers:

Tasks:

Tags:

@meta-cla bot added the CLA Signed label on Dec 24, 2025
def get_vllm_attention_backend() -> AttentionBackendEnum:
    if os.getenv("VLLM_ATTENTION_BACKEND") is None:
        raise RuntimeError("VLLM_ATTENTION_BACKEND is not set.")
    return getattr(AttentionBackendEnum, os.getenv("VLLM_ATTENTION_BACKEND"))
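
Presumably this helper feeds the resolved backend into vLLM's batch-invariant setup at the call site; a minimal sketch of that wiring, with the init_batch_invariance import path taken from the tracebacks below:

from vllm.model_executor.layers.batch_invariant import init_batch_invariance

# Resolve the backend name from VLLM_ATTENTION_BACKEND and pass the enum member
# to vLLM instead of calling init_batch_invariance() with no argument.
init_batch_invariance(get_vllm_attention_backend())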
@tianyu-l (Contributor) commented:
vllm-project/vllm#30704 says:

The VLLM_ATTENTION_BACKEND environment variable has been deprecated by vllm-project/vllm#26315. This PR updates the batch invariant initialization accordingly.

Why do we still use this envvar?

Personally, I think we should avoid envvars as much as possible, so I'd prefer not having to call

VLLM_BATCH_INVARIANT=1 VLLM_ATTENTION_BACKEND=FLASH_ATTN python3 torchtitan/experiments/rl/unified/simple_rl_multiprocess.py

@acisseJZhong (Contributor) commented:

Wondering what error you are seeing if you don't set VLLM_ATTENTION_BACKEND=FLASH_ATTN and don't pass anything into init_batch_invariance?

@zhxchen17 (Contributor, Author) replied:

@acisseJZhong the error we saw with init_batch_invariance():

  File "/data/users/zhxchen17/torchtitan/torchtitan/experiments/rl/unified/simple_rl_multiprocess.py", line 25, in <module>
    from torchtitan.experiments.rl.unified.actors.generator import Generator
  File "/data/users/zhxchen17/torchtitan/torchtitan/experiments/rl/unified/actors/generator.py", line 18, in <module>
    from torchtitan.experiments.rl.vllm_compat.simple_rl import (
  File "/data/users/zhxchen17/torchtitan/torchtitan/experiments/rl/vllm_compat/simple_rl.py", line 43, in <module>
    init_batch_invariance()
TypeError: init_batch_invariance() missing 1 required positional argument: 'attention_backend'

and the error when calling init_batch_invariance(None):

Traceback (most recent call last):
  File "/data/users/zhxchen17/torchtitan/torchtitan/experiments/rl/unified/simple_rl_multiprocess.py", line 25, in <module>
    from torchtitan.experiments.rl.unified.actors.generator import Generator
  File "/data/users/zhxchen17/torchtitan/torchtitan/experiments/rl/unified/actors/generator.py", line 18, in <module>
    from torchtitan.experiments.rl.vllm_compat.simple_rl import (
  File "/data/users/zhxchen17/torchtitan/torchtitan/experiments/rl/vllm_compat/simple_rl.py", line 43, in <module>
    init_batch_invariance(None)
  File "/data/users/zhxchen17/vllm/vllm/model_executor/layers/batch_invariant.py", line 1057, in init_batch_invariance
    override_envs_for_invariance(attention_backend)
  File "/data/users/zhxchen17/vllm/vllm/model_executor/layers/batch_invariant.py", line 1025, in override_envs_for_invariance
    raise RuntimeError(error)
RuntimeError: VLLM batch_invariant mode requires an attention backend in ['FLASH_ATTN', 'FLASHINFER', 'FLASH_ATTN_MLA', 'TRITON_MLA'], but got 'None'. Please use --attention-backend or attention_config to set one of the supported backends before enabling batch_invariant.

@zhxchen17 (Contributor, Author) replied:

@tianyu-l Right now, other than an envvar, there seems to be no better way to inject configs into simple_rl_multiprocess.py.
Three other options that avoid the envvar:

  1. Hardcode the FLASH_ATTN backend in the main script for now and make it clear that it's hardcoded.
  2. Make simple_rl_multiprocess.py read a config file that contains the attention backend name.
  3. Make simple_rl_multiprocess.py accept a command-line argument, but this seems inconsistent with the main trainer script, so I'm not considering it at the moment.

Option 1 seems better for now if we stick with flash attention for a while (see the sketch below).
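
A minimal sketch of option 1, assuming AttentionBackendEnum exposes a FLASH_ATTN member (as the snippet above and the supported-backend list in the error suggest); the enum import path here is a guess:

from vllm.attention.backends.registry import AttentionBackendEnum  # import path is an assumption
from vllm.model_executor.layers.batch_invariant import init_batch_invariance

# Option 1: hardcode the backend in the main script so no VLLM_ATTENTION_BACKEND
# env var is needed, and make the hardcoding explicit.
ATTENTION_BACKEND = AttentionBackendEnum.FLASH_ATTN  # hardcoded for now
init_batch_invariance(ATTENTION_BACKEND)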

@zhxchen17 force-pushed the zhxchen17/init_batch_invariance branch from 2e155d8 to 3377c05 on December 24, 2025 16:57
@zhxchen17 requested a review from tianyu-l on December 24, 2025 16:57
@zhxchen17 force-pushed the zhxchen17/init_batch_invariance branch from 3377c05 to 834b405 on December 24, 2025 17:16