I used the training script from here: qwen2-7b-fsdp2.log, and only changed qwen2-7b to qwen3-0.6b.
Both in the qwen2-7b-fsdp2.log log and in my own run I see the warning: Flash Attention 2.0 only supports torch.float16 and torch.bfloat16 dtypes, but the current dype in Qwen2ForCausalLM is torch.float32.
I looked through the configuration guide at https://verl.readthedocs.io/en/latest/examples/config.html but could not find a key that sets the dtype of the actor model.
What should I do to set the model's dtype to bf16 during training? Thanks so much.
verl version: 0.5.x
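
To be concrete about what I mean by "set the model's dtype to bf16": with plain transformers (not verl's own loading path), it would be something like the sketch below, where the model id is just an assumption for illustration. Loading this way avoids the Flash Attention fp32 warning, and I am looking for the verl config key that has the same effect on the actor model.

```python
# Minimal sketch with plain transformers, not verl-specific.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-0.6B",                       # assumed model id; substitute your local path
    torch_dtype=torch.bfloat16,              # load weights in bf16 instead of the fp32 default
    attn_implementation="flash_attention_2", # requires flash-attn to be installed
)
```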