Skip to content

v0.2.2: Fix logprob regression for RL training

Latest

Choose a tag to compare

@Lawhy Lawhy released this 11 Feb 01:33
· 4 commits to main since this release

What's Changed

Bug Fixes

  • fix(sglang): regression on logprob computation - Fixed a regression introduced in v0.2.0 where logprob_start_len=0 was
    accidentally removed from the SGLang /generate request. This caused input_token_logprobs to not be returned for tool result
    tokens, resulting in None values in the token trajectory.

Impact

This bug affected all token-in/token-out training workflows with tool-calling agents. Users would see:
TypeError: must be real number, not NoneType
when training frameworks (e.g., slime) attempted to create tensors from the logprobs.

Upgrade

pip install --upgrade strands-sglang

v0.2.1 will be yanked

Full Changelog: v0.2.1...v0.2.2