You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ RL environment abstraction for [Strands Agents](https://github.com/strands-agent
11
11
This package standardizes agent environments by treating each `env.step()` as a full agent loop (`prompt → (tool_call, tool_response)* → response`), not a single model call. Built on [strands](https://github.com/strands-agents/sdk-python) agent loop and [`strands-sglang`](https://github.com/horizon-rl/strands-sglang) for RL training.
12
12
13
13
-**Define environments easily** — subclass `Environment` and implement tools as `@tool` functions
14
-
-**Capture token-level observations** — TITO data for on-policy RL training (SGLang backend)
14
+
-**Capture token-level observations** — token-in/token-out trajectories for on-policy RL training (SGLang backend)
15
15
-**Plug in reward functions** — evaluate agent outputs with custom `RewardFunction`
16
16
-**Run benchmarks** — `Evaluator` with pass@k metrics, checkpointing, and resume
0 commit comments