Skip to content

Commit 5c69d94

Browse files
committed
chore: update
1 parent 2386757 commit 5c69d94

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,13 +8,15 @@ RL environment abstraction for [Strands Agents](https://github.com/strands-agent
88

99
## Features
1010

11-
This package standardizes agent environments by treating each `env.step()` as a full agent loop (prompt → (tool_call, tool_response)* → response), not a single model call or tool call. Built on [strands](https://github.com/strands-agents/sdk-python) agent loop and [`strands-sglang`](https://github.com/horizon-rl/strands-sglang) for RL training.
11+
This package standardizes agent environments by treating each `env.step()` as a **full agent loop**, not a single model call or tool call. Built on [strands](https://github.com/strands-agents/sdk-python) agent loop and [`strands-sglang`](https://github.com/horizon-rl/strands-sglang) for RL training.
1212

1313
- **Define environments easily** — subclass `Environment` and implement tools as `@tool` functions
1414
- **Capture token-level observations** — token-in/token-out trajectories for on-policy RL training (SGLang backend)
1515
- **Plug in reward functions** — evaluate agent outputs with custom `RewardFunction`
1616
- **Run benchmarks**`Evaluator` with flexible environment setup, metric customization, and resume
1717

18+
> An agent loop can be defined as `(prompt → (tool_call, tool_response+)* → response)`
19+
1820
## Install
1921

2022
```bash

0 commit comments

Comments
 (0)