We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent ca2893d commit e29a148Copy full SHA for e29a148
README.md
@@ -8,7 +8,7 @@ Standardizing environment infrastructure with [Strands Agents](https://github.co
8
9
## Features
10
11
-This package treats each `env.step()` as a **full agent loop** `(prompt → (tool_call, tool_response)* → response)`, not a single model call.
+This package treats each `env.step()` as a **full agent loop** `(prompt → (tool_call, tool_response+)* → response)`, not a single model call.
12
13
- **Define Environments** — Subclass `Environment`, add `@tool` functions, plug in `RewardFunction`
14
- **RL Training** — Token-level observations for on-policy training with [strands-sglang](https://github.com/horizon-rl/strands-sglang)
0 commit comments