# strands-env

Standardizing environment infrastructure with Strands Agents — step, observe, reward.

This package treats each `env.step()` as a full agent loop (prompt → (tool_call, tool_response+)* → response), not a single model call.
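The loop shape can be sketched in plain Python. This is a hypothetical illustration of what "full agent loop" means (invented names; the real loop lives inside the library's `Environment.step`):

```python
# Sketch of a step() agent loop (illustrative, not strands-env's code):
# the model may request tools any number of times before its final response.
def agent_loop(model, tools, prompt, max_turns=8):
    messages = [{"role": "user", "content": prompt}]
    for _ in range(max_turns):
        reply = model(messages)                      # one model call per turn
        if reply.get("tool_call") is None:
            return reply["content"]                  # final response ends the loop
        tool = tools[reply["tool_call"]["name"]]
        result = tool(**reply["tool_call"]["args"])  # execute the requested tool
        messages.append({"role": "tool", "content": str(result)})
    return None  # turn budget exhausted
```

One `step()` therefore spans several model calls, which is why observations and rewards are attached to the whole trajectory rather than a single completion.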
## Features

- Define Environments — Subclass `Environment`, add `@tool` functions, plug in a `RewardFunction`
- RL Training — Token-level observations for on-policy training with strands-sglang
- Benchmarking — CLI and `Evaluator` with checkpointing, resume, and custom metrics
## Installation

```bash
pip install strands-env
```

For development:

```bash
git clone https://github.com/horizon-rl/strands-env.git && cd strands-env
pip install -e ".[dev]"
```

## Quick Start

Subclass `Environment` and add tools as `@tool`-decorated functions:

```python
from strands import tool
from strands_env.core import Environment

@tool
def calculator(expression: str) -> str:
    """Evaluate a math expression."""
    return str(eval(expression))

class MathEnv(Environment):
    def get_tools(self):
        return [calculator]
```

Then step the environment with an `Action`:

```python
env = MathEnv(model_factory=factory, reward_fn=reward_fn)
result = await env.step(Action(message="What is 2^10?", task_context=TaskContext(ground_truth="1024")))
result.observation.final_response  # "The answer is 1024"
result.reward.reward               # 1.0
result.termination_reason          # TerminationReason.TASK_COMPLETE
```

See examples/calculator_demo.py for a complete example.
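The `reward_fn` passed to `MathEnv` is left undefined above. A minimal exact-match reward could compare the final response against the task's ground truth; this is a plain-function sketch under that assumption, not strands-env's `RewardFunction` API:

```python
import re

# Hypothetical exact-match reward: 1.0 if the ground truth appears as the
# last number in the final response, else 0.0. A stand-in sketch only.
def exact_match_reward(final_response: str, ground_truth: str) -> float:
    numbers = re.findall(r"-?\d+(?:\.\d+)?", final_response)
    return 1.0 if numbers and numbers[-1] == ground_truth else 0.0
```

Matching on the last number rather than the whole string keeps the reward robust to phrasing like "The answer is 1024".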
## Evaluation

Run a benchmark from the CLI:

```bash
strands-env eval aime-2024 \
    --env examples/envs/calculator_env.py \
    --backend sglang \
    --base-url http://localhost:30000 \
    --n-samples-per-prompt 8 \
    --max-concurrency 30
```

## Documentation

- Evaluation Guide — CLI reference, hook files, custom evaluators
- RL Training Integration — slime integration, token observations
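With `--n-samples-per-prompt 8`, each task is attempted several times, which supports pass@k-style metrics. The standard unbiased estimator (a sketch of the usual combinatorial formula, not necessarily what the CLI reports internally) is:

```python
from math import comb

# Unbiased pass@k: probability that at least one of k draws from n samples
# (of which c passed) is correct. Standard estimator; illustrative only.
def pass_at_k(n: int, c: int, k: int) -> float:
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)
```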
## Development

```bash
# Lint
ruff check src/ && ruff format --check src/

# Unit tests
pytest tests/unit/ -v

# Integration tests (requires a running SGLang server)
pytest tests/integration/ -v --sglang-base-url=http://localhost:30000
```

## License

Apache License 2.0 — see LICENSE.