Commit ff1b63e

docs: update rllm integration status and add training example links (#45)
1 parent 2c73837 commit ff1b63e

2 files changed

Lines changed: 5 additions & 11 deletions

AGENTS.md

Lines changed: 2 additions & 9 deletions
@@ -17,7 +17,6 @@ This document provides context, patterns, and guidelines for AI coding assistant
 - [Environment Variables](#environment-variables)
 - [Common Tasks](#common-tasks)
 - [Development Tips](#development-tips)
-- [Known Limitations & TODOs](#known-limitations--todos)
 - [External References](#external-references)
 
 ---
@@ -494,13 +493,6 @@ uv run pre-commit install
 
 ---
 
-## Known Limitations & TODOs
-
-### Design Improvements
-- **Model gateway (in preview)**: [rllm-model-gateway](https://github.com/rllm-org/rllm/tree/main/rllm-model-gateway) replaces the need for `vLLMModel` client-side token collection. The gateway proxies inference requests and captures token IDs + logprobs transparently at the HTTP layer. Integration with rllm training backends is under active development. The legacy `vLLMModel` under `frameworks/strands/` is retained for backward compatibility.
-
----
-
 ## External References
 
 - **ACR Documentation**: https://docs.aws.amazon.com/bedrock-agentcore/
@@ -510,4 +502,5 @@ uv run pre-commit install
 - **Runtime SDK Overview**: https://aws.github.io/bedrock-agentcore-starter-toolkit/user-guide/runtime/overview.html
 - **HTTP Protocol Contract**: https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/runtime-http-protocol-contract.html#container-requirements-http
 - **rLLM SDK (reference)**: https://rllm-project.readthedocs.io/en/latest/core-concepts/sdk/#1-define-your-agent-function
-- **rllm-model-gateway** (token capture proxy for RL training): https://github.com/rllm-org/rllm/tree/main/rllm-model-gateway
+- **rllm-model-gateway** (token capture proxy for RL training): https://github.com/rllm-org/rllm/tree/main/rllm-model-gateway | [PyPI](https://pypi.org/project/rllm-model-gateway/)
+- **AgentCore math training example** (rllm + Tinker backend): https://github.com/rllm-org/rllm/blob/main/examples/agentcore_math/train_agentcore_math_tinker.sh

README.md

Lines changed: 3 additions & 2 deletions
@@ -127,7 +127,7 @@ In practice, this is infrastructure managed by the training framework:
 - **During training**: the training engine points `base_url` through the gateway automatically
 - **During evaluation**: `base_url` points directly to any OpenAI-compatible endpoint (vLLM, SGLang, LiteLLM, etc.), or you can use `BedrockModel` via the Bedrock API — no gateway involved
 
-The gateway is currently in preview. See the [rllm-model-gateway repo](https://github.com/rllm-org/rllm/tree/main/rllm-model-gateway) for details.
+The gateway is [available on PyPI](https://pypi.org/project/rllm-model-gateway/) (`pip install rllm-model-gateway`). See the [rllm-model-gateway repo](https://github.com/rllm-org/rllm/tree/main/rllm-model-gateway) for details.
 
 ## Client-Side: Invoking Agents and Collecting Results
 
@@ -267,7 +267,8 @@ The training architecture follows a **decoupled design** where agent rollouts an
 This architecture enables parallel and highly efficient rollouts with secure execution during RL training. The decoupled design means training libraries only need the agent's container image to start training—agent code and dependencies stay completely separate from the training library.
 
 **Supported Training Libraries:**
-- [rllm](https://github.com/rllm-org/rllm) integration coming soon (supports multiple backends: veRL, Tinker, and more)
+- [rLLM](https://github.com/rllm-org/rllm) — supports multiple backends (veRL, Tinker, and more)
+- [Math Agent](examples/strands_math_agent/): [Tinker](https://github.com/rllm-org/rllm/blob/main/examples/agentcore_math/train_agentcore_math_tinker.sh)
 
 ### Prepare Your Agent Container
 