Skip to content

Commit 684fc1e

Browse files
authored
docs: Rename Open-RL to OpenRL across all markdown files (#104)
1 parent 865da70 commit 684fc1e

18 files changed

Lines changed: 59 additions & 57 deletions

File tree

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
1-
# Open-RL: self-hosted API for your RL Infrastructure
1+
# OpenRL: self-hosted API for your RL Infrastructure
22

3-
Open-RL implements [Tinker](https://tinker-docs.thinkingmachines.ai/) compatible API for fine-tuning language models that you can run on your own infrastructure (machine or a kubernetes cluster). You can use the Tinker SDK to orchestrate RL training loops by writing imperative Python code directly from your local machine.
3+
OpenRL implements [Tinker](https://tinker-docs.thinkingmachines.ai/) compatible API for fine-tuning language models that you can run on your own infrastructure (machine or a kubernetes cluster). You can use the Tinker SDK to orchestrate RL training loops by writing imperative Python code directly from your local machine.
44

55
# Why Tinker
66

docs/architecture.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
1-
# Open-RL
1+
# OpenRL
22

3-
Open-RL implements post-training APIs to fine-tune language models on
3+
OpenRL implements post-training APIs to fine-tune language models on
44
self-hosted infrastructure. These APIs cover common post-training techniques
55
such as supervised fine-tuning, reinforcement learning, and related workflows.
66

7-
Conceptually, Open-RL decouples the researcher-facing training loop from the
7+
Conceptually, OpenRL decouples the researcher-facing training loop from the
88
infrastructure that runs it. Researchers own datasets, environments, rewards,
9-
losses, and optimization logic; Open-RL owns the serving, scheduling, sampling,
9+
losses, and optimization logic; OpenRL owns the serving, scheduling, sampling,
1010
and storage needed to run that loop. This separation lets training methods and
1111
backend capacity evolve independently.
1212

@@ -137,7 +137,7 @@ The sampler produces rollouts and token logprobs for the training loop. On one
137137
machine this can run through the same model state as training; in a cluster it
138138
can be a separate inference service that loads adapter snapshots.
139139

140-
Keeping sampling as a separate concept lets Open-RL use the same API contract
140+
Keeping sampling as a separate concept lets OpenRL use the same API contract
141141
for single-machine iteration and cluster-backed inference. The client only sees
142142
sample results, not the backend routing choice.
143143

docs/blog/from-mac-to-gke.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
1-
# From Your Mac to GKE: Fine-Tuning Gemma with Open-RL
1+
# From Your Mac to GKE: Fine-Tuning Gemma with OpenRL
22

33
RL fine-tuning is one of the most powerful ways to specialize language models — but the infrastructure behind it has traditionally been a nightmare. You're either wrestling with GPU allocation, rewriting training scripts for different backends, or managing job lifecycles by hand.
44

5-
[Open-RL](https://github.com/google/open-rl) is a self-hosted, open-source API that makes this simple. Write your training loop once using the Tinker SDK, run it on your Mac to iterate fast, then point it at a GKE cluster when you're ready to scale. Same code, any backend.
5+
[OpenRL](https://github.com/google/open-rl) is a self-hosted, open-source API that makes this simple. Write your training loop once using the Tinker SDK, run it on your Mac to iterate fast, then point it at a GKE cluster when you're ready to scale. Same code, any backend.
66

77
Let's walk through it.
88

@@ -106,4 +106,4 @@ Check out the [Architecture Deep Dive](../architecture.md) for a detailed explan
106106
- **[GKE Deployment Guide](../deployment.md)** — Set up the distributed backend on Kubernetes
107107
- **[Architecture Deep Dive](../architecture.md)** — How the Gateway, Queue, and Clock Cycle Engine work together
108108

109-
Open-RL is Apache 2.0 licensed. Contributions welcome.
109+
OpenRL is Apache 2.0 licensed. Contributions welcome.

docs/configuration.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Configuration
22

3-
Open-RL is configured with environment variables. The examples below use plain
3+
OpenRL is configured with environment variables. The examples below use plain
44
shell commands so they work even if `make` is not installed. The root
55
`Makefile` wraps the same commands for convenience.
66

@@ -81,7 +81,7 @@ make server BASE_MODEL=google/gemma-4-e2b SAMPLING_BACKEND=vllm
8181
| Env var | Default | What it does |
8282
| --- | --- | --- |
8383
| `TINKER_BASE_URL` | `http://127.0.0.1:9003` | Base URL used by example clients and scripts. |
84-
| `TINKER_API_KEY` | `tml-dummy-key` | Passed through to the Tinker SDK. Local Open-RL does not enforce auth. |
84+
| `TINKER_API_KEY` | `tml-dummy-key` | Passed through to the Tinker SDK. Local OpenRL does not enforce auth. |
8585
| `HF_TOKEN` | unset | Required for gated Hugging Face models. `uv run hf auth login` is the easiest setup path. |
8686
| `ENABLE_GCP_TRACE` | `0` | `1` exports OpenTelemetry traces to Google Cloud Trace. |
8787
| `ENABLE_CONSOLE_TRACE` | `0` | `1` prints trace spans to stdout for debugging. |

docs/rl_concepts.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
1-
# Reinforcement Learning Concepts in Open-RL
1+
# Reinforcement Learning Concepts in OpenRL
22

3-
This document provides an overview of the core Reinforcement Learning (RL) concepts used in the Open-RL project.
3+
This document provides an overview of the core Reinforcement Learning (RL) concepts used in the OpenRL project.
44

55
<!-- TODO: Add RL concepts -->

docs/setup/gke-setup.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# GKE Setup Guide
22

3-
This guide describes how to create a minimal GKE Standard cluster to run Open-RL workloads. It sets up the Open-RL gateway, one vLLM worker, one trainer worker, Redis, and a shared Filestore PVC.
3+
This guide describes how to create a minimal GKE Standard cluster to run OpenRL workloads. It sets up the OpenRL gateway, one vLLM worker, one trainer worker, Redis, and a shared Filestore PVC.
44

55
This guide is based on the [Text-to-SQL recipe](../../examples/text-to-sql/README.md) requirements.
66

@@ -93,7 +93,7 @@ Connect `kubectl`:
9393
gcloud container clusters get-credentials "${CLUSTER}" --location="${REGION}"
9494
```
9595

96-
## 3. Deploy Open-RL
96+
## 3. Deploy OpenRL
9797

9898
Deploy the manifests using the Kustomize overlay. You should apply **only one** of the following, depending on your needs:
9999

@@ -146,7 +146,7 @@ curl http://127.0.0.1:9003/api/v1/healthz
146146
curl http://127.0.0.1:9003/api/v1/get_server_capabilities
147147
```
148148

149-
The Open-RL server is now available at `http://127.0.0.1:9003`.
149+
The OpenRL server is now available at `http://127.0.0.1:9003`.
150150

151151
## 5. Clean Up
152152

docs/setup/local-setup.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Local Setup Guide
22

3-
This guide describes how to set up a local environment (or a single VM) to run Open-RL workloads.
3+
This guide describes how to set up a local environment (or a single VM) to run OpenRL workloads.
44

55
## Prerequisites
66

@@ -85,9 +85,9 @@ export VLLM_ARCHITECTURE_OVERRIDE=Gemma4ForCausalLM
8585
make vllm
8686
```
8787

88-
### 3. Start the Open-RL Server
88+
### 3. Start the OpenRL Server
8989

90-
In a **second terminal session**, start the Open-RL gateway and trainer on GPU 1:
90+
In a **second terminal session**, start the OpenRL gateway and trainer on GPU 1:
9191

9292
```bash
9393
export CUDA_VISIBLE_DEVICES=1
@@ -96,4 +96,4 @@ export SAMPLING_BACKEND=vllm
9696
make server
9797
```
9898

99-
The Open-RL server is now available at `http://127.0.0.1:9003`.
99+
The OpenRL server is now available at `http://127.0.0.1:9003`.

docs/tinker-client-compatibility.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ Generated from `tinker==0.18.1` by
44
`tests/tinker_client_compat.py`.
55

66
The test discovers public Tinker client methods with `dir()` and `inspect`,
7-
starts the real Open-RL FastAPI gateway in single-process mode with a tiny
7+
starts the real OpenRL FastAPI gateway in single-process mode with a tiny
88
local model fixture, lets the SDK fetch server bootstrap config, calls each
99
discovered method
1010
with small fixture arguments, and records whether the call succeeds before

examples/README.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
1-
# Open-RL Examples
1+
# OpenRL Examples
22

3-
This directory contains examples, demos, and helper scripts for using the Open-RL framework. These are not part of the core library but serve as recipes for training and evaluation.
3+
This directory contains examples, demos, and helper scripts for using the OpenRL framework. These are not part of the core library but serve as recipes for training and evaluation.
44

55
## Prerequisites
66

@@ -22,9 +22,11 @@ This directory contains examples, demos, and helper scripts for using the Open-R
2222

2323
### Reinforcement Learning (RL)
2424
* **[Text-to-SQL RL](rl/text-to-sql):** Runs the Gemma 4 SFT+RL recipe with SQL execution rewards and curve plotting.
25-
* **[Autoresearch Demo](autoresearch):** Runs code-RL researchers against the same Open-RL gateway using cookbook DeepCoder rewards, Sandbox Fusion, and optional Agent Sandbox CRDs.
25+
26+
### Autoresearch
27+
* **[Autoresearch Demo](autoresearch):** Runs code-RL researchers against the same OpenRL gateway using cookbook DeepCoder rewards, Sandbox Fusion, and optional Agent Sandbox CRDs.
2628

2729
### Tinker Cookbook
28-
* **[Tinker Cookbook Recipes](tinker-cookbook):** Examples showing how to run [Tinker Cookbook](https://github.com/thinking-machines-lab/tinker-cookbook) recipes with Open-RL.
30+
* **[Tinker Cookbook Recipes](tinker-cookbook):** Examples showing how to run [Tinker Cookbook](https://github.com/thinking-machines-lab/tinker-cookbook) recipes with OpenRL.
2931

3032
---

examples/autoresearch/README.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
1-
# Open-RL Autoresearch Demo
1+
# OpenRL Autoresearch Demo
22

33
This adapts [Karpathy's autoresearch](https://github.com/karpathy/autoresearch)
4-
to Open-RL: an agent repeatedly edits one allowed target, runs a bounded
4+
to OpenRL: an agent repeatedly edits one allowed target, runs a bounded
55
measured attempt, keeps commits that improve the configured metric, and resets
66
the rest. The same recipe contract works locally or in Kubernetes; in a cluster,
77
each run can live in its own pod and act as a researcher while sharing the same
8-
storage and Open-RL backend.
8+
storage and OpenRL backend.
99

1010
## Minimal Recipe Shape
1111

@@ -64,10 +64,10 @@ recipe-specific settings.
6464

6565
## Architecture
6666

67-
Autoresearch runs as a small Kubernetes add-on around the shared Open-RL
67+
Autoresearch runs as a small Kubernetes add-on around the shared OpenRL
6868
infrastructure. A recipe overlay starts the UI plus one researcher Sandbox per
6969
researcher. Each Sandbox runs Gemini CLI, edits the recipe, launches attempts,
70-
and calls the shared Open-RL/Tinker services.
70+
and calls the shared OpenRL/Tinker services.
7171

7272
![Autoresearch architecture](arch.png)
7373

@@ -86,11 +86,11 @@ Choose one recipe overlay:
8686
# Fast text-SQL, no model server.
8787
kubectl apply -k examples/autoresearch/recipes/text_sql
8888

89-
# Math-RL add-on. First deploy Open-RL with docs/setup/gke-setup.md,
89+
# Math-RL add-on. First deploy OpenRL with docs/setup/gke-setup.md,
9090
# or reuse an existing backend at http://open-rl-gateway-service:8000.
9191
kubectl apply -k examples/autoresearch/recipes/math_rl
9292

93-
# Convenience one-shot Math-RL stack: Open-RL backend + autoresearch add-on.
93+
# Convenience one-shot Math-RL stack: OpenRL backend + autoresearch add-on.
9494
kubectl apply -k examples/autoresearch/recipes/math_rl/gke
9595
```
9696

@@ -110,7 +110,7 @@ http://localhost:8080/experiments.html
110110
```
111111

112112
Use the normal [GKE setup guide](../../docs/setup/gke-setup.md) for cluster,
113-
GPU, storage, and the Open-RL backend. These overlays add researcher sandboxes and
113+
GPU, storage, and the OpenRL backend. These overlays add researcher sandboxes and
114114
the UI on top of that shared backend.
115115

116116
Researcher pods wait for comma-separated `READY_URLS` before the agent starts, so

0 commit comments

Comments
 (0)