You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
-
# Open-RL: self-hosted API for your RL Infrastructure
1
+
# OpenRL: self-hosted API for your RL Infrastructure
2
2
3
-
Open-RL implements [Tinker](https://tinker-docs.thinkingmachines.ai/) compatible API for fine-tuning language models that you can run on your own infrastructure (machine or a kubernetes cluster). You can use the Tinker SDK to orchestrate RL training loops by writing imperative Python code directly from your local machine.
3
+
OpenRL implements [Tinker](https://tinker-docs.thinkingmachines.ai/) compatible API for fine-tuning language models that you can run on your own infrastructure (machine or a kubernetes cluster). You can use the Tinker SDK to orchestrate RL training loops by writing imperative Python code directly from your local machine.
Copy file name to clipboardExpand all lines: docs/blog/from-mac-to-gke.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,8 +1,8 @@
1
-
# From Your Mac to GKE: Fine-Tuning Gemma with Open-RL
1
+
# From Your Mac to GKE: Fine-Tuning Gemma with OpenRL
2
2
3
3
RL fine-tuning is one of the most powerful ways to specialize language models — but the infrastructure behind it has traditionally been a nightmare. You're either wrestling with GPU allocation, rewriting training scripts for different backends, or managing job lifecycles by hand.
4
4
5
-
[Open-RL](https://github.com/google/open-rl) is a self-hosted, open-source API that makes this simple. Write your training loop once using the Tinker SDK, run it on your Mac to iterate fast, then point it at a GKE cluster when you're ready to scale. Same code, any backend.
5
+
[OpenRL](https://github.com/google/open-rl) is a self-hosted, open-source API that makes this simple. Write your training loop once using the Tinker SDK, run it on your Mac to iterate fast, then point it at a GKE cluster when you're ready to scale. Same code, any backend.
6
6
7
7
Let's walk through it.
8
8
@@ -106,4 +106,4 @@ Check out the [Architecture Deep Dive](../architecture.md) for a detailed explan
106
106
-**[GKE Deployment Guide](../deployment.md)** — Set up the distributed backend on Kubernetes
107
107
-**[Architecture Deep Dive](../architecture.md)** — How the Gateway, Queue, and Clock Cycle Engine work together
108
108
109
-
Open-RL is Apache 2.0 licensed. Contributions welcome.
109
+
OpenRL is Apache 2.0 licensed. Contributions welcome.
Copy file name to clipboardExpand all lines: docs/setup/gke-setup.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# GKE Setup Guide
2
2
3
-
This guide describes how to create a minimal GKE Standard cluster to run Open-RL workloads. It sets up the Open-RL gateway, one vLLM worker, one trainer worker, Redis, and a shared Filestore PVC.
3
+
This guide describes how to create a minimal GKE Standard cluster to run OpenRL workloads. It sets up the OpenRL gateway, one vLLM worker, one trainer worker, Redis, and a shared Filestore PVC.
4
4
5
5
This guide is based on the [Text-to-SQL recipe](../../examples/text-to-sql/README.md) requirements.
Copy file name to clipboardExpand all lines: examples/README.md
+6-4Lines changed: 6 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
-
# Open-RL Examples
1
+
# OpenRL Examples
2
2
3
-
This directory contains examples, demos, and helper scripts for using the Open-RL framework. These are not part of the core library but serve as recipes for training and evaluation.
3
+
This directory contains examples, demos, and helper scripts for using the OpenRL framework. These are not part of the core library but serve as recipes for training and evaluation.
4
4
5
5
## Prerequisites
6
6
@@ -22,9 +22,11 @@ This directory contains examples, demos, and helper scripts for using the Open-R
22
22
23
23
### Reinforcement Learning (RL)
24
24
***[Text-to-SQL RL](rl/text-to-sql):** Runs the Gemma 4 SFT+RL recipe with SQL execution rewards and curve plotting.
25
-
***[Autoresearch Demo](autoresearch):** Runs code-RL researchers against the same Open-RL gateway using cookbook DeepCoder rewards, Sandbox Fusion, and optional Agent Sandbox CRDs.
25
+
26
+
### Autoresearch
27
+
***[Autoresearch Demo](autoresearch):** Runs code-RL researchers against the same OpenRL gateway using cookbook DeepCoder rewards, Sandbox Fusion, and optional Agent Sandbox CRDs.
26
28
27
29
### Tinker Cookbook
28
-
***[Tinker Cookbook Recipes](tinker-cookbook):** Examples showing how to run [Tinker Cookbook](https://github.com/thinking-machines-lab/tinker-cookbook) recipes with Open-RL.
30
+
***[Tinker Cookbook Recipes](tinker-cookbook):** Examples showing how to run [Tinker Cookbook](https://github.com/thinking-machines-lab/tinker-cookbook) recipes with OpenRL.
0 commit comments