Skip to content

Commit 5f64a0f

Browse files
committed
Update default model to gpt-5 and add model temperature for o3
1 parent 04b9756 commit 5f64a0f

File tree

3 files changed

+25
-9
lines changed

3 files changed

+25
-9
lines changed

README.md

Lines changed: 14 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -117,9 +117,14 @@ Whether you're interested in AI, automation, or contributing to cutting-edge age
117117

118118

119119
### Installation
120+
To install Agent S2.5 without cloning the repository, run
120121
```bash
121122
pip install gui-agents
122123
```
124+
If you would like to test Agent S2.5 while making changes, clone the repository and install using
125+
```
126+
pip install -e .
127+
```
123128

124129
### API Configuration
125130

@@ -147,7 +152,7 @@ For optimal performance, we recommend [UI-TARS-1.5-7B](https://huggingface.co/By
147152

148153

149154
> ⚡️ **Recommended Setup:**
150-
> For the best configuration, we recommend using **OpenAI o3-2025-04-16** as the main model, paired with **UI-TARS-1.5-7B** for grounding.
155+
> For the best configuration, we recommend using **OpenAI gpt-5-2025-08-07** as the main model, paired with **UI-TARS-1.5-7B** for grounding.
151156
152157

153158
### CLI
@@ -157,7 +162,7 @@ Run Agent S2.5 with the required parameters:
157162
```bash
158163
agent_s \
159164
--provider openai \
160-
--model o3-2025-04-16 \
165+
--model gpt-5-2025-08-07 \
161166
--ground_provider huggingface \
162167
--ground_url http://localhost:8080 \
163168
--ground_model ui-tars-1.5-7b \
@@ -167,13 +172,16 @@ agent_s \
167172

168173
#### Required Parameters
169174
- **`--provider`**: Main generation model provider (e.g., openai, anthropic, etc.) - Default: "openai"
170-
- **`--model`**: Main generation model name (e.g., o3-2025-04-16) - Default: "o3-2025-04-16"
175+
- **`--model`**: Main generation model name (e.g., gpt-5-2025-08-07) - Default: "gpt-5-2025-08-07"
171176
- **`--ground_provider`**: The provider for the grounding model - **Required**
172177
- **`--ground_url`**: The URL of the grounding model - **Required**
173178
- **`--ground_model`**: The model name for the grounding model - **Required**
174179
- **`--grounding_width`**: Width of the output coordinate resolution from the grounding model - **Required**
175180
- **`--grounding_height`**: Height of the output coordinate resolution from the grounding model - **Required**
176181

182+
#### Optional Parameters
183+
- **`--model_temperature`**: The temperature to fix all model calls to (necessary to set to 1.0 for models like o3 but can be left blank for other models)
184+
177185
#### Grounding Model Dimensions
178186
The grounding width and height should match the output coordinate resolution of your grounding model:
179187
- **UI-TARS-1.5-7B**: Use `--grounding_width 1920 --grounding_height 1080`
@@ -208,8 +216,9 @@ Next, we define our engine parameters. `engine_params` is used for the main agen
208216
engine_params = {
209217
"engine_type": provider,
210218
"model": model,
211-
"base_url": model_url, # Optional
212-
"api_key": model_api_key, # Optional
219+
"base_url": model_url, # Optional
220+
"api_key": model_api_key, # Optional
221+
"temperature": model_temperature # Optional
213222
}
214223

215224
# Load the grounding engine from a custom endpoint

gui_agents/s2_5/cli_app.py

Lines changed: 9 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -230,8 +230,8 @@ def main():
230230
parser.add_argument(
231231
"--model",
232232
type=str,
233-
default="o3-2025-04-16",
234-
help="Specify the model to use (e.g., o3-2025-04-16)",
233+
default="gpt-5-2025-08-07",
234+
help="Specify the model to use (e.g., gpt-5-2025-08-07)",
235235
)
236236
parser.add_argument(
237237
"--model_url",
@@ -245,6 +245,12 @@ def main():
245245
default="",
246246
help="The API key of the main generation model.",
247247
)
248+
parser.add_argument(
249+
"--model_temperature",
250+
type=float,
251+
default=None,
252+
help="Temperature to fix the generation model at (e.g. o3 can only be run with 1.0)"
253+
)
248254

249255
# Grounding model config: Self-hosted endpoint based (required)
250256
parser.add_argument(
@@ -312,6 +318,7 @@ def main():
312318
"model": args.model,
313319
"base_url": args.model_url,
314320
"api_key": args.model_api_key,
321+
"temperature": getattr(args, 'model_temperature', None),
315322
}
316323

317324
# Load the grounding engine from a custom endpoint

models.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ from gui_agents.s2_5.agents.agent_s import AgentS2_5
4646

4747
engine_params = {
4848
"engine_type": 'openai', # Allowed Values: 'openai', 'anthropic', 'gemini', 'azure_openai', 'vllm', 'open_router'
49-
"model": 'o3-2025-04-16', # Allowed Values: Any Vision and Language Model from the supported APIs
49+
"model": 'gpt-5-2025-08-07', # Allowed Values: Any Vision and Language Model from the supported APIs
5050
}
5151
agent = AgentS2_5(
5252
engine_params,
@@ -62,7 +62,7 @@ from gui_agents.s2_5.core.mllm import LMMAgent
6262

6363
engine_params = {
6464
"engine_type": 'openai', # Allowed Values: 'openai', 'anthropic', 'gemini', 'azure_openai', 'vllm', 'open_router'
65-
"model": 'o3-2025-04-16', # Allowed Values: Any Vision and Language Model from the supported APIs
65+
"model": 'gpt-5-2025-08-07', # Allowed Values: Any Vision and Language Model from the supported APIs
6666
}
6767
agent = LMMAgent(
6868
engine_params=engine_params,

0 commit comments

Comments
 (0)