Add LLM profile switch tool #3188
Conversation
Python API breakage checks — ✅ PASSED
REST API breakage checks (OpenAPI) — ✅ PASSED
Co-authored-by: openhands <openhands@all-hands.dev>
all-hands-bot left a comment:
Clean implementation of an optional LLM profile switching tool. Follows existing patterns (similar to InvokeSkillTool), has good test coverage, and includes clear error handling.
Risk Assessment (Overall PR): 🟢 LOW
Adds an optional built-in tool without modifying existing behavior. Well-tested with focused unit tests covering profile listing, successful switching, and error cases. No eval-risk concerns since this is an opt-in tool that must be explicitly enabled via `include_default_tools=["SwitchLLMTool"]`.
all-hands-bot left a comment:
✅ QA Report: PASS
SwitchLLMTool successfully enables agents to switch between saved LLM profiles during conversation execution, with proper error handling and state persistence.
Does this PR achieve its stated goal?
Yes. The PR set out to "add an optional built-in SwitchLLMTool that lets an agent switch the conversation to a saved LLM profile." The implementation delivers exactly this:
- Tool creation and registration: `SwitchLLMTool` is correctly registered in `BUILT_IN_TOOL_CLASSES` and can be instantiated via `include_default_tools=["SwitchLLMTool"]`.
- Profile switching: The tool successfully switches the conversation's LLM from one saved profile to another, updating both `conversation.agent.llm.model` and `conversation.state.agent.llm.model`.
- Profile discovery: The tool description dynamically lists all available profiles from `LLMProfileStore`, making them visible to the agent.
- Error handling: Missing profiles are caught and reported without crashing or leaving the conversation in an invalid state.
- Multiple switches: Sequential profile switches work correctly, allowing an agent to change models multiple times during a single conversation.
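The switching and error-handling behavior described above can be sketched with a minimal, self-contained stand-in. The dataclasses, `PROFILE_STORE` dict, and `switch_llm` function below are illustrative assumptions, not the SDK's actual types:

```python
from dataclasses import dataclass

# Hypothetical stand-ins for the SDK's agent/observation types.
@dataclass
class Observation:
    text: str
    is_error: bool = False

@dataclass
class Agent:
    model: str

# In the real tool, profiles come from an LLMProfileStore on disk.
PROFILE_STORE = {"fast": "gpt-4o-mini", "powerful": "claude-3-5-sonnet-20241022"}

def switch_llm(agent: Agent, profile_name: str) -> Observation:
    """Switch the agent to a saved profile, or report an error without mutating state."""
    model = PROFILE_STORE.get(profile_name)
    if model is None:
        # Missing profile: return an error observation, leave the agent unchanged.
        return Observation(f"LLM profile '{profile_name}' was not found.", is_error=True)
    agent.model = model  # future agent steps use the new model
    return Observation(
        f"Switched LLM profile to '{profile_name}'. Future agent steps will use this profile."
    )
```

The key design point is that a missing profile produces an error observation rather than an exception, so the conversation keeps running on its previous model.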
Evidence: Created three test LLM profiles (fast, slow, powerful), used SwitchLLMTool to switch from default model (gpt-4o-mini) → powerful (claude-3-5-sonnet-20241022) → fast (gpt-4o-mini). Each switch updated the active model correctly. Attempting to switch to a non-existent profile returned an error observation without changing the model.
| Phase | Result |
|---|---|
| Environment Setup | ✅ Dependencies installed, project builds successfully |
| CI Status | ✅ All core checks pass (sdk-tests, tools-tests, pre-commit, coverage-report) |
| Functional Verification | ✅ Tool switches profiles, lists available profiles, handles errors correctly |
Functional Verification
Test 1: Tool Registration and Discovery
Verification:
Confirmed `SwitchLLMTool` is registered in `BUILT_IN_TOOL_CLASSES` and can be instantiated:

```python
from openhands.sdk.tool.builtins import BUILT_IN_TOOL_CLASSES

print("SwitchLLMTool" in BUILT_IN_TOOL_CLASSES)  # True
print(BUILT_IN_TOOL_CLASSES.get("SwitchLLMTool"))  # <class '...SwitchLLMTool'>
```

Result: ✓ Tool is correctly registered and discoverable via `include_default_tools`.
Test 2: Profile Listing in Tool Description
Setup:
Created three LLM profiles in a temporary profile store:
- `fast.json` (model: gpt-4o-mini)
- `slow.json` (model: gpt-4o)
- `powerful.json` (model: claude-3-5-sonnet-20241022)
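A store like this can be set up on disk along the following lines; the one-JSON-file-per-profile layout and field names are assumptions for illustration, not the SDK's actual `LLMProfileStore` format:

```python
import json
import tempfile
from pathlib import Path

# Assumed layout: one <profile>.json file per profile in the store directory.
profiles = {
    "fast": {"model": "gpt-4o-mini"},
    "slow": {"model": "gpt-4o"},
    "powerful": {"model": "claude-3-5-sonnet-20241022"},
}

store_dir = Path(tempfile.mkdtemp())
for name, config in profiles.items():
    (store_dir / f"{name}.json").write_text(json.dumps(config))

# Profile names are the file stems, listed in sorted order.
print(sorted(p.stem for p in store_dir.glob("*.json")))  # ['fast', 'powerful', 'slow']
```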
Verification:
Called `SwitchLLMTool.create()` and inspected the tool description:

```
Available LLM profiles:
- fast
- powerful
- slow
```

Result: ✓ Tool description correctly lists all available profiles in sorted order.
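Rendering that section of the description is straightforward; a minimal sketch (the helper name `build_description` is hypothetical, not the PR's actual code):

```python
def build_description(profile_names):
    """Render the 'Available LLM profiles' section of the tool description."""
    lines = ["Available LLM profiles:"]
    # Sorting keeps the listing deterministic regardless of filesystem order.
    lines += [f"- {name}" for name in sorted(profile_names)]
    return "\n".join(lines)

print(build_description(["fast", "slow", "powerful"]))
```

Listing the profiles directly in the description is what makes them discoverable to the agent without an extra tool call.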
Test 3: Successful Profile Switch
Baseline (before switch):
Created a conversation with the default model:

```python
default_llm = TestLLM.from_messages([], model="gpt-4o-mini", usage_id="default")
agent = Agent(llm=default_llm, tools=[], include_default_tools=["SwitchLLMTool"])
conversation = LocalConversation(agent=agent, workspace=Path.cwd())
print(conversation.agent.llm.model)  # Output: gpt-4o-mini
```

This confirms the conversation starts with the default model.
Action:
Executed the SwitchLLMTool to switch to the "powerful" profile:

```python
observation = conversation.execute_tool(
    "switch_llm",
    SwitchLLMAction(profile_name="powerful", reason="Need more powerful model")
)
```

Result (after switch):

```
Observation text: Switched LLM profile to 'powerful'. Future agent steps will use this profile.
Is error: False
Profile name: powerful
Active model: claude-3-5-sonnet-20241022
Current conversation model: claude-3-5-sonnet-20241022
```
Verified both the agent's LLM and the conversation state were updated:

```python
assert conversation.agent.llm.model == "claude-3-5-sonnet-20241022"
assert conversation.state.agent.llm.model == "claude-3-5-sonnet-20241022"
```

Interpretation: The switch from gpt-4o-mini to claude-3-5-sonnet-20241022 was successful. Both the runtime agent and the persisted conversation state reflect the new model.
Test 4: Error Handling for Missing Profile
Setup:
Conversation is currently using the "powerful" profile (claude-3-5-sonnet-20241022).
Action:
Attempted to switch to a non-existent profile:

```python
error_observation = conversation.execute_tool(
    "switch_llm",
    SwitchLLMAction(profile_name="nonexistent", reason="Testing error handling")
)
```

Result:

```
Observation text: LLM profile 'nonexistent' was not found.
Is error: True
Current model (should be unchanged): claude-3-5-sonnet-20241022
```
Verified the model remained unchanged:

```python
assert conversation.agent.llm.model == "claude-3-5-sonnet-20241022"
assert conversation.state.agent.llm.model == "claude-3-5-sonnet-20241022"
```

Interpretation: The tool correctly handles missing profiles by returning an error observation without modifying the conversation state. The agent continues using the previous model.
Test 5: Multiple Sequential Switches
Setup:
Conversation is using the "powerful" profile.
Action:
Switched to the "fast" profile:

```python
observation2 = conversation.execute_tool(
    "switch_llm",
    SwitchLLMAction(profile_name="fast", reason="Switching to faster model")
)
```

Result:

```
Observation text: Switched LLM profile to 'fast'. Future agent steps will use this profile.
Current model: gpt-4o-mini
```
Verified the second switch succeeded:

```python
assert conversation.agent.llm.model == "gpt-4o-mini"
```

Interpretation: Multiple profile switches work correctly. The conversation successfully transitioned from default → powerful → fast without issues.
Test 6: Visualization Methods
Verification:
Tested the `visualize` property on both `SwitchLLMAction` and `SwitchLLMObservation`:

- Action visualization: `Switch LLM profile: gpt-4o` / `Reason: Need more powerful model for complex reasoning`
- Success observation visualization: `Switched LLM profile: fast-model (gpt-4o-mini)`
- Error observation visualization: `Failed to switch LLM profile: nonexistent`

Result: ✓ All visualization methods produce correctly formatted Rich Text objects with appropriate styling.
Issues Found
None.
This QA report was created by an AI agent (OpenHands) on behalf of the user.
@OpenHands address review comments and then merge this PR.

I'm on it! neubig can track my progress at all-hands.dev

OpenHands encountered an error: Request timeout after 30 seconds to https://xielshjxxiiokogz.prod-runtime.all-hands.dev/api/conversations/605c645c-2dea-471e-92fc-e41c1996498e/ask_agent See the conversation for more information.
Summary
- Adds an optional built-in `SwitchLLMTool` that lets an agent switch the conversation to a saved LLM profile.
- The example calls `switch_llm` to move to Claude, and confirms the active model changed.

Validation
- `uv run pytest tests/sdk/tool/test_switch_llm.py tests/sdk/conversation/test_switch_model.py -q`
- `uv run pytest tests/sdk/tool/test_builtins.py tests/sdk/agent/test_agent_tool_init.py -q`
- `env -u LMNR_PROJECT_API_KEY -u LMNR_BASE_URL -u LMNR_FORCE_HTTP uv run pytest tests/sdk/tool/test_switch_llm.py -q`
- `env -u LMNR_PROJECT_API_KEY -u LMNR_BASE_URL -u LMNR_FORCE_HTTP uv run pre-commit run --files openhands-sdk/openhands/sdk/tool/builtins/switch_llm.py tests/sdk/tool/test_switch_llm.py examples/01_standalone_sdk/49_switch_llm_tool.py`
- `env -u LMNR_PROJECT_API_KEY -u LMNR_BASE_URL -u LMNR_FORCE_HTTP OPENHANDS_SUPPRESS_BANNER=1 LLM_API_KEY=... LLM_BASE_URL=https://llm-proxy.app.all-hands.dev uv run python examples/01_standalone_sdk/49_switch_llm_tool.py`

The example run started on `openai/gpt-5`, called `switch_llm` with `profile_name='example-claude'`, switched to `openai/prod/claude-sonnet-4-5-20250929`, and reported that active model. `EXAMPLE_COST: 0.034233`.

This pull request was updated by an AI agent (OpenHands) on behalf of the user.
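The `env -u` prefix in the validation commands above unsets the Laminar tracing variables for just that one command, without touching the parent shell. A minimal illustration of the mechanism (`DEMO_VAR` is a placeholder, not one of the real variables):

```shell
# `env -u VAR cmd` runs cmd with VAR removed from its environment.
export DEMO_VAR=set
result=$(env -u DEMO_VAR sh -c 'echo "${DEMO_VAR:-unset}"')
echo "$result"  # prints "unset": the child process never saw DEMO_VAR
echo "$DEMO_VAR"  # still "set" in the parent shell
```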
Agent Server images for this PR
• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server
Variants & Base Images
- eclipse-temurin:17-jdk
- nikolaik/python-nodejs:python3.13-nodejs22-slim
- golang:1.21-bookworm

Pull (multi-arch manifest)

```
# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:944e4c9-python
```

All tags pushed for this build
About Multi-Architecture Support
- Each variant tag (e.g. `944e4c9-python`) is a multi-arch manifest supporting both amd64 and arm64.
- Architecture-specific tags (e.g. `944e4c9-python-amd64`) are also available if needed.