Proposal: Structured Tool API

The current `ToolSpec` is just a type alias and isn’t very ergonomic or type-safe when defining tools. I’d like to propose a more structured and Swifty API for defining and using tools, inspired by [ollama-swift](https://github.com/loopwork-ai/ollama-swift).

This new design makes it easier to define, register, and invoke tools with full Codable support for inputs and outputs. It also encourages safe and readable usage by reducing boilerplate and typos.

## Example Usage

```swift
struct WeatherInput: Codable {
    let city: String
}

struct WeatherOutput: Codable {
    let temperature: Double
    let conditions: String
}

let weatherTool = Tool<WeatherInput, WeatherOutput>(
    name: "get_current_weather",
    description: """
    Get the current weather for a city,
    with conditions ("sunny", "cloudy", etc.)
    and temperature in °C.
    """,
    parameters: [
       .required("city", type: .string, description: "The city to get weather for")
    ]
) { input async throws -> WeatherOutput in
    // Implement weather lookup logic here
    return WeatherOutput(temperature: 18.5, conditions: "cloudy")
}
```
The `Tool` struct accepts a `name`, `description`, a list of `parameters`, and the actual `implementation` logic. It is generic over two types: one for the input and one for the output, both conforming to `Codable`. Internally, the parameters will eventually be represented as a dictionary. The initializer is just a more structured and type-safe way to define them.

```swift
let chat: [Chat.Message] = [
    .system("You are a helpful assistant that can check the weather."),
    .user("What's the weather like in Portland?")
]

let input = UserInput(
    chat: chat,
    tools: [weatherTool]
)

/* Generate output */

/* Handle tool call */
```

## Handling Tool Calls

Ollama includes tool calls as part of the final message output. From my experience, when tools are provided, only a single message is returned, even if streaming is enabled. It holds back the output until the full response is generated, and only then returns the message along with any tool call metadata. While I'm not entirely sure how we could replicate this behavior, I think it's a practical and user-friendly approach and we should consider adopting a similar solution.
```swift
// ⚠️ Ollama Swift's Tool Call Handling
if let toolCalls = response.message.toolCalls {
    for toolCall in toolCalls {
        print("Tool called: \(toolCall.function.name)")
        print("Arguments: \(toolCall.function.arguments)")
    }
}
```

I’d like to discuss this with you. What are your thoughts?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Proposal: Structured Tool API #308

Example Usage

Handling Tool Calls

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Proposal: Structured Tool API #308

Description

Example Usage

Handling Tool Calls

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions