Avoid parsing upstream respose using Pydantic where possible

See Mistral adapter:

```py
def _response_to_dict(obj: ChatCompletionChunk | ChatCompletion) -> dict:
    return obj.to_dict(warnings=False)

async def chat_completion(
    *, request: dict, client: AsyncAzureOpenAI | AsyncOpenAI
) -> AsyncIterator[dict] | dict:
    response: (
        AsyncStream[ChatCompletionChunk] | ChatCompletion
    ) = await call_with_extra_body(client.chat.completions.create, request)

    if isinstance(response, AsyncStream):
        raw_stream = map_stream(_response_to_dict, response)
        return extract_reasoning_content(raw_stream)
    else:
        return extract_reasoning_content(_response_to_dict(response))
```

Here, `openai` library does parsing for the response, and then we convert the models back to JSON.
This round trip could be avoided completely if we use `client.chat.completions.with_raw_response.create` instead, which doesn't do any parsing.

Likewise for other adapters that do not require response parsing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid parsing upstream respose using Pydantic where possible #446

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Avoid parsing upstream respose using Pydantic where possible #446

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions