bug: reasoning_effort wrapped in chat_template_kwargs breaks real OpenAI API backends #1901

### Describe the bug

Since PR #1116, the `reasoning_effort` family type unconditionally wraps the effort value inside `chat_template_kwargs`, regardless of whether the backend is a vLLM-hosted model or api.openai.com. The real OpenAI API does not recognise `chat_template_kwargs` and rejects the request with a 400.

### To Reproduce

  1. Configure a model with `reasoning_family: gpt (type: reasoning_effort)` pointing to an api.openai.com backend
  2. Route a request that triggers a decision with `use_reasoning: true`
  3. Observe a 400 from OpenAI: `"Unknown parameter: 'chat_template_kwargs'"`

### Expected behavior

For backends targeting api.openai.com, `reasoning_effort` should be sent as a plain top-level field:
`{ "reasoning_effort": "high" }`

The current behaviour (correct for vLLM-hosted models) sends:
`{ "chat_template_kwargs": { "reasoning_effort": "high" } }`
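
A minimal sketch of the backend-aware placement described above, written in Python for illustration only. The names `apply_reasoning_effort`, `is_openai_backend`, and `OPENAI_HOSTS` are hypothetical and do not come from the project, and matching on the hostname is just one assumed way to tell the two kinds of backend apart; an explicit per-model config flag might be a better fit.

```python
from urllib.parse import urlparse

# Assumption: the real OpenAI API is identified by its hostname.
OPENAI_HOSTS = {"api.openai.com"}


def is_openai_backend(base_url: str) -> bool:
    """Return True when base_url points at the real OpenAI API.

    A URL without a scheme has no parsed hostname and is treated as non-OpenAI.
    """
    host = urlparse(base_url).hostname or ""
    return host in OPENAI_HOSTS


def apply_reasoning_effort(body: dict, effort: str, base_url: str) -> dict:
    """Return a copy of body with the effort placed where the backend expects it."""
    out = dict(body)
    if is_openai_backend(base_url):
        # Real OpenAI API: plain top-level field.
        out["reasoning_effort"] = effort
    else:
        # vLLM-hosted model: keep the current wrapping, preserving other kwargs.
        kwargs = dict(out.get("chat_template_kwargs") or {})
        kwargs["reasoning_effort"] = effort
        out["chat_template_kwargs"] = kwargs
    return out
```

With this shape, a request routed to `https://api.openai.com/v1` carries only the top-level `reasoning_effort`, while a request routed to any other base URL keeps the `chat_template_kwargs` wrapping that vLLM-hosted models rely on.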


### Affected layer

None

### Additional context

_No response_
