AI Agent: LLM capability matrix + multimodal tool results

__Is your feature request related to a problem? Please describe.__

The native provider implementations from Phase 1 hardcode their capability profile (modalities, context window, max output tokens, feature flags). The capabilities of an LLM depend on the combination of model and backend (a given Claude model on Bedrock can differ from the same model on the direct API in context window or regional rate limits), so the hardcoded approach won't scale beyond the smallest viable shipping set. Tool-result documents that contain images or PDFs are also still routed through the synthetic-UserMessage fallback regardless of whether the target model can read them natively.

__Describe the solution you'd like__

Introduce a configuration-driven LLM capability matrix that describes each supported (api family, backend, model) tuple. Wire each native chat model implementation to consult the matrix at request time, and route documents inside tool-call results to either native multimodal emission or the existing fallback path based on the resolved capabilities.

__Describe alternatives you've considered__

See the parent epic.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AI Agent: LLM capability matrix + multimodal tool results #7214

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

AI Agent: LLM capability matrix + multimodal tool results #7214

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions