Add prompt caching schema support for structured system/developer content

Specifically, I would like to enable caching for RooCode with Claude models, since it significantly reduces cost given the huge number of tokens involved.

Bedrock supports [prompt caching](https://docs.aws.amazon.com/bedrock/latest/userguide/prompt-caching.html) via `cachePoint` blocks, but the gateway currently only accepts string content for system and developer messages. The OpenAI SDK sends cache control hints as structured content:

```json
{
  "role": "system",
  "content": [
    {"type": "text", "text": "You are a helpful assistant.", "cache_control": {"type": "ephemeral"}}
  ]
}
```

The gateway currently rejects or ignores this structured format, so the cache hint never reaches Bedrock.

### Proposed solution

1. Add a `CacheControl` model and `cache_control` field on `TextContent` in the schema
2. Change `SystemMessage.content` and `DeveloperMessage.content` to accept `str | list[TextContent]`
3. Handle list-format system/developer messages in `_parse_system_prompts` — extract text and emit Bedrock `cachePoint` blocks when `cache_control` is set
4. Emit `cachePoint` in `_parse_content_parts` for user message content with cache markers
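
As a rough illustration of steps 1 to 3, the sketch below uses plain dataclasses as stand-ins for the real schema classes in `src/api/schema.py`, and a free function `parse_system_prompts` as a stand-in for `_parse_system_prompts`. The field defaults and the function signature are assumptions, not the gateway's actual code. The `{"cachePoint": {"type": "default"}}` block follows the shape Bedrock's Converse API documents for cache checkpoints.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class CacheControl:
    # Hypothetical model mirroring the "cache_control" hint in the request.
    type: str = "ephemeral"


@dataclass
class TextContent:
    text: str
    type: str = "text"
    cache_control: Optional[CacheControl] = None


@dataclass
class SystemMessage:
    # Accepts either a plain string or a list of text parts.
    content: Union[str, list[TextContent]]
    role: str = "system"


def parse_system_prompts(messages: list[SystemMessage]) -> list[dict]:
    """Build a Converse-style `system` list from system messages."""
    blocks: list[dict] = []
    for message in messages:
        if isinstance(message.content, str):
            # Existing behaviour: string content becomes one text block.
            blocks.append({"text": message.content})
            continue
        for part in message.content:
            blocks.append({"text": part.text})
            if part.cache_control is not None:
                # Emit a cache checkpoint right after the marked text part.
                blocks.append({"cachePoint": {"type": "default"}})
    return blocks
```

With this shape, a string system message passes through unchanged, while a list part carrying `cache_control` is followed by a `cachePoint` block.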

This allows clients using the OpenAI SDK's prompt caching pattern to have their cache hints translated to Bedrock's native `cachePoint` format.
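
For user message content (step 4), the translation could look roughly like the following. This is a sketch under assumptions: `parse_content_parts` is a simplified, hypothetical stand-in for `_parse_content_parts`, it works on raw request dicts rather than schema objects, and it only handles text parts.

```python
def parse_content_parts(parts: list[dict]) -> list[dict]:
    """Translate OpenAI-style text parts into Converse-style content blocks."""
    content: list[dict] = []
    for part in parts:
        if part.get("type") != "text":
            # Images and other part types are out of scope for this sketch.
            continue
        content.append({"text": part["text"]})
        if part.get("cache_control"):
            # Translate the client's cache hint into a Bedrock cache checkpoint.
            content.append({"cachePoint": {"type": "default"}})
    return content


user_parts = [
    {"type": "text", "text": "Large shared context.", "cache_control": {"type": "ephemeral"}},
    {"type": "text", "text": "What changed in this file?"},
]
bedrock_content = parse_content_parts(user_parts)
```

Here the checkpoint lands directly after the marked part, so the unmarked question that follows stays outside the cached prefix.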

**Files:** `src/api/schema.py`, `src/api/models/bedrock.py`

Add prompt caching schema support for structured system/developer content #229
