
Structure completion request to maximize Prompt Caching #42805


Description

@brandonh-msft

Today, the flow of a request to an OpenAI service relies on simple JSON serialization of the options model to encode the message as BinaryData and send it through the pipeline.

This does not maximize Prompt Caching capabilities: for the cache to hit, the completion request should place tools first, then history, then new content, in that order.
Additionally, the tools and history must appear in the same order on every request (suggest alphabetical order by tool name).
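As a hedged sketch of that ordering (the type names here are hypothetical placeholders, not the SDK's types): sort the tool definitions by name and emit tools before messages, with the newest content last, so repeated requests share a byte-stable prefix:

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical minimal shapes, for illustration only; the SDK's
// ChatCompletionsOptions and tool types would be the real carriers.
record ToolDefinition(String name, Map<String, Object> parameters) { }
record ChatMessage(String role, String content) { }

final class CacheFriendlyPayload {
    /**
     * Builds a request body with a byte-stable prefix: "tools" first
     * (sorted alphabetically by name), then "messages" with prior history
     * before the new content. LinkedHashMap preserves insertion order,
     * so identical prefixes serialize identically across requests.
     */
    static Map<String, Object> build(List<ToolDefinition> tools,
                                     List<ChatMessage> history,
                                     ChatMessage latest) {
        List<ToolDefinition> sortedTools = new ArrayList<>(tools);
        sortedTools.sort(Comparator.comparing(ToolDefinition::name));

        List<ChatMessage> messages = new ArrayList<>(history);
        messages.add(latest); // new content always goes last

        Map<String, Object> body = new LinkedHashMap<>();
        body.put("tools", sortedTools);  // 1. tools, alpha order by name
        body.put("messages", messages);  // 2. history, then new content
        return body;
    }
}
```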

Sources:
https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/prompt-caching
https://openai.com/index/api-prompt-caching/
https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/prompt-caching#what-is-cached

The client asks for BinaryData from the options:

```java
return getChatCompletionsWithResponse(deploymentOrModelName, BinaryData.fromObject(chatCompletionsOptions),
```

which simply uses the default serialization implementation to turn the ChatCompletionsOptions into BinaryData:

```java
public static BinaryData fromObject(Object data) {
    return fromObject(data, SERIALIZER);
}

static final JsonSerializer SERIALIZER = JsonSerializerProviders.createInstance(true);
```
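One possible call-site workaround, sketched under assumptions (Jackson on the classpath, the hypothetical types from the earlier sketch, and the protocol overload quoted above being reachable on OpenAIClient), is to serialize an explicitly ordered body yourself instead of relying on the default serializer's field order:

```java
import com.azure.ai.openai.OpenAIClient;
import com.azure.core.http.rest.RequestOptions;
import com.azure.core.http.rest.Response;
import com.azure.core.util.BinaryData;
import com.fasterxml.jackson.core.JsonProcessingException;
import com.fasterxml.jackson.databind.ObjectMapper;

import java.util.List;
import java.util.Map;

final class CacheFriendlyCall {
    private static final ObjectMapper MAPPER = new ObjectMapper();

    /**
     * Serializes an explicitly ordered body and hands the pre-serialized
     * bytes to the protocol overload, bypassing the default serializer
     * (and therefore its field ordering).
     */
    static Response<BinaryData> send(OpenAIClient client, String deploymentOrModelName,
                                     List<ToolDefinition> tools, List<ChatMessage> history,
                                     ChatMessage latest) throws JsonProcessingException {
        Map<String, Object> body = CacheFriendlyPayload.build(tools, history, latest);
        // Jackson serializes a LinkedHashMap in insertion order, so the
        // "tools" field lands before "messages" in the JSON bytes.
        BinaryData request = BinaryData.fromString(MAPPER.writeValueAsString(body));
        return client.getChatCompletionsWithResponse(deploymentOrModelName, request, new RequestOptions());
    }
}
```

This only stabilizes the field order for one caller, of course; the ask in this issue is for the SDK's serialization path to produce that ordering for everyone.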

Additional context

microsoft/semantic-kernel#9444
openai/openai-dotnet#281
