Commit 79e913d

fix(openai): remove duplicate schema from messages in JSON_SCHEMA mode (#1761)
Removes redundant schema information from messages when using `JSON_SCHEMA` mode.

### Why This Change?

**JSON mode** (`response_format: {"type": "json_object"}`) - The OpenAI docs require an explicit JSON instruction in the messages, since no schema is provided in `response_format`. https://platform.openai.com/docs/guides/structured-outputs?api-mode=chat#json-mode

> When using JSON mode, you must always instruct the model to produce JSON via some message in the conversation, for example via your system message. If you don't include an explicit instruction to generate JSON, the model may generate an unending stream of whitespace and the request may run continually until it reaches the token limit. To help ensure you don't forget, the API will throw an error if the string "JSON" does not appear somewhere in the context.

**JSON_SCHEMA mode** (`response_format: {"type": "json_schema", ...}`) - The schema is already provided in `response_format`. Adding the same schema to the messages creates redundancy, increases token consumption unnecessarily, and provides no additional value to the model.

### Changes

- `JSON_SCHEMA` mode: no schema is added to the messages (the schema is already in `response_format`)
- `JSON` and `MD_JSON` modes: unchanged behavior (the schema is still added to the messages, as required)

----

> [!IMPORTANT]
> Removes the redundant schema from messages in `JSON_SCHEMA` mode in `handle_json_modes()` in `utils.py`, reducing token consumption.
>
> - **Behavior**:
>   - In `handle_json_modes()` in `utils.py`, `JSON_SCHEMA` mode no longer adds the schema to messages, as it is already in `response_format`.
>   - `JSON` and `MD_JSON` modes remain unchanged, still adding the schema to messages.
> - **Rationale**:
>   - Reduces redundancy and token consumption in `JSON_SCHEMA` mode by not duplicating the schema in messages.
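To make the distinction concrete, here is a minimal sketch of the two request payload shapes, built as plain dicts for illustration. The payload structures follow the OpenAI structured-outputs docs linked above; the example schema, prompt text, and `User` name are invented for this sketch.

```python
# Example JSON Schema used by both modes (invented for illustration).
schema = {
    "type": "object",
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
    "required": ["name", "age"],
    "additionalProperties": False,
}

# JSON mode: no schema travels in response_format, so the schema (and the
# word "JSON") must appear somewhere in the messages themselves.
json_mode_kwargs = {
    "response_format": {"type": "json_object"},
    "messages": [
        {
            "role": "system",
            "content": f"Reply in JSON matching this schema:\n{schema}",
        },
        {"role": "user", "content": "Extract: John is 30."},
    ],
}

# JSON_SCHEMA mode: the schema already rides along in response_format,
# so repeating it in the messages only burns tokens.
json_schema_kwargs = {
    "response_format": {
        "type": "json_schema",
        "json_schema": {"name": "User", "strict": True, "schema": schema},
    },
    "messages": [
        {"role": "user", "content": "Extract: John is 30."},
    ],
}
```

Either dict would be splatted into a chat-completions call as keyword arguments; the point is that only the JSON-mode request needs the schema text duplicated in a message.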
2 parents a9b25e7 + 90dd5b7 commit 79e913d

File tree

1 file changed: +17 −16 lines


instructor/providers/openai/utils.py

Lines changed: 17 additions & 16 deletions
```diff
@@ -439,22 +439,23 @@ def handle_json_modes(
         )
         new_kwargs["messages"] = merge_consecutive_messages(new_kwargs["messages"])

-        if new_kwargs["messages"][0]["role"] != "system":
-            new_kwargs["messages"].insert(
-                0,
-                {
-                    "role": "system",
-                    "content": message,
-                },
-            )
-        elif isinstance(new_kwargs["messages"][0]["content"], str):
-            new_kwargs["messages"][0]["content"] += f"\n\n{message}"
-        elif isinstance(new_kwargs["messages"][0]["content"], list):
-            new_kwargs["messages"][0]["content"][0]["text"] += f"\n\n{message}"
-        else:
-            raise ValueError(
-                "Invalid message format, must be a string or a list of messages"
-            )
+        if mode != Mode.JSON_SCHEMA:
+            if new_kwargs["messages"][0]["role"] != "system":
+                new_kwargs["messages"].insert(
+                    0,
+                    {
+                        "role": "system",
+                        "content": message,
+                    },
+                )
+            elif isinstance(new_kwargs["messages"][0]["content"], str):
+                new_kwargs["messages"][0]["content"] += f"\n\n{message}"
+            elif isinstance(new_kwargs["messages"][0]["content"], list):
+                new_kwargs["messages"][0]["content"][0]["text"] += f"\n\n{message}"
+            else:
+                raise ValueError(
+                    "Invalid message format, must be a string or a list of messages"
+                )

     return response_model, new_kwargs
```
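The guard the commit adds can be sketched as a standalone function. This is a simplified re-creation of the patched logic for illustration, not the library's actual code: `Mode` is stubbed with a local enum, and the schema-bearing `message` string is passed in directly rather than built from the response model.

```python
from enum import Enum, auto


class Mode(Enum):
    """Stub of instructor's Mode enum, reduced to the modes in this commit."""
    JSON = auto()
    MD_JSON = auto()
    JSON_SCHEMA = auto()


def inject_schema_message(messages: list[dict], message: str, mode: Mode) -> list[dict]:
    """Add the schema instruction to messages, unless mode is JSON_SCHEMA.

    Mirrors the patched branch: JSON_SCHEMA skips injection entirely because
    the schema is already carried by response_format.
    """
    if mode != Mode.JSON_SCHEMA:
        if messages[0]["role"] != "system":
            # No system message yet: prepend one carrying the instruction.
            messages.insert(0, {"role": "system", "content": message})
        elif isinstance(messages[0]["content"], str):
            # Plain-string system content: append the instruction.
            messages[0]["content"] += f"\n\n{message}"
        elif isinstance(messages[0]["content"], list):
            # Multi-part content: append to the first text part.
            messages[0]["content"][0]["text"] += f"\n\n{message}"
        else:
            raise ValueError(
                "Invalid message format, must be a string or a list of messages"
            )
    return messages


# JSON mode still injects the instruction; JSON_SCHEMA leaves messages alone.
with_schema = inject_schema_message(
    [{"role": "user", "content": "hi"}], "Return JSON.", Mode.JSON
)
without_schema = inject_schema_message(
    [{"role": "user", "content": "hi"}], "Return JSON.", Mode.JSON_SCHEMA
)
```

Keeping the `JSON`/`MD_JSON` branches byte-for-byte identical under the new `if` is what makes this a pure behavior change for `JSON_SCHEMA` only.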
