You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Gemma 4 supports structured thinking, where the model can reason step-by-step before producing a final answer. The reasoning process is exposed via the `reasoning_content` field in the API response.
481
+
Gemma 4 supports structured thinking, where the model can reason step-by-step before producing a final answer. The reasoning process is exposed via the `reasoning` field in the API response (vLLM 0.18+) or `reasoning_content` in older vLLM versions.
482
+
483
+
> ⚠️ **Important: Required Configuration**
484
+
>
485
+
> When using thinking mode, you **must** include `"skip_special_tokens": False` in the `extra_body` parameter. Without this setting, the reasoning special tokens will be stripped and the thinking output will not be properly captured.
Gemma 4 supports function calling with a dedicated tool-call protocol using custom special tokens (`<|tool_call|>`, `<tool_call|>`, etc.).
547
556
557
+
> ⚠️ **Important: Required Configuration**
558
+
>
559
+
> When using function calling, you **must** include `"skip_special_tokens": False` in the `extra_body` parameter. Without this setting, the tool-call special tokens will be stripped and function calling will not work properly.
0 commit comments