
Vertex thinking changes to disable thinking mode for 2.5 models #1090


Open
wants to merge 3 commits into base: main

Conversation

narengogi (Collaborator)

@narengogi narengogi commented May 12, 2025

Makes changes to disable thinking mode in 2.5 models; also returns thinking tokens in the response per the OpenAI format.

Example request body to disable thinking:

{
    "model": "gemini-2.5-flash-preview-04-17",
    "max_tokens": 2000,
    "stream": true,
    "messages": [
        {
            "role": "user",
            "content": "What is the meaning of life, the universe and everything"
        }
    ],
    "thinking": {
        "type": "disabled",
        "budget_tokens": 0
    }
}

NOTE: users are required to explicitly send "type": "disabled" and "budget_tokens": 0
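The request-side mapping can be sketched as follows. This is a hypothetical illustration, not the PR's actual code: the function name, the Vertex field names (thinking_config, include_thoughts, thinking_budget), and the exact disable condition are all assumptions for the sake of the example.

```typescript
// Hypothetical sketch: map the OpenAI-style `thinking` block onto a Vertex-style
// thinking config. All names here are illustrative assumptions, not the PR's code.
interface ThinkingParam {
  type: 'enabled' | 'disabled';
  budget_tokens: number;
}

function toVertexThinkingConfig(thinking?: ThinkingParam) {
  if (!thinking) return {};
  // Thinking counts as off only when both fields say so explicitly,
  // matching the note above that users must send both values.
  const disabled =
    thinking.type === 'disabled' && thinking.budget_tokens === 0;
  return {
    thinking_config: {
      include_thoughts: !disabled,
      thinking_budget: thinking.budget_tokens,
    },
  };
}
```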


Important

PR Review Skipped

PR review skipped as per the configuration setting. Run a manual review by commenting /matter review

💡Tips to use Matter AI

Command List

  • /matter summary: Generate AI Summary for the PR
  • /matter review: Generate AI Reviews for the latest commit in the PR
  • /matter review-full: Generate AI Reviews for the complete PR
  • /matter release-notes: Generate AI release-notes for the PR
  • /matter <ask-question>: Chat with your PR with Matter AI Agent
  • /matter remember <recommendation>: Generate AI memories for the PR
  • /matter explain: Get an explanation of the PR
  • /matter help: Show the list of available commands and documentation

@narengogi narengogi requested review from csgulati09 and VisargD May 12, 2025 08:42

Code Quality type: new feature

Summary By MatterAI

🔄 What Changed

  • Added thoughtsTokenCount tracking for Vertex AI and Google providers
  • Implemented conditional thinking mode configuration
  • Enhanced token usage metadata tracking

🔍 Impact of the Change

  • More granular token usage reporting
  • Flexible thinking mode configuration
  • Improved token tracking for AI models

📁 Total Files Changed

  • 4 files modified:
    1. src/providers/google-vertex-ai/chatComplete.ts
    2. src/providers/google-vertex-ai/transformGenerationConfig.ts
    3. src/providers/google-vertex-ai/types.ts
    4. src/providers/google/chatComplete.ts

🧪 Test Added

  • No explicit new tests added
  • Existing test coverage assumed

🔒 Security Vulnerabilities

  • No direct security vulnerabilities detected
  • Improved configuration validation

Checklist

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • My changes generate no new warnings
  • New and existing unit tests pass locally with my changes

Sequence Diagram

sequenceDiagram
participant GoogleVertexAI as Vertex
participant TransformConfig as Config
participant ChatComplete as Chat

Vertex->>Config: transformGenerationConfig()
Config-->>Vertex: Configure thinking mode

Vertex->>Chat: chatComplete()
Chat-->>Vertex: Process token usage

Note over Vertex, Chat: Token Tracking
Vertex->Chat: Extract thoughtsTokenCount
Chat-->Vertex: Return token details


Code Quality new feature

Summary By MatterAI

🔄 What Changed

  • Added support for granular thinking mode configuration in Google Vertex AI
  • Introduced thoughtsTokenCount tracking
  • Implemented conditional thinking mode enablement

🔍 Impact of the Change

  • Provides more precise control over AI model's reasoning process
  • Enables token-level budget management for thinking mode
  • Enhances token usage reporting

📁 Total Files Changed

  • 2 files modified:
    1. src/providers/google-vertex-ai/chatComplete.ts
    2. src/providers/google/chatComplete.ts

🧪 Test Added

  • N/A (No explicit test cases provided in PR)

🔒 Security Vulnerabilities

  • No direct security vulnerabilities detected

Type of Change

  • New feature (non-breaking change which adds functionality)

Checklist

  • Code follows project style guidelines
  • Self-review performed
  • Comments added for complex logic
  • Documentation updated
  • No new warnings generated
  • Tests added to prove feature works
  • Unit tests pass locally

Sequence Diagram

sequenceDiagram
participant GoogleVertexAI as Vertex
participant ChatComplete as ChatAPI

Vertex->>ChatAPI: Request with thinking parameters
Note over Vertex: thinking.type = 'enabled'
Note over Vertex: thinking.budget_tokens configured
ChatAPI-->>Vertex: Response with thoughtsTokenCount
Vertex->>ChatAPI: Parse token details
Note over ChatAPI: Add completion_tokens_details
ChatAPI-->>Vertex: Return reasoning_tokens
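The response-side mapping in the diagram above can be sketched as follows. Per the PR description, Vertex reports thinking tokens as thoughtsTokenCount, and the OpenAI format exposes them under usage.completion_tokens_details.reasoning_tokens; the interface and function names below are illustrative assumptions, not the PR's code.

```typescript
// Hedged sketch: translate Vertex-style usage metadata into OpenAI-style usage.
// Field names on the Vertex side follow the PR description; shapes are illustrative.
interface VertexUsageMetadata {
  promptTokenCount: number;
  candidatesTokenCount: number;
  totalTokenCount: number;
  thoughtsTokenCount?: number; // present when thinking mode produced tokens
}

function toOpenAIUsage(u: VertexUsageMetadata) {
  const usage: Record<string, unknown> = {
    prompt_tokens: u.promptTokenCount,
    completion_tokens: u.candidatesTokenCount,
    total_tokens: u.totalTokenCount,
  };
  // Only attach the details object when the provider actually reported
  // thinking tokens, so non-thinking responses keep the familiar shape.
  if (u.thoughtsTokenCount !== undefined) {
    usage.completion_tokens_details = {
      reasoning_tokens: u.thoughtsTokenCount,
    };
  }
  return usage;
}
```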

@@ -52,7 +52,8 @@ export function transformGenerationConfig(params: Params) {
  if (params?.thinking) {
    const thinkingConfig: Record<string, any> = {};
-   thinkingConfig['include_thoughts'] = true;
+   thinkingConfig['include_thoughts'] =
Collaborator

Based on this vertex doc: https://cloud.google.com/vertex-ai/generative-ai/docs/thinking#budget

In order to disable thinking, you need to set the thinking budget to 0. I do not see any parameter named include_thoughts available.

Collaborator Author

@narengogi narengogi May 14, 2025


I found this parameter by looking inside the SDK; I'll find it and link it here
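Per the Vertex doc linked above, disabling thinking is done by setting the thinking budget to 0. A minimal payload sketch is below; the field names follow that doc, but the overall request shape here is illustrative, not taken from this PR.

```typescript
// Minimal sketch of a Vertex generateContent-style payload that disables
// thinking by setting thinkingBudget to 0, per the doc linked above.
// The request shape is illustrative only.
const requestBody = {
  contents: [{ role: 'user', parts: [{ text: 'Hello' }] }],
  generationConfig: {
    thinkingConfig: {
      thinkingBudget: 0, // 0 disables thinking on supported 2.5 models
    },
  },
};
```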

Labels
None yet
Projects
None yet

2 participants