Update model definitions for latest OpenAI, Gemini, and Claude models #341

Open
williamjameshandley wants to merge 3 commits into master from feat/update-model-definitions

Conversation

@williamjameshandley

Contributor

Summary

  • OpenAI: Added gpt-5.4-mini (cheaper/faster GPT-5.4, ~1.05M context, reasoning/vision/caching), gpt-5.4-nano (cheapest/fastest GPT-5.4, API-only), and gpt-5.3-codex-spark (faster Codex variant, research preview, >1000 tok/s)
  • Gemini: Added gemini-3.1-flash-live-preview (real-time audio-to-audio model for Live API, low-latency streaming)
  • Claude aliases: Updated opus alias from claude-opus-4-5-20251101 to claude-opus-4-6, and sonnet alias from claude-sonnet-4-5-20250929 to claude-sonnet-4-6

Files changed

  • src/mcp_handley_lab/llm/providers/openai/models.yaml — 3 new model entries
  • src/mcp_handley_lab/llm/providers/gemini/models.yaml — 1 new model entry
  • src/mcp_handley_lab/llm/registry.py — updated 2 alias mappings

Test plan

  • Verify YAML files parse correctly
  • Verify new models appear in model listing
  • Verify opus and sonnet aliases resolve to the new model IDs
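The alias and listing checks above could be automated along these lines. This is a minimal sketch, not the project's actual code: `ALIASES`, `KNOWN_MODELS`, `resolve_alias`, and `check_aliases` are hypothetical stand-ins, and the real structures in `registry.py` and the `models.yaml` files may look different.

```python
# Hypothetical stand-ins for the alias table and the model listing.
# Only the model IDs are taken from this PR; the data layout is assumed.
ALIASES = {
    "opus": "claude-opus-4-6",
    "sonnet": "claude-sonnet-4-6",
}

KNOWN_MODELS = {
    "gpt-5.4-mini",
    "gpt-5.4-nano",
    "gpt-5.3-codex-spark",
    "gemini-3.1-flash-live-preview",
    "claude-opus-4-6",
    "claude-sonnet-4-6",
}


def resolve_alias(name: str) -> str:
    """Return the concrete model ID for an alias, or the name unchanged."""
    return ALIASES.get(name, name)


def check_aliases() -> list[str]:
    """Return one message per alias whose target is not a known model."""
    return [
        f"{alias} -> {target} (unknown model)"
        for alias, target in ALIASES.items()
        if target not in KNOWN_MODELS
    ]
```

An empty list from `check_aliases()` means every alias points at a listed model, which makes a stale alias target fail loudly in a test rather than at request time.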

🤖 Generated with Claude Code

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 9810b0e6ce

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +56 to +59
gemini-3.1-flash-live-preview:
  # Model metadata
  description: "Real-time audio-to-audio model for Live API with low-latency streaming"
  capabilities: "🔴 Best for: Real-time audio conversations, low-latency voice interaction, Live API"

P2: Gate Live API model behind realtime adapter

gemini-3.1-flash-live-preview is now exposed as a regular Gemini model, but our Gemini adapters only use client.models.generate_content(...) and do not implement a Live API/realtime session path. Since this model is explicitly a Live API audio-to-audio model, selecting it through normal mcp-chat generation will hit a runtime API error instead of returning output. Please either hide this model from standard generation/image-analysis routing or add a dedicated realtime adapter before registering it here.
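One way to hide the model from standard routing, as the review suggests, is to tag each entry with the API surface that serves it and filter on that tag. The sketch below is illustrative only: the `api` field, `ModelEntry`, `models_for_generation`, `require_generation_model`, and the placeholder model name are all assumptions, not part of the existing adapters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    name: str
    # Hypothetical field: which API surface serves this model.
    api: str = "generate_content"


REGISTRY = [
    ModelEntry("gemini-example-standard"),  # placeholder name, not a real model
    ModelEntry("gemini-3.1-flash-live-preview", api="live"),
]


def models_for_generation(entries):
    """List only the models that the standard generation path can serve."""
    return [e.name for e in entries if e.api == "generate_content"]


def require_generation_model(name, entries):
    """Return the entry for `name`, rejecting models that need another API."""
    for entry in entries:
        if entry.name == name:
            if entry.api != "generate_content":
                raise ValueError(
                    f"{name} requires the {entry.api!r} API and cannot be "
                    "used for standard generation"
                )
            return entry
    raise KeyError(name)
```

With a guard like this, selecting the Live model through normal chat would fail with a clear local error before any request is sent, and a realtime adapter could later claim the `live` entries.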

Useful? React with 👍 / 👎.

williamjameshandley and others added 2 commits April 12, 2026 10:38
- Add missing supports_vision: true to gpt-5.4-nano
- Split gemini-3.1-flash-live-preview output pricing into text/audio
- Fix stale usage note: Sonnet 4.5 → Sonnet 4.6

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
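Splitting output pricing by modality means cost estimation needs two output rates instead of one. A minimal sketch of that arithmetic follows, assuming rates are expressed in USD per million tokens; the function name and parameters are hypothetical, and no real prices are implied.

```python
def output_cost_usd(
    text_tokens: int,
    audio_tokens: int,
    text_rate_per_mtok: float,
    audio_rate_per_mtok: float,
) -> float:
    """Estimate output cost when text and audio tokens are billed at different rates.

    Rates are in USD per million tokens (an assumed convention).
    """
    total = text_tokens * text_rate_per_mtok + audio_tokens * audio_rate_per_mtok
    return total / 1_000_000
```

For example, with placeholder rates of 2.0 for text and 8.0 for audio, one million text tokens plus half a million audio tokens come to 2.0 + 4.0 = 6.0.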
@williamjameshandley williamjameshandley enabled auto-merge (squash) April 12, 2026 11:03

1 participant