feat(providers): add Google Gemini TTS provider by YTLLL · Pull Request #1828 · moeru-ai/airi

YTLLL · 2026-05-13T05:36:39Z

Description

This PR replaces #1824 with a cleaner branch history.

It adds a Google Gemini API text-to-speech provider to AIRI's speech provider list.

Included:

Google Gemini TTS provider metadata
Static Gemini TTS model list
Static Gemini prebuilt voice list
API key/base URL validation
Gemini generateContent TTS request handling
PCM-to-WAV conversion for playback compatibility
Related provider/settings layout fixes discovered during review/testing

The settings layout fixes are included here because they affect the same provider settings surface needed to configure and use the new Gemini TTS provider.

Linked Issues

Replaces #1824

Additional Context

Tested with:

pnpm lint
pnpm typecheck

github-actions · 2026-05-13T05:37:08Z

⏳ Approval required for deploying to Cloudflare Workers (Preview) for stage-web.

Name	Link
🔭 Waiting for approval	For maintainers, approve here

Hey, maintainers, kindly take some time to review and approve this deployment when you are available. Thank you! 🙏

gemini-code-assist

Code Review

This pull request introduces the Google Gemini Speech provider, adding a new settings UI, store registration, and a core implementation that includes a custom fetch adapter to handle Gemini's text-to-speech API and PCM-to-WAV conversion. The changes also include comprehensive unit tests and locale updates, alongside a minor refactor to standardize CSS classes by replacing 'of-x-auto' with 'overflow-x-auto' across several settings modules. Feedback was provided regarding an inconsistency in base URL normalization where the current logic removes trailing slashes, potentially conflicting with the validator's expectations.

The previous implementation stripped trailing slashes from the base URL, inconsistent with the baseUrlValidator (which requires trailing slashes) and all other providers (openai-compatible-builder, openrouter/audio-speech). Align normalizeBaseUrl, createAudioFetch, and createSpeechProvider with the project convention. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

leiyutian added 2 commits May 13, 2026 13:34

feat(providers): add Google Gemini TTS provider

963f160

fix(settings): prevent provider settings layout overflow

f62f849

YTLLL mentioned this pull request May 13, 2026

feat(providers): add Google Gemini TTS provider #1824

Closed

4 tasks

github-actions Bot added apps/stage-web Web App: PWA & Browser feature Related to feature scope/audio-output Scope related to audio output (TTS, Voice cloning, etc.) scope/i18n scope/providers Scope related to providers we support labels May 13, 2026

gemini-code-assist Bot reviewed May 13, 2026

View reviewed changes

Comment thread packages/stage-ui/src/stores/providers/google-gemini-speech.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(providers): add Google Gemini TTS provider#1828

feat(providers): add Google Gemini TTS provider#1828
YTLLL wants to merge 3 commits into
moeru-ai:mainfrom
YTLLL:feat/google-gemini-tts-provider-clean

YTLLL commented May 13, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 13, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

YTLLL commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Linked Issues

Additional Context

Uh oh!

github-actions Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⏳ Approval required for deploying to Cloudflare Workers (Preview) for stage-web.

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

YTLLL commented May 13, 2026 •

edited

Loading

github-actions Bot commented May 13, 2026 •

edited

Loading