Gemini: Add option to specify config_entry in generate_content service #143776


Open · wants to merge 2 commits into base: dev
Conversation

SLaks
Contributor

SLaks commented Apr 27, 2025

This was requested for #140769.

I marked the new parameter as required in services.yaml (to require it in the UI) but optional in the implementation schema (to not fail on existing calls).

This respects the config entry's model parameters, but ignores prompt and tool options, since they require HA's Conversation infrastructure to handle correctly (to render templates and add parameters to the system prompt, or to call tools).
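A minimal sketch of what that split could look like on the services.yaml side (field names and selector shape are illustrative, not the final PR contents; the runtime voluptuous schema would keep config_entry optional):

```yaml
generate_content:
  fields:
    config_entry:
      required: true          # required in the UI only
      selector:
        config_entry:
          integration: google_generative_ai_conversation
    prompt:
      required: true
      selector:
        text:
```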

Warning: This is a breaking change! All calls to this action will now use the safety settings and model specified in the first instance of the integration, even if the action doesn't pass config_entry.

This breaking change is easy to avoid if we want, though it would make the code a bit messier.
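The fallback behaviour described above amounts to roughly this (a sketch with stand-in dicts; the real implementation would go through HA's config-entry APIs):

```python
def pick_config_entry(call_data: dict, entries: list[dict]) -> dict:
    """Return the entry named in the service call, else the first instance."""
    entry_id = call_data.get("config_entry")
    if entry_id is None:
        # Breaking-change path: calls that omit config_entry silently
        # inherit the model and safety settings of the first instance.
        return entries[0]
    for entry in entries:
        if entry["entry_id"] == entry_id:
            return entry
    raise ValueError(f"Unknown config entry: {entry_id}")
```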

Breaking change

Proposed change

Type of change

  • Dependency upgrade
  • Bugfix (non-breaking change which fixes an issue)
  • New integration (thank you!)
  • New feature (which adds functionality to an existing integration)
  • Deprecation (breaking change to happen in the future)
  • Breaking change (fix/feature causing existing functionality to break)
  • Code quality improvements to existing code or addition of tests

Additional information

Checklist

  • The code change is tested and works locally.
  • Local tests pass. Your PR cannot be merged unless tests pass
  • There is no commented out code in this PR.
  • I have followed the development checklist
  • I have followed the perfect PR recommendations
  • The code has been formatted using Ruff (ruff format homeassistant tests)
  • Tests have been added to verify that the new code works.

If user exposed functionality or configuration variables are added/changed:

If the code communicates with devices, web services, or third-party tools:

  • The manifest file has all fields filled out correctly.
    Updated and included derived files by running: python3 -m script.hassfest.
  • New or updated dependencies have been added to requirements_all.txt.
    Updated by running python3 -m script.gen_requirements_all.
  • For the updated dependencies - a link to the changelog, or at minimum a diff between library versions is added to the PR description.

To help with the load of incoming pull requests:

@home-assistant

Hey there @tronikos, @IvanLH, mind taking a look at this pull request as it has been labeled with an integration (google_generative_ai_conversation) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of google_generative_ai_conversation can trigger bot actions by commenting:

  • @home-assistant close Closes the pull request.
  • @home-assistant rename Awesome new title Renames the pull request.
  • @home-assistant reopen Reopen the pull request.
  • @home-assistant unassign google_generative_ai_conversation Removes the current integration label and assignees on the pull request, add the integration domain after the command.
  • @home-assistant add-label needs-more-information Add a label (needs-more-information, problem in dependency, problem in custom component) to the pull request.
  • @home-assistant remove-label needs-more-information Remove a label (needs-more-information, problem in dependency, problem in custom component) on the pull request.

@IvanLH
Contributor

IvanLH commented Apr 29, 2025

From what I read in the code, it doesn't seem possible to select different models for the action and for the conversation agent. Not all models have image output, for example. We need not concern ourselves with Veo and Imagen for the GenerateContent action, as those are purely media targeted and use different SDK methods (generate_videos and generate_images).

However, it is possible for users to want to use, say, Gemini 2.5 Flash for their conversation agent, since it has the best results on our eval set, and 2.0 Flash for the action to be able to generate images.

@tronikos what do you think? I am myself partial to making the model a parameter of the action itself, not reading it from the config entry. Coupling the two seems odd even from a UX perspective; they appear to be separate things in the UI.

Edit: https://ai.google.dev/gemini-api/docs/models#model-variations is the source for the different Gemini models and their capabilities.

@IvanLH
Contributor

IvanLH commented Apr 29, 2025


Upon further reading, I see we ask users to specify the config_entry. What would that look like? Do they need to specify the full JSON structure?

@SLaks
Contributor Author

SLaks commented Apr 29, 2025

It looks like this:

[screenshot: the config_entry selector shown in the service UI]

You would need to add a second instance of the integration from Devices, copy in the same API key, and select a different model from the Configure dialog there. The full workflow (which I would add to the docs in the follow-up image generation PR) would be:

  1. Find your API key (e.g., from core.config_entries)
  2. Devices, Google Generative AI, Add Service
  3. Paste in your API key
  4. Rename the new entry to something more meaningful than 2x Google Generative AI (e.g., rename it to Images)
  5. Click Configure in the new entry
  6. Uncheck Recommended model settings
  7. Submit
  8. Select Gemini 2.0 Flash (Image Generation) Experimental in Model
  9. Submit a second time

I agree that this is not ideal; I would recommend adding an optional Model dropdown in the service call. OTOH, if you use the service in a number of places, using the config entry would make it easier to change models for all of them at once.

@IvanLH
Contributor

IvanLH commented Apr 29, 2025

I would much prefer a Model dropdown on the service call. I think it's not terrible if you have to manually update the model for an action you have: the models behave subtly differently, and you probably want to test how they behave in the specific action you are editing.

This would also allow us to error out if the user selects image output with a model that does not support such capabilities.

@SLaks
Contributor Author

SLaks commented Apr 29, 2025

@tronikos WDYT?

@SLaks
Contributor Author

SLaks commented Apr 29, 2025

This would also allow us to error out if the user selects image output with a model that does not support such capabilities.

We can do that either way (by looking up the model from the config entry). However, AFAIK, the Google API does not expose supported output formats in the list of models.


There is one downside to specifying a model instead of a config entry: There is no way to specify safety settings. WDYT?

@IvanLH
Contributor

IvanLH commented Apr 30, 2025

I think safety settings should also not be global; they are more of a per-action thing. And while it's true that we can show the error, it would not appear when the model is selected, since in that flow we have no way of knowing whether there is an action with image generation.

The user can be notified, but now they have to navigate to a whole different page to fix it.

@tronikos
Member

I prefer specifying config_entry mostly for consistency with the OpenAI integration. Ideally we need to support media as input and output to all the different LLM integrations using a unified framework. I think there is an architecture discussion about it.

@SLaks
Contributor Author

SLaks commented Apr 30, 2025

Do we have a consensus here?

FYI, Gemini uses an entirely different API for videos, with different parameters, so we probably won't be able to reuse this action for generating videos as well.

@IvanLH
Contributor

IvanLH commented May 1, 2025

There's also Imagen for images; we can discuss how to choose the correct image API in the upcoming PR. I've asked some other folks if they want to chime in. Matching OpenAI would be nice, but I personally think we should explore a way to allow multi-model setups without resorting to extravagant multiple config entries stemming from multiple "devices".

@SLaks
Contributor Author

SLaks commented May 2, 2025

WDYT of adding two params, so the user can specify either a config entry (and apply its safety settings) or an arbitrary model, but not both?

Beware: If I add a model dropdown, I will need to remove services.yaml and register this service in code, so that I can fetch the models from the Gemini API and populate the dropdown options.
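As an illustration of the dynamic-dropdown idea (a hypothetical helper, not the PR's code: `model_names` stands in for whatever the Gemini models listing returns, and the option shape mirrors HA's select-style options):

```python
def build_model_options(model_names: list[str]) -> list[dict[str, str]]:
    """Turn raw model ids into dropdown options, skipping media-only models."""
    return [
        {"value": name, "label": name.removeprefix("models/")}
        for name in model_names
        # Imagen and Veo use different SDK methods, so leave them out.
        if "imagen" not in name and "veo" not in name
    ]
```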

@IvanLH
Contributor

IvanLH commented May 2, 2025

I like that: users get to choose whether they want to rely on the global config or specify things themselves.

UI for the model params is going to be tricky, could we hide it like on the current settings?

@SLaks
Contributor Author

SLaks commented May 2, 2025

UI for the model params is going to be tricky, could we hide it like on the current settings?

No; services aren't interactive (we can only render one screen).

But we can put either one in a collapsible section.
