[ML] Default endpoint allocations are now configurable #127783

prwhelan · 2025-05-06T20:44:57Z

Min and max allocations for the default endpoints' adaptive allocation settings are now configurable via a setting.

This is intended to help new clusters running on laptops and in serverless.

This does not automatically increase or reduce the default endpoints to those values - we still need a cluster reboot to set the values.

Relate #124653

Min and max allocations for the default endpoints' adaptive allocation settings are now configurable via a setting. This is intended to help new clusters running on laptops and in serverless. This does not automatically increase or reduce the default endpoints to those values - we still need a cluster reboot to set the values. Relate elastic#124653

elasticsearchmachine · 2025-05-06T20:45:22Z

Hi @prwhelan, I've created a changelog YAML for you.

elasticsearchmachine · 2025-05-06T22:50:29Z

Pinging @elastic/ml-core (Team:ML)

jonathan-buttner

If I understand correctly, these changes will allow modifying the default endpoints. I know we've brought up concerns about modifying the defaults because they affect everyone.

I think there's also consistency issues here because we persist the default inference endpoints to the inference index. We only do that if the id is missing. So if a user does a GET _inference/_all, we'll persist the default inference endpoints the first time, then if they update the settings, reboot, and then issue another GET _inference/_all we won't persist them again and I don't think they'll see the new values they set via the settings.

I'm not totally sure but I suspect it will also affect usage beyond just retrieving the default endpoints.

We'd need to delete the default endpoints and recreate them. I think we should revisit whether we need to persist the default endpoints at all but that's another discussion 😄

jonathan-buttner · 2025-05-07T13:53:25Z

...n/java/org/elasticsearch/xpack/core/ml/inference/assignment/AdaptiveAllocationsSettings.java

+        0,
+        0,
+        32,
+        Setting.Property.Dynamic,


I think dynamic will only be useful if we're going to allow changes without having to reboot. I think we typically add a addSettingsUpdateConsumer call so we can listen for the changes for example:

https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/external/http/HttpSettings.java#L48

prwhelan · 2025-05-20T13:57:41Z

We discussed this and determined that what we are trying to accomplish - setting the default endpoint min allocations to 1 so that an ML node is always active - can be implemented using the existing hardware profiles by setting the min ML node to 1.

We can revisit this as needed

prwhelan added >enhancement :ml Machine learning Team:ML Meta label for the ML team auto-backport Automatically create backport pull requests when merged v8.19.0 v9.1.0 labels May 6, 2025

Update docs/changelog/127783.yaml

4202d49

prwhelan marked this pull request as ready for review May 6, 2025 22:50

jonathan-buttner requested changes May 7, 2025

View reviewed changes

Merge branch 'main' into ml/settings

972e309

prwhelan closed this May 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Default endpoint allocations are now configurable #127783

[ML] Default endpoint allocations are now configurable #127783

Uh oh!

prwhelan commented May 6, 2025

Uh oh!

elasticsearchmachine commented May 6, 2025

Uh oh!

elasticsearchmachine commented May 6, 2025

Uh oh!

jonathan-buttner left a comment •

edited

Loading

Uh oh!

jonathan-buttner May 7, 2025

Uh oh!

prwhelan commented May 20, 2025

Uh oh!

Uh oh!

[ML] Default endpoint allocations are now configurable #127783

[ML] Default endpoint allocations are now configurable #127783

Uh oh!

Conversation

prwhelan commented May 6, 2025

Uh oh!

elasticsearchmachine commented May 6, 2025

Uh oh!

elasticsearchmachine commented May 6, 2025

Uh oh!

jonathan-buttner left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jonathan-buttner May 7, 2025

Choose a reason for hiding this comment

Uh oh!

prwhelan commented May 20, 2025

Uh oh!

Uh oh!

jonathan-buttner left a comment •

edited

Loading