Skip to content

Memory Consumption Issue with Elastic Agent on Kubernetes with high number of resources #5991

@rhr323

Description

@rhr323

Issue Summary

In our testing on the serverless platform, we aimed to assess the maximum number of projects that can be supported on a single MKI cluster. We were using the Elastic Agent version 8.15-4-SNAPSHOT to mitigate previously identified memory issues.

Most Elastic Agent instances functioned without issues. However, on nodes hosting vector search projects, where a larger number of Elasticsearch instances and their associated Kubernetes resources (e.g., pods, deployments, services, secrets) are allocated, we observed the Elastic Agent running out of memory. This typically occurred when these nodes were hosting around 100 Elasticsearch instances.

Observed Behavior

  • Elastic Agent on high-density nodes (around 100 Elasticsearch instances) experienced memory exhaustion and got stuck in a crash loop.
  • Diagnostic data was collected from an Elastic Agent on a node with ~70 allocated projects at the time of capture.

Environment

  • Elastic Agent version: 8.15-4-SNAPSHOT
  • Kubernetes environment: Serverless platform, MKI cluster
  • Node allocation: ~100 Elasticsearch instances per node for vector search projects

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions