Conversation
Signed-off-by: Aleksandr Zimin <alexandr.zimin@flant.com>
Sorry for the late response.
That part made sense to me. Given the way the ServiceMonitor resource works, the metrics for the non-leader controllers would appear to be down (I guess they are down, technically).
I don't quite get how this fixes the issue. We only configure a second …
Apologies for the confusion. Let me provide more details on how this fix addresses the issue.

By replacing the PodMonitor with a ServiceMonitor, we change the way Prometheus discovers scrape targets. With a PodMonitor, Prometheus attempts to scrape metrics from every pod, which causes problems when multiple replicas of the LINSTOR controller are present. With a ServiceMonitor, Prometheus scrapes the endpoints of the service rather than each individual pod. This resolves the problem of having only one target up in Prometheus when multiple replicas are deployed.

Additionally, "secured" here refers to using an RBAC proxy to secure the metrics-collection traffic. When the RBAC proxy is in use, the metrics are served on a different port, which needs to be added to the endpoints of the LINSTOR controller service. By configuring the ServiceMonitor to scrape that additional port, the secured metrics are collected as well.

I hope this clarifies how the changes address the issue and the relevance of the "secured" aspect. Please let me know if you have any further questions or concerns.
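To make the PodMonitor-to-ServiceMonitor switch concrete, here is a minimal sketch of what such a ServiceMonitor could look like. All names (the `linstor-controller` label, the `secured-metrics` port name, the namespace) are illustrative assumptions, not the operator's actual manifests:

```yaml
# Hypothetical ServiceMonitor scraping the LINSTOR controller service
# on a secured (RBAC-proxied, HTTPS) metrics port.
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: linstor-controller        # illustrative name
  namespace: monitoring           # illustrative namespace
spec:
  selector:
    matchLabels:
      app.kubernetes.io/name: linstor-controller  # assumed service label
  endpoints:
    - port: secured-metrics       # assumed named port on the service
      scheme: https               # RBAC proxy terminates TLS
      bearerTokenFile: /var/run/secrets/kubernetes.io/serviceaccount/token
```

Because the ServiceMonitor targets the service's endpoints rather than every pod, only the pods actually backing the service are scraped.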
If a LINSTOR controller has more than one replica, only one target will be up in Prometheus. This is because of the leader election mechanism for LINSTOR controllers: once a leader is elected, the other controller pods do not serve any metrics.
This PR addresses the issue when using a ServiceMonitor. When the securedMetricsPort parameter is set, the operator will add a metrics port to the LINSTOR controller service and the LINSTOR controller endpoints.
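The service change described above could look roughly like the following sketch. The port name and number are assumptions for illustration; the actual values would come from the operator and the securedMetricsPort setting:

```yaml
# Hypothetical LINSTOR controller Service after the operator adds the
# secured metrics port (names and port number are illustrative).
apiVersion: v1
kind: Service
metadata:
  name: linstor-controller        # illustrative name
spec:
  selector:
    app.kubernetes.io/name: linstor-controller  # assumed pod label
  ports:
    - name: api
      port: 3370                  # LINSTOR controller REST API port
    - name: secured-metrics       # port added when securedMetricsPort is set
      port: 9999                  # assumed value of securedMetricsPort
      targetPort: 9999
```

A ServiceMonitor can then reference the secured-metrics port by name, so Prometheus scrapes only the endpoints behind the service instead of every controller pod.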