Skip to content

Bug: Increased Distributor latency in 2.15 #10717

Open
@EoinFarrell

Description

@EoinFarrell

What is the bug?

Hey Grafana team,

We have an alert that fires when we see latency above 1 second from the distributor using the below query:

histogram_quantile(0.99, avg by (le) (rate(cortex_request_duration_seconds_bucket{job=~"(cortex)/((distributor|cortex|mimir|mimir-write))",route=~"/distributor.Distributor/Push|/httpgrpc.*|api_(v1|prom)_push|otlp_v1_metrics"}[5m]))) > 1

Since upgrading to 2.15 this alert has been consistently firing. We have noticed a large bump in overall performance in Mimir, but we're wondering if increased latency is to be expected as a possible side effect ?

Side Note: Thanks for all the great work, our team was delighted with the performance improvements we've seen overall in 2.15 aside from this latency jump.

How to reproduce it?

Mimir 2.15

What did you think would happen?

N/A

What was your environment?

Kuberenetes

Any additional context to share?

No response

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions