Performance issues updating from Bitnami Thanos image to Quay Thanos #8471
Unanswered
nb-ccd
asked this question in
Questions & Answers
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
We have been running Thanos 0.32.1 through the Bitnami Thanos chart (https://charts.bitnami.com/bitnami) using their default image. In response to their new image policy which introduces fees, we want to move using the image published by the Thanos team themselves to Quay.
However.. we've run into some major performance issues moving to the Quay image of the same version: Grafana is getting a large number of HTTP 504s from the Thanos Query endpoint, and I see the same when port forwarding to query the Thanos endpoint directly:
Or alternatively:
Looking in Thanos Query's logs I see repeated DeadlineExceeded messages:
I don't see anything logged relevant to this in the Prometheus pod or its Thanos sidecar.
I don't see any differences in the Dockerfiles between the Quay and Bitnami images in terms of runtime arguments. The docker images at the version I'm using are here:
There aren't changes to configuration or different environment variables in the image, if there are changes looks like it's at compile time and so not represented in the Dockerfile.
Has anyone else who has migrated run into similar issues, or any expert users got advice on how to debug these timeouts?
(Side note: I realise we are running an older Thanos version, happy to upgrade if there is reason to believe it would help, but our preference was to stick with the current application version while making the initial change between images to reduce the number of factors changing.)
Beta Was this translation helpful? Give feedback.
All reactions