Scalable conversion RFC by GiedriusS · Pull Request #62 · thanos-io/thanos-parquet-gateway

GiedriusS · 2026-02-17T15:25:26Z

Jotting down some of my thoughts and sharing with others. Maybe they are stupid, maybe they are not. Let me know your thoughts.

xiu

I think this makes sense. I understand that the queue is there to loosen the coupling between the planner and the converters, though I wonder whether we intend the planner to run over a small set of converters or rather over several block streams.
If they run colocated in the same Kubernetes cluster, say, would it be simpler to have the planner hand a short list of blocks to convert (2-3) to each converter via gRPC after discovery, committing the state to S3 so that it can recover after a restart?
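
A minimal sketch of that push-based idea, to make the recovery part concrete. Everything here is an assumption for illustration: the `Store` interface is a hypothetical stand-in for an S3 bucket (backed by memory so the example runs), `planner/state.json` is a made-up object key, and the gRPC call that would actually deliver the batch to a converter is left out.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Store is a hypothetical stand-in for an object storage bucket such as S3.
type Store interface {
	Put(key string, data []byte) error
	Get(key string) ([]byte, error)
}

// memStore keeps objects in memory so the sketch runs without a real bucket.
type memStore map[string][]byte

func (m memStore) Put(key string, data []byte) error { m[key] = data; return nil }

func (m memStore) Get(key string) ([]byte, error) {
	d, ok := m[key]
	if !ok {
		return nil, fmt.Errorf("object %q not found", key)
	}
	return d, nil
}

// State is what the planner commits: pending blocks and per-converter assignments.
type State struct {
	Pending  []string            `json:"pending"`
	Assigned map[string][]string `json:"assigned"`
}

const stateKey = "planner/state.json" // hypothetical object key

// Assign hands up to batch pending blocks to each converter and persists the
// result before returning, so a restarted planner can reload who holds what.
func Assign(s Store, st *State, converters []string, batch int) error {
	for _, c := range converters {
		n := batch
		if n > len(st.Pending) {
			n = len(st.Pending)
		}
		if n <= 0 {
			break
		}
		st.Assigned[c] = append(st.Assigned[c], st.Pending[:n]...)
		st.Pending = st.Pending[n:]
	}
	data, err := json.Marshal(st)
	if err != nil {
		return err
	}
	return s.Put(stateKey, data)
}

// Load restores the last committed state after a planner restart.
func Load(s Store) (*State, error) {
	data, err := s.Get(stateKey)
	if err != nil {
		return nil, err
	}
	st := &State{}
	if err := json.Unmarshal(data, st); err != nil {
		return nil, err
	}
	return st, nil
}

func main() {
	store := memStore{}
	st := &State{
		Pending:  []string{"b1", "b2", "b3", "b4", "b5"},
		Assigned: map[string][]string{},
	}
	if err := Assign(store, st, []string{"conv-a", "conv-b"}, 2); err != nil {
		panic(err)
	}
	restored, err := Load(store)
	if err != nil {
		panic(err)
	}
	fmt.Println(restored.Assigned, restored.Pending)
}
```

Committing the state before the batch is sent means a crash between the two steps leaves blocks marked as assigned that no converter received, so a real planner would still need some way to time those out.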

adezxc · 2026-02-23T11:42:17Z

The whole concept makes sense.
One thing that came to mind: could the planner be responsible only for keeping a queue of work that has to be done, with all workers requesting work via e.g. /api/get_work?size=3 along with their metadata (IP address, hostname, etc.), and the planner sending heartbeats to the workers it knows have been assigned work?

If a worker is unavailable for X minutes, the planner marks its work items as failed and reports them as available again. If a worker is done with a conversion, it just reports success to the planner, and the planner removes the item from the queue.

Also as a consideration: would a worker only work with one objstore directory? This also means we'd have to validate whether a worker can do the work given by the planner, right? I was considering whether workers could be universal, only requiring multiple objstore configs to accept work from multiple planners, but IMO that brings too much complexity(?)

GiedriusS · 2026-03-20T09:50:33Z

Michael shared this with me: https://turbopuffer.com/blog/object-storage-queue. We can use it for inspiration.

GiedriusS added 2 commits February 17, 2026 14:04

docs: rm CIRCLECI_SETUP

6b48176

It provides no value.

rfcs: add scalable converter

f220a65

Jot down some of my thoughts and share it with others.

xiu reviewed Feb 18, 2026

GiedriusS wants to merge 2 commits into main from scalable_conversion
