Design to lower cloud costs of ducklake (TCO) #473

hpvd · 2025-09-24T10:19:43Z

Cost is always a topic, and since DuckLake targets "large scale" workloads with huge amounts of data, cost matters even more and small things may have a notable impact.

DuckLake already has a very efficient design.

But maybe we can optimize this further, and/or
open up optional configuration to calibrate the cost/performance sweet spot for

your different use cases and
the type of S3 storage you are using (Standard, Glacier, Express... which have different cost characteristics)

Where do the costs originate (cost types)?

A) running the metadata database
B) requests to S3 (PUT, GET, ...)
C) S3 storage
D) ...
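
To make the cost types above concrete, here is a minimal sketch of a monthly cost model. All names (`Prices`, `monthly_cost`) and the unit prices you pass in are hypothetical placeholders, not actual provider rates and not part of DuckLake; the point is only to show how A, B, and C add up so that trade-offs between them can be compared.

```python
from dataclasses import dataclass


@dataclass
class Prices:
    """Hypothetical unit prices; replace with your provider's actual rates."""
    catalog_per_hour: float  # A) metadata database instance
    per_1k_put: float        # B) write-type requests, per 1,000
    per_1k_get: float        # B) read-type requests, per 1,000
    per_gb_month: float      # C) stored data, per GB and month


def monthly_cost(p: Prices, hours: float, puts: int, gets: int, stored_gb: float) -> dict:
    """Split an estimated monthly bill into cost types A, B, and C."""
    a = p.catalog_per_hour * hours
    b = p.per_1k_put * puts / 1000 + p.per_1k_get * gets / 1000
    c = p.per_gb_month * stored_gb
    return {"A_catalog": a, "B_requests": b, "C_storage": c, "total": a + b + c}
```

Running this for two configurations (for example, with and without an extra index) shows whether a saving in B outweighs an increase in A for a given workload.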

Ideas:

Adding indexes to reduce query work may lower cost type B and may raise cost type A, see Adding indexes #389
Batching writes to S3 may lower cost type B
...
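
As a rough illustration of the batching idea, the sketch below counts how many PUT requests a stream of rows produces for a given flush size. It assumes, for simplicity, that each flush writes exactly one file with one PUT request (no multipart uploads); `put_requests` is a hypothetical helper and does not describe how DuckLake actually writes data.

```python
import math


def put_requests(total_rows: int, rows_per_flush: int) -> int:
    """Number of PUT requests, assuming one file (one PUT) per flush."""
    if rows_per_flush <= 0:
        raise ValueError("rows_per_flush must be positive")
    return math.ceil(total_rows / rows_per_flush)


# Flushing one million rows in small vs. large batches.
small_batches = put_requests(1_000_000, 1_000)    # 1000 requests
large_batches = put_requests(1_000_000, 100_000)  # 10 requests
```

Larger batches mean fewer requests (cost type B), at the price of data becoming visible later, so the right flush size depends on the use case.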

Which points, ideas, or approaches do you see?

Replies: 0 comments
