You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since cost is always a topic and since ducklake is going for "large scale" with huge amounts of data, cost is even more important and tiny things may have noteable impact.
Ducklake is already a very efficient design.
But maybe we can optimize this further or/and
open optional configurations to calibrate the cost/performance sweet spot for
your different usescases and
your type of S3 you are using (standard, glacier, express... which have different cost characteristics)
Where do cost have their origin (Cost types)?
A) running metadatabase
B) put, get... to S3
C) S3 storage
D) ...
Ideas:
Adding indexes to reduce query work may lower cost types B and may raise costs of type A, see Adding indexes #389
think about batch writing to S3 may lower cost Type B)
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Since cost is always a topic and since ducklake is going for "large scale" with huge amounts of data, cost is even more important and tiny things may have noteable impact.
Ducklake is already a very efficient design.
But maybe we can optimize this further or/and
open optional configurations to calibrate the cost/performance sweet spot for
Where do cost have their origin (Cost types)?
Ideas:
Which Points/Ideas/approaches do you see?
Beta Was this translation helpful? Give feedback.
All reactions