Explore flox for improving Xarray's groupby() performance
#490
tomvothecoder
started this conversation in
3. Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Is your feature request related to a problem?
According to Xarray's documentation,
groupby()andresample()triggers computation on all blocks which means it loads all data into memory and does not operate on them lazily. This results in a costly operation.This is where
floxcomes along:Describe the solution you'd like
floxwith xCDAT APIsIf performance increases are substantial:
floxas an xCDAT dependencygroupby()calls are made internallyDescribe alternatives you've considered
floxto speed upgroupbyoperations, including spatial and temporal averaging.Additional context
One challenge might be that we need to constraint the minimum version of xarray to
>=2022.6.0because that is whenfloxwas first implemented in xarray.Beta Was this translation helpful? Give feedback.
All reactions