See https://3.basecamp.com/5595649/buckets/32341364/todos/8649700954 for discussion
I added a new xrutils function for memory- and CPU-efficient multiplication of large matrices. xrutils.chunked_eff_xr_matmult performs the multiplication using numpy only, chunks up the data and streams the chunks to disk, then reassembles everything and puts it back into an xarray. The output should be identical to xarrayA @ xarrayB, but with lower memory and CPU overhead.
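For reviewers who want the gist without opening the diff, here is a minimal sketch of the approach. This is not the actual xrutils.chunked_eff_xr_matmult code: the function name chunked_matmul_sketch, the per-chunk .npy files, the chunk_size default, and the restriction to 2-D inputs with a single contraction dimension are all assumptions for illustration.

```python
import tempfile
from pathlib import Path

import numpy as np
import xarray as xr


def chunked_matmul_sketch(da_a, da_b, contract_dim, chunk_size=256):
    """Contract two 2-D DataArrays over `contract_dim`, chunk by chunk.

    The multiplication itself is plain numpy; row chunks of the result
    are streamed to disk and reassembled into a DataArray at the end.
    """
    # Put the contraction dim last in A and first in B so that a plain
    # numpy matmul applies: (m, k) @ (k, n) -> (m, n).
    a = da_a.transpose(..., contract_dim).values
    b = da_b.transpose(contract_dim, ...).values

    out_dim_a = [d for d in da_a.dims if d != contract_dim][0]
    out_dim_b = [d for d in da_b.dims if d != contract_dim][0]

    with tempfile.TemporaryDirectory() as tmp:
        chunk_files = []
        # Stream row chunks of the result to disk instead of holding
        # the full intermediate in memory.
        for i, start in enumerate(range(0, a.shape[0], chunk_size)):
            chunk = a[start:start + chunk_size] @ b
            path = Path(tmp) / f"chunk_{i:05d}.npy"
            np.save(path, chunk)
            chunk_files.append(path)

        # Reassemble while the temp files still exist.
        result = np.concatenate([np.load(p) for p in chunk_files], axis=0)

    # Wrap the plain array back into an xarray, carrying over coords
    # where the inputs have them.
    coords = {}
    if out_dim_a in da_a.coords:
        coords[out_dim_a] = da_a.coords[out_dim_a]
    if out_dim_b in da_b.coords:
        coords[out_dim_b] = da_b.coords[out_dim_b]
    return xr.DataArray(result, dims=(out_dim_a, out_dim_b), coords=coords)
```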
This function is then used in the updated forward_model.apply_inv_sensitivity whenever time series data with more than 1000 samples is passed in. This can be force-skipped by setting the optional "chunk" flag to False (True by default).
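For illustration, calls would look roughly like this. Only the chunk flag and the >1000-sample threshold come from this PR; the other argument names are placeholders:

```python
# Placeholder arguments; only the `chunk` keyword is the actual new flag.
result = forward_model.apply_inv_sensitivity(timeseries, inv_sens)               # chunked path kicks in for >1000 samples
result = forward_model.apply_inv_sensitivity(timeseries, inv_sens, chunk=False)  # force the conventional path
```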
I did some testing comparing the output of the conventional and the new chunked conversion, and (ignoring very small floating-point errors) both yield the same result.
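The comparison amounted to something along these lines, reusing the sketch from above; the input names, the "channel" contraction dim, and the tolerance are illustrative:

```python
# Conventional xarray matmul vs. the chunked path.
reference = da_a @ da_b
chunked = chunked_matmul_sketch(da_a, da_b, contract_dim="channel")

# Differences should be on the order of floating-point round-off.
max_abs_err = float(abs(reference - chunked).max())
assert max_abs_err < 1e-10, max_abs_err
```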
Maybe it's a quick review and you can merge it; otherwise, this can wait until later.