
Optimized memory usage and added multithreading to 1-D reflected light calculations #402

Closed

Nicholaswogan wants to merge 32 commits into natashabatalha:dev from Nicholaswogan:memory

Conversation

@Nicholaswogan
Collaborator

This PR should be accepted after #399.

Summary

This branch refactors the 1-D reflected-light radiative-transfer path to reduce allocations and peak memory usage while preserving the existing outputs. It also enables multithreaded reflected-light execution by parallelizing over wavelength. The PR adds unit tests demonstrating that the new non-allocating version of get_reflected_1d produces the same results (over all if branches) as the old allocating get_reflected_1d. Compared to the old implementation, the new implementation in this PR uses about half the peak memory and is a factor of 2 faster on 1 thread.

This PR focuses on get_reflected_1d, but the same approach could easily be extended to other RT calculations (3-D, thermal, and transmission).

Peak memory use (and code runtime) can be reduced further by reworking get_opacities and compute_opacities; however, the gains there are probably less substantial than those in this PR.

Relevant to #263
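The parity tests described above can be sketched as follows. This is a hedged illustration only: the function bodies are trivial stand-ins for the real get_reflected_1d, and the point is the testing pattern, comparing the old allocating implementation against a new version that writes into a caller-provided buffer.

```python
# Illustrative parity-test pattern (NOT the actual PICASO test code):
# compare an old allocating routine against a new buffer-reusing one.
import numpy as np

def reflected_1d_old(tau, w0):
    # Stand-in for the old implementation: allocates a fresh result array.
    return np.exp(-tau) * w0

def reflected_1d_new(tau, w0, out=None):
    # Stand-in for the new implementation: writes into a caller-provided
    # buffer instead of allocating, so repeated calls do not churn memory.
    if out is None:
        out = np.empty_like(tau)
    np.exp(-tau, out=out)
    out *= w0
    return out

rng = np.random.default_rng(0)
tau = rng.random((10, 50))   # (layers, wavelength points)
w0 = rng.random((10, 50))

workspace = np.empty_like(tau)
assert np.allclose(reflected_1d_old(tau, w0),
                   reflected_1d_new(tau, w0, out=workspace))
```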

What changed

  • Added a no-allocation reflected-light implementation in picaso/fluxes_noalloc.py.
  • Introduced GetReflected1D and GetReflected1DWorkspace to hold persistent results and per-thread scratch space.
  • Reworked the 1D reflected-light path to stream over wavelength and reuse workspace buffers.
  • Added thread-safe parallel execution over wavelength with per-thread workspaces.
  • Updated justdoit.py to call the new class-based reflected solver.
  • Updated the climate reflected-light path to use a single persistent reflected workspace across the Gauss-angle loop.
  • Added and updated tests for:
    • tridiagonal solver parity
    • setup routine parity
    • 1D reflected-light parity
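The workspace pattern in the list above can be sketched as below. Class names mirror the PR's GetReflected1D and GetReflected1DWorkspace, but the fields and the toy solve loop are assumptions for illustration, not the actual PICASO code.

```python
# Minimal sketch of a persistent-workspace solver: scratch buffers are
# allocated once (one workspace per thread) and reused while streaming
# over wavelength, so the hot loop performs no allocations.
import numpy as np

class GetReflected1DWorkspace:
    """Per-thread scratch space plus a persistent result buffer."""
    def __init__(self, nlayer, nwno):
        self.scratch = np.empty(nlayer)       # reused at every wavelength
        self.flux = np.empty((nlayer, nwno))  # persistent result

class GetReflected1D:
    """Holds one workspace per thread; a parallel wavelength loop
    (e.g. numba's prange) would index by thread id."""
    def __init__(self, nlayer, nwno, nthreads=1):
        self.workspaces = [GetReflected1DWorkspace(nlayer, nwno)
                           for _ in range(nthreads)]

    def solve(self, tau):
        nlayer, nwno = tau.shape
        ws = self.workspaces[0]      # serial case: thread 0's workspace
        for i in range(nwno):        # stream over wavelength
            np.exp(-tau[:, i], out=ws.scratch)  # reuse scratch buffer
            ws.flux[:, i] = ws.scratch
        return ws.flux
```

Per-thread workspaces are what make the wavelength loop thread-safe: no two threads ever write to the same scratch buffer.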

Comparison to old code

I ran a test case that computes five reflected light spectra of a clear-sky modern Earth. The table below summarizes the runtime and memory usage of both the old get_reflected_1d and the new implementation in this PR.

| Version of get_reflected_1d | Threads | Runtime | Allocations | Peak memory | Total memory |
|---|---|---|---|---|---|
| Old | 1 | 53.9 s | 25,935,610 | 5.760 GB | 166.907 GB |
| New | 1 | 25.0 s | 743,485 | 2.395 GB | 37.089 GB |
| New | 4 | 11.8 s | 743,520 | 2.395 GB | 37.089 GB |
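For readers who want to reproduce this kind of comparison, numbers like runtime and peak memory can be gathered with the standard library alone. This is a generic measurement sketch, not the benchmark script used for the table; run_case is a stand-in for the actual five-spectrum Earth calculation.

```python
# Generic runtime / peak-memory measurement using only the stdlib
# (plus NumPy for a stand-in workload).
import time
import tracemalloc
import numpy as np

def run_case():
    # Stand-in for "compute five reflected light spectra".
    return [np.exp(-np.random.random((100, 1000))) for _ in range(5)]

tracemalloc.start()
t0 = time.perf_counter()
run_case()
runtime = time.perf_counter() - t0
current, peak = tracemalloc.get_traced_memory()
tracemalloc.stop()
print(f"runtime = {runtime:.3f} s, peak memory = {peak / 1e6:.1f} MB")
```

Note that tracemalloc only sees Python-level allocations; allocations inside compiled (e.g. numba-jitted) code need an external profiler.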

@Nicholaswogan
Collaborator Author

Note that multithreading defaults to 1 thread (i.e., it is not parallel), to be consistent with previous versions of PICASO and to prevent multithreading within an already-parallel calculation (e.g., a retrieval). The user has to explicitly turn on multithreading with numba.set_num_threads(num_threads), where num_threads is the number of threads to use.

Comment thread picaso/justdoit.py
raise Exception("The only available opacity methods are: resortrebin, preweighted, and resampled")
return opacityclass

class RTSolvers:
Owner
Is this a prelude to keeping track of memory allocation?

Collaborator Author

@Nicholaswogan Apr 23, 2026

The idea with RTSolvers is to preserve the work memory there (attached to the inputs class) needed for any RT calculation. This means that work memory does not need to be allocated and deallocated on every call to a given RT algorithm (which is slow). Instead, the allocation happens once.

Collaborator Author

However, the work memory needed for the new get_reflected_1d that I wrote is very small compared to what is needed for the old version.

So you probably wouldn't take much of a performance hit if you just allocated/deallocated the work memory on the fly.
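The tradeoff being discussed (allocate-once workspace vs. allocating on the fly) can be illustrated with a toy timing loop; sizes and iteration counts are arbitrary, and real results depend on the allocator and array sizes.

```python
# Toy comparison: allocating a buffer every call vs. reusing one
# persistent workspace buffer.
import time
import numpy as np

n, iters = 1000, 2000
buf = np.empty(n)  # persistent workspace, allocated once

t0 = time.perf_counter()
for _ in range(iters):
    tmp = np.empty(n)   # fresh allocation every call
    tmp[:] = 1.0
alloc_time = time.perf_counter() - t0

t0 = time.perf_counter()
for _ in range(iters):
    buf[:] = 1.0        # reuse the persistent buffer
reuse_time = time.perf_counter() - t0

print(f"alloc: {alloc_time:.4f} s, reuse: {reuse_time:.4f} s")
```

For small buffers the two are often close, which matches the point above: when the workspace is small, allocating on the fly costs little.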

@Nicholaswogan
Collaborator Author

Nicholaswogan commented Apr 24, 2026

Closed, because I'm redoing this in a simpler way.
