The `generated quantities` block is embarrassingly parallel over sampling iterations, but the current implementation of parallel compute in the cmdstanr method `generate_quantities()` is limited to the same `chains` and `threads_per_chain` arguments used for sampling (the latter only helps when the model uses `reduce_sum`). Here's how I've been working around this to employ as many cores as are available:
- Repeat `posterior::split_chains()` on the draws from the fitted object until the number of chains equals the number of cores wanted.
- In `generate_quantities()`, set `parallel_chains = nchains(x)`, with `threads_per_chain = 1` (if `reduce_sum` is used). For example:
```r
# pull draws and split chains to match the number of cores
x <- split_chains(f$draws())
x <- split_chains(x)
# ... split until you have enough chains for each core

# uncomment the generated quantities block and recompile;
# cpp_options aren't always needed, but shown here to demonstrate their use
m <- cmdstan_model('fit.stan', cpp_options = list(stan_threads = TRUE))

# use the new draws object
q <- m$generate_quantities(
  fitted_params = x,
  data = dat,
  parallel_chains = nchains(x),
  threads_per_chain = 1
)
```
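The repeated splitting could be wrapped in a small helper. This is only a sketch: `split_to_cores` and its `parallel::detectCores()` default are my own names and choices, not part of the cmdstanr or posterior API.

```r
library(posterior)

# Hypothetical helper: split chains until there are at least `cores` chains.
# split_chains() doubles the chain count each call (halving the iterations
# per chain), so the result lands on the first power-of-two multiple of the
# original chain count that is >= `cores`.
split_to_cores <- function(draws, cores = parallel::detectCores()) {
  while (nchains(draws) < cores) {
    draws <- split_chains(draws)
  }
  draws
}
```

With that, `fitted_params = split_to_cores(f$draws())` and `parallel_chains = nchains(x)` would replace the manual splitting above.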
Since cmdstanr already depends on posterior, it may be possible to bake something like the above into `generate_quantities()` so it makes better use of parallel compute natively, without the manual splitting. Otherwise, maybe this will serve as a tip. :)