Is this a duplicate?
Area
CUB
Is your feature request related to a problem? Please describe.
Currently, cub::DeviceReduce accepts num_items as an integer. Sometimes, problem size is not available at host side.
Describe the solution you'd like
This issue can be closed with a PR adding device-resident problem size support in device reduce:
int *num_items = ...;
cub::DeviceReduce(..., cuda::args::deferred{num_items})
Test should include passing a fancy (say, transform) iterator into deferred.
Describe alternatives you've considered
No response
Additional context
No response
Is this a duplicate?
Area
CUB
Is your feature request related to a problem? Please describe.
Currently,
cub::DeviceReduceacceptsnum_itemsas an integer. Sometimes, problem size is not available at host side.Describe the solution you'd like
This issue can be closed with a PR adding device-resident problem size support in device reduce:
Test should include passing a fancy (say, transform) iterator into
deferred.Describe alternatives you've considered
No response
Additional context
No response