Fix partial any/all reduction by simonbyrne · Pull Request #959 · NVIDIA/MatX

simonbyrne · 2025-05-08T18:25:02Z

Fixes #931.

cliffburdick · 2025-05-08T18:32:18Z

/build

cliffburdick · 2025-05-08T18:46:38Z

Looks good! Did you happen to reproduce the original bug?

simonbyrne · 2025-05-08T18:55:09Z

Yes, it occurs when you do a partial reduction of a tensor rank greater than 2 (basically when CUB isn't used).

cliffburdick · 2025-05-08T18:57:35Z

Yes, it occurs when you do a partial reduction of a tensor rank greater than 2 (basically when CUB isn't used).

Since this code hasn't been touched in a while, is there a reason we can't use CUB in that case? We used to not use CUB in that situation before we had operators as input into CUB, now using ReduceInput we should support all ranks.

simonbyrne · 2025-05-08T19:05:01Z

ah, I thought CUB required contiguous segments. Let me see if I can figure it out.

simonbyrne · 2025-05-08T22:36:42Z

Okay, it took me a bit to figure out how it all worked, but my reading of it is that we can now use CUB for all reductions.

simonbyrne · 2025-05-08T23:09:02Z

Okay, I've deleted all the non-CUB reduction code.

cliffburdick · 2025-05-15T21:37:12Z

/build

cliffburdick · 2025-05-19T15:02:36Z

Looks like docs are failing:


/home/jenkins/workspace/unit-tests/docs_input/api/manipulation/selecting/reduce.rst:8: WARNING: doxygenfunction: Unable to resolve function "reduce" with arguments (OutType, TensorIndexType, const InType&, ReduceOp, cudaStream_t, bool) in doxygen xml output for project "MatX" from directory: /home/jenkins/workspace/unit-tests/build/docs_input/doxygen/xml.
Potential matches:
- template<typename InType, int D, typename ReduceOp> __MATX_INLINE__ auto reduce(const InType &in, const int (&dims)[D], ReduceOp op, bool init = true)
- template<typename InType, typename ReduceOp> __MATX_INLINE__ auto reduce(const InType &in, ReduceOp op, bool init = true)
- template<typename OutType, typename InType, typename ReduceOp> void __MATX_INLINE__ reduce(OutType dest, const InType &in, ReduceOp op, cudaStream_t stream = 0, bool init = true) [docutils]
- ```

simonbyrne · 2025-05-19T18:24:50Z

Ah, I see what happened. So it turns out there are two different reduce methods

the MatX operators in operators/reduce.h
- they don't appear in the current documentation
the mutating ones in transforms/reduce.h where the output tensor is provided as an argument to the function
- are not actually MatX operators: e.g. they accept a stream as an argument, instead of returning an operator and calling run().
- the first of these is mentioned in the docs
- are removed by this PR

I can either remove 2 completely, or simply make it fall back on calling the operator?

cliffburdick · 2025-05-19T18:43:59Z

Ah, I see what happened. So it turns out there are two different reduce methods

the MatX operators in operators/reduce.h

they don't appear in the current documentation

the mutating ones in transforms/reduce.h where the output tensor is provided as an argument to the function

are not actually MatX operators: e.g. they accept a stream as an argument, instead of returning an operator and calling run().

the first of these is mentioned in the docs

are removed by this PR

I can either remove 2 completely, or simply make it fall back on calling the operator?

Let's remove it from the docs and only have the operator

cliffburdick · 2025-05-19T19:42:52Z

/build

Fix partial any/all reduction

749864b

simonbyrne force-pushed the sbyrne/reduce-all branch from e777e9e to 749864b Compare May 8, 2025 18:25

cliffburdick reviewed May 8, 2025

View reviewed changes

Comment thread include/matx/transforms/reduce.h Outdated

simonbyrne force-pushed the sbyrne/reduce-all branch from 5533afc to 749864b Compare May 8, 2025 18:53

fix comment

59412bc

simonbyrne force-pushed the sbyrne/reduce-all branch from 25ab8bc to 59412bc Compare May 8, 2025 18:55

use CUB for all reductions

e0d0e9a

simonbyrne commented May 8, 2025

View reviewed changes

Comment thread include/matx/transforms/reduce.h Outdated

delete custom reduction kernels

d022b44

simonbyrne requested a review from cliffburdick May 15, 2025 21:29

cliffburdick approved these changes May 15, 2025

View reviewed changes

fix reduce docs

ab2c961

cliffburdick merged commit 266b2a3 into NVIDIA:main May 19, 2025
1 check passed

simonbyrne deleted the sbyrne/reduce-all branch May 20, 2025 17:01

cliffburdick mentioned this pull request Jun 4, 2025

Refactor to use thrust::reduce on any. #685

Closed

Conversation

simonbyrne commented May 8, 2025

Uh oh!

cliffburdick commented May 8, 2025

Uh oh!

Uh oh!

cliffburdick commented May 8, 2025

Uh oh!

simonbyrne commented May 8, 2025

Uh oh!

cliffburdick commented May 8, 2025

Uh oh!

simonbyrne commented May 8, 2025

Uh oh!

simonbyrne commented May 8, 2025

Uh oh!

Uh oh!

simonbyrne commented May 8, 2025

Uh oh!

cliffburdick commented May 15, 2025

Uh oh!

cliffburdick commented May 19, 2025

Uh oh!

simonbyrne commented May 19, 2025

Uh oh!

cliffburdick commented May 19, 2025

Uh oh!

cliffburdick commented May 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants