Increase performance of `collection` deepMap and deepForEach with DenseMatrices #3409

dvd101x · 2025-03-01T04:38:03Z

Hi, this PR increases performance of deepMap and deepForEach when iterating over a DenseMatrix in collections by using a non indexed version of ._forEach and also forcing the optimizeCallback function to return a function with one argument.

dvd101x · 2025-03-01T14:48:32Z

I also would like to mention that currently the function norm, uses the second argument of DenseMatrix.forEach as a skipZeros, but it doesn't work.

That's why it is an argument here, but no implementation is made.

The next step of addressing #3262 is for optimizeCallback to return arity information so that the fast non indexed iteration can be used if possible. ~~I will address that in another PR.~~

dvd101x · 2025-03-02T07:07:42Z

In this latest commit, the non indexed version of the iteration functions is implemented when possible, with these benefits (about 25% faster as a maximum)

This includes an arity check for regular functions as stated at #3262 which I think was accurate and concerned about the big gap between abs(array) and the rest.

I think this PR could open a discussion about a state of diminishing returns due to the indexed methods being previously optimized by various PR, if we compared with V13.1.1 before all the PR improving the indexed algorithms the benefits are huge.

I'm open to any discussions this data might arise as I found #3262 intriguing, otherwise I think this PR could be implemented as such.

dvd101x · 2025-03-02T17:29:15Z

Also I would like to know if it's better to pass an options object to skipIndex and skipZeroes and include that in the documentation for DenseMatrix methods or if it's better like individual boolean parameters.

josdejong

We've made huge improvements since v13.1.1! I guess we've addressed all the low hanging fruit now, so maybe time to stop 😜. The 25% improvment in this PR is worth it I think, since map and forEach are used a lot.

I made a couple of inline comments, can you have a look at those?

src/type/matrix/DenseMatrix.js

src/utils/optimizeCallback.js

src/type/matrix/DenseMatrix.js

dvd101x · 2025-03-06T17:34:20Z

Thanks, I looked into your comments and I will address them individually, in my notes I recall another bottleneck and that's it.

The bottleneck occurs when the callback optimization is run twice and sometimes is not run. I'm considering adding attributes to callbacks and arrays, the eliminate the need to pass extra arguments in some functions. I will review this and maybe next week have a response.

josdejong · 2025-03-07T09:48:45Z

The bottleneck occurs when the callback optimization is run twice and sometimes is not run.

Would be good to have a look at that, but please do a quick check first to see if the callback optimization is indeed a bottleneck, I would expect the callback optimization to take a fraction of the time compared to running the callback for every element on a (large) matrix, so maybe there is not much to gain there.

dvd101x · 2025-03-08T02:05:51Z

but please do a quick check first to see if the callback optimization is indeed a bottleneck

Thank's, my intuition was that maybe if it does a double try catch on each iteration and a double check for the number of arguments might affect in some way.... but thinking about it, in many cases it returns a regular function calling the typed function with the improved error message, and since the improved error message happens only for typed functions I think it should be ok. Nonetheless, I might do a quick check.

In the latest commit most comments are addressed and these are the benchmarks.

Just a few comments:

I kept the use of maxDepth as it did affect performance
I think it's best to wait for Improve performance of flatten in DenseMatrix #3400 and then use Matrix.forEach since there is no SparseMatrix._forEach.
About removing the second argument skipZeros in DenseMatrix I just noticed that SparseMatrix uses the second argument skipZeros and it's also part of the jsdocs of Matrix.map. So I don't know if this changes anything.

dvd101x · 2025-03-18T16:16:46Z

As a separate topic, at some point I would like to address the duplicate logic as suggested at #3266

In #3251, @Galm007 has created a single _forEach method used by both map and forEach methods to deduplicate the logic. It may be worth trying to let deepMap use deepForEach under the hood to deduplicate logic of this PR too. What do you think?

I'm thinking of including that after this PR without affecting performance significantly.

josdejong

I see bigger performance improvements than in your last chart, that is a good thing :)

Merging the PR now.

dvd101x · 2025-03-27T13:59:01Z

Hi, I noticed that the HISTORY.md was updated with this PR #3409, Jos approved the change and mentioned this is ready to merge. This is the first time I click the button to merge, so please let me know if I'm missing something.

josdejong · 2025-03-28T12:52:06Z

Ah, you're right. I guess I had to wait before all the checks fiinshed and forgot to press the button in the end 😅.

Congrets with your first merge-button-press 😁

josdejong · 2025-03-28T13:08:58Z

@dvd101x for a next time, when merging a PR, can you click "Squash and Merge"? That way all commits are merged into a single one, so in the git history we see 1 commit for the PR, which is much cleaner.

josdejong · 2025-03-28T13:27:16Z

Published now in v14.4.0

dvd101x · 2025-04-01T15:04:33Z

Hi Jos, yes of course, I will do that next time.

Thank you

feat: add unary support to map and forEach methods in DenseMatrix

637fe39

feat: return arity and use it for faster alrogithms

7c154a0

Refactor to extend ._forEach instead of making a new method.

137b0c7

josdejong requested changes Mar 6, 2025

View reviewed changes

dvd101x added 2 commits March 6, 2025 21:53

Changed algorithm to avoid the use of maxDepth

cc03d30

Returned the use of maxDepth as it was slower to check for an array.

3700e9b

Add comments and simplify the code with regular expressions.

3c48eeb

Merge branch 'develop' into deepMap-perforance-fix-3

bd5799c

josdejong approved these changes Mar 19, 2025

View reviewed changes

dvd101x added 2 commits March 21, 2025 10:20

Merge branch 'develop' into deepMap-perforance-fix-3

459ea62

Merge branch 'develop' into deepMap-perforance-fix-3

2f62fda

dvd101x merged commit 6907614 into develop Mar 27, 2025
15 checks passed

dvd101x deleted the deepMap-perforance-fix-3 branch March 27, 2025 13:55

Uh oh!

Increase performance of collection deepMap and deepForEach with DenseMatrices #3409

Increase performance of collection deepMap and deepForEach with DenseMatrices #3409

Uh oh!

Conversation

dvd101x commented Mar 1, 2025

Uh oh!

dvd101x commented Mar 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dvd101x commented Mar 2, 2025

Uh oh!

dvd101x commented Mar 2, 2025

Uh oh!

josdejong left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dvd101x commented Mar 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

josdejong commented Mar 7, 2025

Uh oh!

dvd101x commented Mar 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dvd101x commented Mar 18, 2025

Uh oh!

josdejong left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dvd101x commented Mar 27, 2025

Uh oh!

josdejong commented Mar 28, 2025

Uh oh!

josdejong commented Mar 28, 2025

Uh oh!

josdejong commented Mar 28, 2025

Uh oh!

dvd101x commented Apr 1, 2025

Uh oh!

Uh oh!

Increase performance of `collection` deepMap and deepForEach with DenseMatrices #3409

Increase performance of `collection` deepMap and deepForEach with DenseMatrices #3409

dvd101x commented Mar 1, 2025 •

edited

Loading

dvd101x commented Mar 6, 2025 •

edited

Loading

dvd101x commented Mar 8, 2025 •

edited

Loading

josdejong left a comment •

edited

Loading