Microoptimize the walk() function #587

Merged · 1 commit merged on Jan 8, 2025
Conversation

@kmod (Contributor) commented on Jan 8, 2025

I noticed that it takes pynvim about 4ms to attach to an nvim instance for me, and 3ms of that is due to the single line:
metadata = walk(decode_if_bytes, metadata)

This commit reduces the walk() time to 1.5ms, which brings the total attach time down to 2.5ms. This is helpful because in my use case I end up connecting to all of the currently-running nvim processes, and attaching starts to take a noticeable amount of time. Unfortunately, parallelization does not help here due to the nature of the slowness.
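For context, the cost can be reproduced with a micro-benchmark along these lines. This is a sketch: `decode_if_bytes` is a simplified stand-in for pynvim's helper, the `walk()` shown is the pre-optimization shape, and the `metadata` structure is made up to loosely resemble nvim API metadata.

```python
import timeit

def decode_if_bytes(obj):
    # Simplified stand-in for pynvim's decode_if_bytes helper.
    return obj.decode("utf-8") if isinstance(obj, bytes) else obj

def walk(fn, obj):
    # Pre-optimization shape: recurse into containers, apply fn to leaves.
    if type(obj) in [list, tuple]:
        return type(obj)(walk(fn, o) for o in obj)
    if type(obj) is dict:
        return {walk(fn, k): walk(fn, v) for k, v in obj.items()}
    return fn(obj)

# A nested structure loosely shaped like nvim API metadata (illustrative only).
metadata = {"functions": [{"name": b"nvim_eval",
                           "parameters": [[b"Object", b"expr"]]}] * 500}

elapsed = timeit.timeit(lambda: walk(decode_if_bytes, metadata), number=100)
print(f"100 walks took {elapsed:.3f}s")
```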

walk() is expensive because it does a large amount of pure-Python object manipulation, so this commit makes a few tweaks to reduce the overhead:

  • `*args` and `**kw` make the function call slow; we can avoid needing them by pre-packing the arguments into `fn` via `functools.partial`
  • The comprehensions can directly construct the result objects rather than create a generator that is passed to a constructor
  • The typechecking is microoptimized by calling `type()` once and unrolling the `type_ in [list, tuple]` check
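Taken together, the three tweaks above might look something like this. This is a sketch of the idea, not necessarily the exact merged code; `decode_if_bytes` and its `mode` parameter are simplified stand-ins for pynvim's real helper.

```python
import functools

def decode_if_bytes(obj, mode=True):
    # Simplified stand-in for pynvim's decode_if_bytes helper.
    return obj.decode("utf-8") if isinstance(obj, bytes) else obj

def walk(fn, obj):
    """Recursively apply fn to the leaves of nested lists/tuples/dicts."""
    obj_type = type(obj)               # call type() only once
    if obj_type is list:               # unrolled `type_ in [list, tuple]` check
        return [walk(fn, o) for o in obj]          # direct list comprehension
    if obj_type is tuple:
        return tuple(walk(fn, o) for o in obj)
    if obj_type is dict:
        return {walk(fn, k): walk(fn, v) for k, v in obj.items()}
    return fn(obj)

# Instead of threading *args/**kw through every recursive call,
# pre-pack the extra arguments into fn with functools.partial:
fn = functools.partial(decode_if_bytes, mode=True)
metadata = walk(fn, {b"name": [b"nvim_eval", (b"expr",)]})
# -> {'name': ['nvim_eval', ('expr',)]}
```

The `functools.partial` step is what lets the recursive calls stay `walk(fn, o)` with a fixed arity, which CPython dispatches noticeably faster than calls that unpack `*args`/`**kw` at every level.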

I did notice that in my setup the metadata contains no bytes objects, so the entire call is a no-op. I'm not sure if that is something that could be relied on or detected, which could be an even bigger speedup.
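One hypothetical way to exploit the no-op case (not part of this PR, just a sketch of the idea) is an identity-preserving walk that returns the original containers untouched when no leaf changed. It still has to traverse everything, so the win would only be the avoided allocations, not the recursion itself:

```python
def walk_preserving(fn, obj):
    # Hypothetical variant: returns `obj` itself when nothing changed,
    # so a bytes-free metadata tree reuses the existing containers.
    obj_type = type(obj)
    if obj_type is list or obj_type is tuple:
        new = [walk_preserving(fn, o) for o in obj]
        if all(a is b for a, b in zip(new, obj)):
            return obj                       # no-op: reuse the original
        return new if obj_type is list else tuple(new)
    if obj_type is dict:
        # (a full identity check for dicts is omitted for brevity)
        return {walk_preserving(fn, k): walk_preserving(fn, v)
                for k, v in obj.items()}
    return fn(obj)
```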

fix #250

@kmod (Contributor, Author) commented on Jan 8, 2025

I believe this is the same issue as was talked about in #250

@justinmk (Member) left a comment

Nice, thanks!

@justinmk merged commit e5ce595 into neovim:master on Jan 8, 2025
4 of 25 checks passed
@justinmk (Member) commented on Jan 8, 2025

Note that CI looks yucky right now, we're working on that :)

@kmod (Contributor, Author) commented on Jan 8, 2025

Haha yeah, at first I was worried that I had somehow broken so much without realizing it.

Also, I wanted to say thanks for the project! It's been very handy.
