Reduce allocator usage in Gossip implementation #4640


Open
alexpyattaev wants to merge 3 commits into master

Conversation


@alexpyattaev alexpyattaev commented Jan 26, 2025

Problem

Gossip is torturing the memory allocator too much. A significant share of these allocations occurs in flate2 compression.

Summary of Changes

  • Deflate called inflate for some reason and discarded the result. This is pure waste. Killed the unneeded call.
  • Switched to the bitvec crate to reduce reallocations caused by the lack of an API to read the underlying storage buffer (see the sketch below).
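
For illustration, a minimal sketch of the kind of direct storage access bitvec offers, assuming a BitVec<u8, Lsb0> backing store; this is not the PR's actual code:

use bitvec::prelude::*;

// Sketch only: with bitvec the backing bytes can be borrowed directly,
// so writing the bitfield out does not force a reallocating copy.
fn storage_bytes(bits: &BitVec<u8, Lsb0>) -> &[u8] {
    bits.as_raw_slice()
}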

Fixes #
Gossip allocating too much.

@alexpyattaev alexpyattaev changed the title kill useless call to inflate Reduce allocator usage in Gossip implementation Jan 26, 2025
compressed,
};
let _ = rv.inflate()?;
Author

Apparently this was a hack to validate that compression worked out. A crude but effective one I guess.

This seems like it was a sanity check.
I don't know how often it does fail though.
Have you checked if it ever fails or not?

can we keep this line as a debug check:

debug_assert_matches!(rv.inflate(), Ok(_));

@alexpyattaev alexpyattaev marked this pull request as ready for review January 26, 2025 22:00
@alexpyattaev alexpyattaev removed the request for review from gregcusack January 26, 2025 22:56
@alexpyattaev alexpyattaev marked this pull request as draft January 26, 2025 22:57
@behzadnouri

Chatted on slack.
We need to keep the old bv crate because any minor implementation differences between the two crates can introduce compatibility issues where a node running the new code cannot send (or receive) epoch-slots to (or from) a node running an older version of the code.

@alexpyattaev alexpyattaev marked this pull request as ready for review January 27, 2025 14:32
@gregcusack gregcusack self-requested a review January 27, 2025 15:44
@alexpyattaev

Revert to bv is complete. We even end up doing the exact same number of allocations as we would with bitvec.

@gregcusack gregcusack left a comment

Looks good, just a few questions, but then should be gtg.

Comment on lines +145 to 152
     min_slot - self.first_slot
 };
-for i in start..self.num {
-    if i >= self.slots.len() as usize {
+for i in start..self.num as u64 {
+    if i >= self.slots.len() {
         break;
     }
-    if self.slots.get(i as u64) {
+    if self.slots.get(i) {
         rv.push(self.first_slot + i as Slot);

are these just stylistic changes?

Author

It's to keep clippy happy.

Comment on lines +168 to +173
+let offset = *s - self.first_slot;
+if offset >= self.slots.len() {
+    return i;
+}
-self.slots.set(*s - self.first_slot, true);
-self.num = std::cmp::max(self.num, 1 + (*s - self.first_slot) as usize);
+self.slots.set(offset, true);
+self.num = std::cmp::max(self.num, 1 + offset as usize);

same here.

Author

this is purely to make the code more readable.

compressed,
};
let _ = rv.inflate()?;

Are we worried about forcing the sanity check on the receiver of this EpochSlot? Looks like maybe not, since you added the check here:

if status != flate2::Status::StreamEnd {
    return Err(Error::DecompressError);
}

Author

I think the original idea was to validate that compression worked (no idea why else it would be here). But since flate2 can tell us that "it worked", I see no point in running decompression to achieve the same result. The check you refer to is exactly that: StreamEnd, according to the docs, indicates that all input was consumed and all output fit into the provided buffer without reallocation.
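
For reference, a minimal standalone sketch of that compress-side check, assuming flate2's Compress/compress_vec API, a default compression level, and arbitrary headroom (not the actual epoch_slots.rs code):

use flate2::{Compress, Compression, FlushCompress, Status};

// Sketch only: compress in one pass and trust flate2's own status instead of
// inflating the result back just to prove that compression worked.
fn deflate_checked(input: &[u8]) -> Result<Vec<u8>, String> {
    // 64 bytes of headroom for incompressible input is an assumption here,
    // not the sizing used in epoch_slots.rs.
    let mut out = Vec::with_capacity(input.len() + 64);
    let mut compressor = Compress::new(Compression::default(), false);
    let status = compressor
        .compress_vec(input, &mut out, FlushCompress::Finish)
        .map_err(|e| format!("compress failed: {e}"))?;
    // StreamEnd means all input was consumed and the whole compressed stream
    // fit into `out` without another pass.
    if status != Status::StreamEnd {
        return Err(format!("compression did not finish: {status:?}"));
    }
    Ok(out)
}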

@behzadnouri behzadnouri left a comment

The change looks fine.
But just to make sure, have you verified that this change is backward compatible?
Like, is the new (old) code able to ingest epoch slots generated by the old (new) code?

Separately, with the new code, does deflate/inflate return Err more often or less often than before?

@alexpyattaev

Did some tests on MNB. The compression branch never errored, including the old validation path that decompresses to check (it does not get called much, though, so maybe it is possible to trigger errors there if we try harder). The decompression path errored in 43104/6059035 packets, each time with BufError (i.e. not enough space for decompression https://prisma.github.io/prisma-engines/doc/flate2/enum.Status.html). Previously, before we did any checking on the decompress path, all of these packets would be passed on to the validator and do something in there. I guess these could be formed by incompatible implementations?
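
For context, a minimal sketch of an inflate path that surfaces this kind of failure; the buffer bound, error strings, and the raw-deflate (no zlib header) flag are assumptions, not the agave implementation:

use flate2::{Decompress, FlushDecompress, Status};

// Sketch only: reject packets whose decompressed output does not fit into a
// pre-sized buffer. `max_out` is a hypothetical bound, not the real headroom
// constant from epoch_slots.rs.
fn inflate_checked(compressed: &[u8], max_out: usize) -> Result<Vec<u8>, String> {
    let mut out = Vec::with_capacity(max_out);
    // `false` = raw deflate stream without a zlib header (an assumption here).
    let mut decompressor = Decompress::new(false);
    let status = decompressor
        .decompress_vec(compressed, &mut out, FlushDecompress::Finish)
        .map_err(|e| format!("corrupt stream: {e}"))?;
    // Anything other than StreamEnd (e.g. BufError) means the output did not
    // fit or the stream was truncated, so the packet is rejected.
    if status != Status::StreamEnd {
        return Err(format!("inflate did not finish: {status:?}"));
    }
    Ok(out)
}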

@behzadnouri

behzadnouri commented Jan 28, 2025

the decompression path errored in 43104/6059035 packets, each time with BufError

hmm, how often does this happen with the old code?

Separately, can you please check whether the number of epoch-slots that are received from the cluster but fail to sigverify increases or not?
Like we discussed in the chat before, signatures are defined on the serialized bytes, so we need to make sure that, starting with any serialized input bytes:

serialize(deserialize(bytes)) == bytes

otherwise the signatures do not verify. In other words,

bytes -> deserialize -> serialize -> bytes

should round-trip. So if the number of epoch-slots which don't sigverify increases, it means the above does not hold.
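
A minimal sketch of that round-trip property as a check, assuming bincode is the serializer (the helper name and bounds are illustrative, not agave code):

use serde::{de::DeserializeOwned, Serialize};

// Hypothetical helper: verifies that deserializing and re-serializing some
// signed input reproduces the exact same bytes, so signatures still verify.
fn round_trips<T: Serialize + DeserializeOwned>(bytes: &[u8]) -> bool {
    match bincode::deserialize::<T>(bytes) {
        Ok(value) => bincode::serialize(&value)
            .map(|reserialized| reserialized == bytes)
            .unwrap_or(false),
        Err(_) => false,
    }
}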

@alexpyattaev

hmm, how often does this happen with the old code?

The decompression path is exactly the same as in the old code, except for the check, which did not exist before. So in the old code it would fail exactly the same way, the failure would just be ignored.

@behzadnouri

the decompression path errored in 43104/6059035 packets, each time with BufError (i.e. not enough space for decompression https://prisma.github.io/prisma-engines/doc/flate2/enum.Status.html).

Pretty suspicious that it is giving BufError.
Does it help if we increase the headroom below?
https://github.com/anza-xyz/agave/blob/9ca57b16e/gossip/src/epoch_slots.rs#L101-L102

@alexpyattaev

Increasing the headroom to 128 bytes results in no notable reduction in the rate of packet decode errors. I guess only some nodes produce invalid packets that cannot be decompressed, and buffer size has nothing to do with it.
5963/649907 packets failed to decompress over a test run of 10 minutes.

@alexpyattaev

alexpyattaev commented Jan 29, 2025

Attaching a list of validators that send invalid packets:
invalids.txt
Update: got a total of 920 validators sending "malformed" packets.

@behzadnouri

Update: got a total of 920 validators sending "malformed" packets.

There is a newer version of flate2 than the one we are using: #3660,
but the dependabot patch isn't merged yet.
It might be worth giving it a shot to see whether we get more or fewer malformed packets.

@alexpyattaev

13436/897356, or 1.4%, of packets are reported as invalid by the inflate process with the new version, which is somewhat more than with the old version (0.8%). The more suspicious issue is that the new version fails the roundtrip tests in CI but passes the same tests on a devbox, even with multiple reruns.
