memtable/skiplist: add a purpose-built skiplist #131

ajwerner · 2025-05-18T04:37:42Z

Fixes #95.

Supersedes #98

marvin-j97 · 2025-05-18T15:31:41Z

With the Borrow<Q> trait bound it's not possible to use InternalKeyRef that would allow to not build an InternalKey (which involves a heap allocation).
Need to use equivalent::Comparable instead.

See: #98

marvin-j97 · 2025-05-19T01:42:54Z

src/memtable/skiplist/arena.rs

+// for the crate to work correctly. Anything larger than that will work.
+//
+// TODO: Justify this size.
+const DEFAULT_BUFFER_SIZE: usize = (32 << 10) - size_of::<AtomicUsize>();


Need to play with this a bit - but should probably be much higher by default: 1 MB or so?

Yeah, it should be bigger than 32k, but 1MiB might be too big. The keys and values are not inline, it’s just the metadata. The questions I’d have are how expensive is allocating a new block, and how expensive is inserting into the skip map. My guess is that the alloc is not likely worse than 10us (it’s probably way less) and the inserts are ~100ns. If you can fit 1000 in here (if we say the average links is 32 and the key and value are each 32 bytes), then you’ll have spent at least 10x as long doing the inserting. In practice I think the mallocs even with zeroing is a lot cheaper. The benchmarks I was playing with don’t show much win above 256KiB.

marvin-j97 · 2025-05-19T01:44:20Z

src/memtable/skiplist/arena.rs

+unsafe impl<const N: usize> Send for Arenas<N> {}
+unsafe impl<const N: usize> Sync for Arenas<N> {}
+
+pub(crate) struct Arenas<const BUFFER_SIZE: usize = DEFAULT_BUFFER_SIZE> {


Eventually, for write transactions, the size should be much smaller (so that small transactions don't overallocate too much) - so this needs to be a non-generic parameter.

Okay, I can do that.

src/memtable/mod.rs

ajwerner

I’ll play with updating. It’s pretty hard to not leak the node from the previous update without hooking up a free list but it’s also not so hard to add one.

src/memtable/mod.rs

ajwerner · 2025-05-20T00:21:28Z

src/memtable/skiplist/arena.rs

+unsafe impl<const N: usize> Send for Arenas<N> {}
+unsafe impl<const N: usize> Sync for Arenas<N> {}
+
+pub(crate) struct Arenas<const BUFFER_SIZE: usize = DEFAULT_BUFFER_SIZE> {


Okay, I can do that.

ajwerner · 2025-05-20T01:13:03Z

src/memtable/skiplist/arena.rs

+// for the crate to work correctly. Anything larger than that will work.
+//
+// TODO: Justify this size.
+const DEFAULT_BUFFER_SIZE: usize = (32 << 10) - size_of::<AtomicUsize>();


Yeah, it should be bigger than 32k, but 1MiB might be too big. The keys and values are not inline, it’s just the metadata. The questions I’d have are how expensive is allocating a new block, and how expensive is inserting into the skip map. My guess is that the alloc is not likely worse than 10us (it’s probably way less) and the inserts are ~100ns. If you can fit 1000 in here (if we say the average links is 32 and the key and value are each 32 bytes), then you’ll have spent at least 10x as long doing the inserting. In practice I think the mallocs even with zeroing is a lot cheaper. The benchmarks I was playing with don’t show much win above 256KiB.

marvin-j97 · 2025-07-12T00:55:42Z

Todo:

Evaluate memory overhead vs crossbeam skiplist
Evaluate read and write latency vs crossbeam skiplist
Try in a full system (heavily cached) benchmark (https://github.com/marvin-j97/rust-storage-bench) vs crossbeam skiplist

marvin-j97 · 2025-09-06T20:41:23Z

I've tried rebasing this onto the v3 branch.

Outlook now: we don't need a free list because all insertions will be unique (because of unique sequence number).

marvin-j97 · 2025-09-08T16:11:54Z

Removing the const generic N parameter is actually a bit harder than expected because it defines the layout of Buffer; changing it to static makes pointer derefs fail with misaligned pointer dereference.

memtable/skiplist: add a purpose-built skiplist

1f12a1a

ajwerner force-pushed the memtable-skiplist branch from 46acb25 to 1f12a1a Compare May 18, 2025 04:41

marvin-j97 added enhancement New feature or request performance type:memtable test labels May 18, 2025

marvin-j97 added 4 commits May 18, 2025 17:04

fmt

41783c9

remove crossbeam skiplist dep

b53b708

refactor skiplist

ee1046a

restore InternalKeyRef

b0cb95e

marvin-j97 added benchmark and removed benchmark labels May 18, 2025

marvin-j97 requested changes May 19, 2025

View reviewed changes

ajwerner commented May 20, 2025

View reviewed changes

marvin-j97 mentioned this pull request Jul 2, 2025

Custom key comparison #108

Open

marvin-j97 added good first issue Good for newcomers help wanted Extra attention is needed labels Jul 12, 2025

marvin-j97 changed the base branch from main to 3.0.0 September 6, 2025 20:29

marvin-j97 added 3 commits September 6, 2025 22:34

Merge branch '3.0.0' into memtable-skiplist

0c0affb

Update mod.rs

868929b

Update key.rs

0a7ebc1

fix: skiplist trait bounds

d76fe11

marvin-j97 added 4 commits September 8, 2025 18:14

clippy

e526b55

clippy

b9476b2

clippy

65a59d1

clippy

a222a4c

marvin-j97 added 2 commits September 8, 2025 18:39

clippy

f48ac0f

remove unused code

e3df87e

Uh oh!

memtable/skiplist: add a purpose-built skiplist #131

Are you sure you want to change the base?

memtable/skiplist: add a purpose-built skiplist #131

Uh oh!

Conversation

ajwerner commented May 18, 2025 • edited by marvin-j97 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marvin-j97 commented May 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marvin-j97 May 19, 2025

Choose a reason for hiding this comment

Uh oh!

ajwerner May 20, 2025

Choose a reason for hiding this comment

Uh oh!

marvin-j97 May 19, 2025

Choose a reason for hiding this comment

Uh oh!

ajwerner May 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ajwerner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ajwerner May 20, 2025

Choose a reason for hiding this comment

Uh oh!

ajwerner May 20, 2025

Choose a reason for hiding this comment

Uh oh!

marvin-j97 commented Jul 12, 2025

Uh oh!

marvin-j97 commented Sep 6, 2025

Uh oh!

marvin-j97 commented Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ajwerner commented May 18, 2025 •

edited by marvin-j97

Loading

marvin-j97 commented May 18, 2025 •

edited

Loading