feat(memory): Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl #564

thuongle2210 · 2025-12-25T01:08:14Z

Description

This PR introduces a pluggable mem-buffer mechanism, and based on this, this also introduces a optimized staging lookup implementaion of mem-buffer.

Changelogs of `optimized staging lookup impl`

Converts staging buffer from previous structure to Vec<Block> for contiguous memory access
Adds batch_boundaries vector to track batch start positions
Introduces block_position_index for O(1) block ID to vector index mapping

Performance Impact

Improves search performance in staging buffers from O(number of staging blocks) to O(1 + number of selected staging blocks) by eliminating linear scans during get operations
Easily extendable with new memory buffer implementations for benchmarking, testing, etc., and switch to the most efficient approach.

thuongle2210 · 2025-12-25T01:40:36Z

hi @zuston , could you pls help me review this PR?

zuston

Thanks for your great work, @thuongle2210 — this is definitely moving in the right direction 👍

Before merging this, I’d like to make the buffer implementation more pluggable.
Would you mind abstracting the buffer into a trait, and then introducing different implementations on top of it?

I’m happy to bring in the optimization in this part of the codebase, but this path is critical for our internal shuffle service. Before rolling it out, I plan to run long-term performance tests as well as partial online bandwidth benchmarks. Based on the results, having a pluggable design will give us much more flexibility to iterate and tune.

Thanks again for the solid progress — looking forward to your thoughts on this.

riffle-server/src/store/mem/buffer.rs

thuongle2210 · 2025-12-25T06:35:09Z

Hi @zuston, do you mean I should define a trait—for example, MemoryBufferTrait (consisting of functions like total_size, flight_size, spill, append, get, etc.)—and implement it for MemoryBuffer, plus create a new OptStagingMemoryBuffer struct?

zuston · 2025-12-25T08:36:30Z

Hi @zuston, do you mean I should define a trait—for example, MemoryBufferTrait (consisting of functions like total_size, flight_size, spill, append, get, etc.)—and implement it for MemoryBuffer, plus create a new OptStagingMemoryBuffer struct?

Yes. Please go ahead 🛫

…perations, implement trait for memory_buffer

…tation feat: make buffer implementation pluggable

thuongle2210 · 2025-12-26T01:05:37Z

Hi @zuston, I made the buffer implementation more pluggable. Could you please review it?

zuston · 2025-12-26T07:09:18Z

Hi @zuston, I made the buffer implementation more pluggable. Could you please review it?

I have checked this PR, thanks for your great work! Overall lgtm. But for the better pluggable mechanism, I have refactored something. Could you help take a look? @thuongle2210

riffle-server/src/store/mem/buffer/unified_buffer.rs

thuongle2210 · 2025-12-26T08:49:17Z

LGTM! @zuston

zuston · 2025-12-26T08:57:09Z

thanks for your contribution! @thuongle2210 Let's merge this. 🎉

feat: format code

a7ccc8d

thuongle2210 marked this pull request as draft December 25, 2025 01:08

thuongle2210 marked this pull request as ready for review December 25, 2025 01:40

zuston reviewed Dec 25, 2025

View reviewed changes

riffle-server/src/store/mem/buffer.rs Outdated Show resolved Hide resolved

thuong and others added 11 commits December 25, 2025 20:51

feat: pre-allocate capabilities

8becaa0

feat: create buffer_core.rs where it stores core data structure and o…

904bf5a

…perations, implement trait for memory_buffer

feat: add buffer operations trait to all related components

c991380

feat: update test cases for supporting both memory buffer implementation

c22594e

feat: update test cases for supporting both memory buffer implementation

97ebedb

feat: update test cases for using BufferOps

f53bfa3

feat: remove the redundant import line

e05f7b1

feat: remove redundant lines

fcc1bf2

feat: change the ordering of memory_buffer implementations

615b3e1

convert TestBuffer trait to BufferOps trait at test cases

a33a782

Merge pull request #2 from thuongle2210/feat/refactor-buffer-implemen…

d829add

…tation feat: make buffer implementation pluggable

thuongle2210 changed the title ~~feat: optimize buffer get action~~ feat: make buffer implementation pluggable and optimize get action Dec 26, 2025

zuston added 6 commits December 26, 2025 13:58

re-org folders

067220a

enum to avoid generic type leak

4e74b4d

activate this

cc96c1c

memoryBuffer rename to DefaultMemoryBuffer

4bd2112

rename to unifiedBuffer

376e8b1

use bufferOptions

81d9a89

zuston approved these changes Dec 26, 2025

View reviewed changes

rename to MemoryBuffer as a trait name

187707a

zuston changed the title ~~feat: make buffer implementation pluggable and optimize get action~~ feat: Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl Dec 26, 2025

zuston changed the title ~~feat: Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl~~ feat(memory): Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl Dec 26, 2025

zuston reviewed Dec 26, 2025

View reviewed changes

riffle-server/src/store/mem/buffer/unified_buffer.rs Show resolved Hide resolved

zuston merged commit 27576c9 into zuston:master Dec 26, 2025
7 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(memory): Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl #564

feat(memory): Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl #564

Uh oh!

thuongle2210 commented Dec 25, 2025 •

edited by zuston

Loading

Uh oh!

thuongle2210 commented Dec 25, 2025

Uh oh!

zuston left a comment

Uh oh!

Uh oh!

thuongle2210 commented Dec 25, 2025 •

edited

Loading

Uh oh!

zuston commented Dec 25, 2025

Uh oh!

thuongle2210 commented Dec 26, 2025

Uh oh!

zuston commented Dec 26, 2025

Uh oh!

Uh oh!

thuongle2210 commented Dec 26, 2025

Uh oh!

zuston commented Dec 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat(memory): Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl #564

feat(memory): Introduce a pluggable mem-buffer mechanism and an optimized staging buffer impl #564

Uh oh!

Conversation

thuongle2210 commented Dec 25, 2025 • edited by zuston Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changelogs of optimized staging lookup impl

Performance Impact

Uh oh!

thuongle2210 commented Dec 25, 2025

Uh oh!

zuston left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

thuongle2210 commented Dec 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zuston commented Dec 25, 2025

Uh oh!

thuongle2210 commented Dec 26, 2025

Uh oh!

zuston commented Dec 26, 2025

Uh oh!

Uh oh!

thuongle2210 commented Dec 26, 2025

Uh oh!

zuston commented Dec 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

thuongle2210 commented Dec 25, 2025 •

edited by zuston

Loading

Changelogs of `optimized staging lookup impl`

thuongle2210 commented Dec 25, 2025 •

edited

Loading