Introduces `fibre/internal/row`, a bucketed allocator of fixed-shape row batches used by the blob encode path and the rsema1d codec's work buffers.

- Replaces the per-encode `sync.Pool` with explicit retention (aged eviction, idle-grace drop) and mmap-backed regions above 1 MiB, keeping steady-state RSS proportional to concurrent in-flight encodes rather than worst-case per-worker reservation.
- Allocations run without holding the pool lock, so a fresh mmap doesn't stall concurrent `Get`s/`Put`s behind a multi-ms syscall.
- `row.Assembler` layers a K+N row view on top of the pool: original rows alias input data zero-copy where possible; parity, head, and tail rows come from a single pooled batch released as one unit.
- `ProtocolParams.CodecWorkRows()` exposes leopard-GF16's work-row count so callers can size the pool without pool code needing to know codec internals.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Third and final iteration of the encoding memory layout for Fibre.
The 2nd iteration (#7091) rested on the intuition that freeing each validator's rows as soon as `Upload` finished with them would keep peak memory lower than holding the full blob's rows through the tail-latency drain. Since `Upload` returns after 2/3 of validators have acked, carrying all 128 MiB through the remaining tail felt obviously wasteful.

In practice, this was a premature optimization that generated more complexity than it repaid. Per-validator releases are adversarial to the allocator: the rows that go free at any moment are a random subset of the blob, so the freed memory lands as fragmented holes rather than reusable slots. During implementation it became clear that fragmentation was a real problem, and several rounds of optimization were layered on top to compensate; in hindsight, they were only putting lipstick on a pig.
In a sync review, @walldiss flagged the slab allocator's complexity as something that reduced trust in it, and we agreed to eliminate per-validator releases to simplify the design. It was a great call that, in the end, also confirmed the optimization was flawed.
The 3rd iteration drops per-validator release entirely in favor of whole-batch pooling. It is significantly simpler, and more importantly, it behaves better under load: steady-state memory tracks the count of concurrent in-flight encodes rather than worker count × blob size. For example, 10 workers × 128 MiB blobs no longer pin ~10 GiB of work buffers; memory settles around however many encodes the network bottleneck keeps in flight.
This steady state never emerged in the 2nd iteration because fragmented reuse forced fresh allocation for nearly every encode, and the allocator couldn't recycle a random scatter of freed rows into a contiguous batch-shaped request, so memory grew until every worker effectively held its own reservation.