Skip to content

Add blog post for CDC#12025

Open
tyler-french wants to merge 1 commit intomasterfrom
tfrench/blog
Open

Add blog post for CDC#12025
tyler-french wants to merge 1 commit intomasterfrom
tfrench/blog

Conversation

@tyler-french
Copy link
Copy Markdown
Contributor

@tyler-french tyler-french commented Apr 28, 2026

Screenshot 2026-04-29 at 3 11 35 PM

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new blog post introducing Remote Cache Content-Defined Chunking (CDC) and the accompanying diagrams, plus updates existing docs to reference the new Bazel chunking flag and the blog post.

Changes:

  • Add a new CDC deep-dive blog post (content-defined-chunking) with supporting images/diagrams.
  • Update CDC changelog entry to simplify enablement instructions and link to the new blog post.
  • Update the “Why was my Bazel build so slow?” post to include --experimental_remote_cache_chunking.

Reviewed changes

Copilot reviewed 4 out of 13 changed files in this pull request and generated 2 comments.

Show a summary per file
File Description
website/static/img/blog/cdc_transitive_action_collapse.svg New diagram illustrating transitive action collapse (with highlighted path).
website/static/img/blog/cdc_splitblob_spliceblob_flows.svg New diagram showing SplitBlob/SpliceBlob read/write flows.
website/static/img/blog/cdc_local_vs_distributed_cache.svg New diagram comparing local chunk store vs distributed cache/manifest.
website/static/img/blog/cdc_golink_insertion_simplified.svg New diagram demonstrating chunk boundary stability after insertion.
website/static/img/blog/cdc_chunk_stability.svg New diagram explaining CDC stability under small edits.
website/static/img/blog/cdc_chunk_offsets_not_copies.svg New diagram explaining offset-based chunk representation.
website/changelog/bazel-remote-cache-cdc.md Updates enablement snippet and links to the new blog post.
website/blog/why-was-my-bazel-build-so-slow.md Adds --experimental_remote_cache_chunking to the recommended flags list.
website/blog/content-defined-chunking.md New CDC blog post content and references to the new diagrams.
website/blog/authors.yaml Adds author key tfrench for the new post.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread website/blog/content-defined-chunking.md Outdated
Comment thread website/blog/authors.yaml Outdated
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new “Remote Cache CDC” (content-defined chunking) blog post to the website, along with supporting diagrams, and updates existing docs/changelog to reference the new capability and article.

Changes:

  • Added new blog post: content-defined-chunking explaining CDC and related Remote Execution APIs.
  • Added multiple new SVG diagrams used by the blog post.
  • Updated existing changelog entry and Bazel performance-flags blog to reference --experimental_remote_cache_chunking and link to the new post.

Reviewed changes

Copilot reviewed 3 out of 13 changed files in this pull request and generated no comments.

Show a summary per file
File Description
website/static/img/blog/cdc_transitive_action_collapse.svg New diagram illustrating how transitive actions collapse the dependency graph.
website/static/img/blog/cdc_splitblob_spliceblob_flows.svg New diagram showing read/write RPC flows using SplitBlob/SpliceBlob.
website/static/img/blog/cdc_local_vs_distributed_cache.svg New diagram comparing local chunk reuse vs distributed cache storage model.
website/static/img/blog/cdc_golink_insertion_simplified.svg New diagram illustrating chunk stability across an insertion.
website/static/img/blog/cdc_chunk_stability.svg New diagram illustrating CDC boundary stability under small edits.
website/static/img/blog/cdc_chunk_offsets_not_copies.svg New diagram contrasting materialized chunks vs offsets.
website/changelog/bazel-remote-cache-cdc.md Updated CDC changelog entry to simplify enablement guidance and link to the new post.
website/blog/why-was-my-bazel-build-so-slow.md Added the --experimental_remote_cache_chunking flag to the recommended flags list.
website/blog/content-defined-chunking.md New long-form blog post introducing remote cache CDC.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@iain-macdonald iain-macdonald self-requested a review April 28, 2026 23:04
@tyler-french tyler-french force-pushed the tfrench/blog branch 2 times, most recently from 3ad9ff8 to a6806b1 Compare April 29, 2026 15:18
Copy link
Copy Markdown
Collaborator

@maggie-lou maggie-lou left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome!

Comment thread website/blog/content-defined-chunking.md Outdated
Comment thread website/blog/content-defined-chunking.md Outdated
Comment thread website/blog/content-defined-chunking.md Outdated
Comment thread website/blog/why-was-my-bazel-build-so-slow.md Outdated
@tyler-french tyler-french force-pushed the tfrench/blog branch 6 times, most recently from ff7a0db to 2430c09 Compare April 29, 2026 16:15
Copy link
Copy Markdown
Member

@bduffany bduffany left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

awesome writeup!

Comment thread website/blog/content-defined-chunking.md Outdated
Comment thread website/blog/content-defined-chunking.md Outdated
@tyler-french tyler-french force-pushed the tfrench/blog branch 6 times, most recently from aaf945a to 7a0e9de Compare April 29, 2026 19:16
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new BuildBuddy blog post explaining Remote Cache CDC (content-defined chunking), along with supporting diagrams and a couple of cross-links/flag mentions in existing site content.

Changes:

  • Added the new blog post: Remote Cache CDC: Reusing Bytes (/blog/content-defined-chunking).
  • Added multiple new SVG diagrams used by the post.
  • Updated the CDC changelog entry and the “slow Bazel build” post to reference --experimental_remote_cache_chunking and link to the new post.

Reviewed changes

Copilot reviewed 3 out of 13 changed files in this pull request and generated no comments.

Show a summary per file
File Description
website/static/img/blog/cdc_transitive_action_collapse.svg New diagram illustrating transitive action “collapse” behavior.
website/static/img/blog/cdc_splitblob_spliceblob_flows.svg New diagram showing SplitBlob/SpliceBlob read+write flows.
website/static/img/blog/cdc_local_vs_distributed_cache.svg New diagram comparing local chunk store vs distributed remote cache (has aria-labelledby).
website/static/img/blog/cdc_golink_insertion_simplified.svg New diagram illustrating chunk boundary stability after an insertion.
website/static/img/blog/cdc_chunk_stability.svg New diagram illustrating chunk stability under small edits.
website/static/img/blog/cdc_chunk_offsets_not_copies.svg New diagram illustrating “byte ranges, not copies” optimization.
website/changelog/bazel-remote-cache-cdc.md Updates CDC changelog entry to simplify enablement instructions and link to the new post.
website/blog/why-was-my-bazel-build-so-slow.md Adds --experimental_remote_cache_chunking to the recommended flags list.
website/blog/content-defined-chunking.md New long-form post describing CDC motivation, APIs, and results.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@tyler-french tyler-french review requested due to automatic review settings April 29, 2026 19:52
@iain-macdonald
Copy link
Copy Markdown
Contributor

Nice work! This looks good, a few high-level comments:

  • We haven't really talked about the proxies externally yet, so you may want to avoid going into too much detail about them and provide an explanation before you reference them.
  • When I read this one question I had is: "how chunkable are different kinds of content." If you have an answer, that might be nice to include.
  • Organizationally, you might want to put all of the performance data and stuff in one section? Currently it shows up in a few places (e.g. in "How does this benefit remote caching?" and "Results")
  • It might make sense to explain in the "BuildBuddy Implementation" section or above that any traffic between caches (Bazel client, BuildBuddy, executors) is CDC able, and we've implemented CDC for most internal traffic. That clarification kind of sets the stage for the next couple of sections.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants