
Playbook for 2 node RPC Clustering #79

Merged
danielholanda merged 8 commits into main from abdmalik/rpcclusterplaybook
Mar 13, 2026

Conversation

@abdmalik-amd
Collaborator

Add a playbook covering llama.cpp RPC-based distributed inference across two STX Halo systems, including VRAM configuration, build instructions, and model deployment.

Update page.tsx to support h3 headers and in-page section navigation.
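For context, the build step the playbook describes can be sketched roughly as follows. This is a sketch, not the playbook's exact commands: the `GGML_RPC` CMake option is llama.cpp's documented switch for enabling the RPC backend, but paths and the release configuration are illustrative.

```shell
# Clone llama.cpp and build with the RPC backend enabled, so that
# rpc-server and the --rpc client option are compiled in.
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
cmake -B build -DGGML_RPC=ON
cmake --build build --config Release
```

On Windows this would typically run from a Visual Studio developer prompt (which provides Ninja and the MSVC toolchain); on Linux it assumes a working compiler toolchain such as `build-essential`.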

Collaborator

@eddierichter-amd left a comment


This looks great @abdmalik-amd! I just had a couple small comments.

Comment thread playbooks/supplemental/clustering-rpc-server/playbook.json Outdated
Comment thread playbooks/supplemental/clustering-rpc-server/README.md
Comment thread playbooks/supplemental/clustering-rpc-server/README.md
Comment thread playbooks/supplemental/clustering-rpc-server/README.md
@danielholanda danielholanda requested a review from bog601 February 26, 2026 19:16
@danielholanda
Collaborator

@eddierichter-amd were you able to actually reproduce the results here?

@eddierichter-amd
Collaborator

@eddierichter-amd were you able to actually reproduce the results here?

You mean performance results? I don't have a 4-Strix Halo machine, but I do in fact have a 2-Strix Halo setup, and following the same steps works functionally.

@adamlam2-amd
Collaborator

Nice, I'm glad the functionality works. Some more minor UX considerations:

  1. Maybe specify what exact memory values are needed to run the model we are working with. You can mention setting the variable graphics memory to ~75%, and also the TTM values, to load larger models.
  2. I think RPC does not improve compute, but rather just allows for model offloading onto the combined VRAM.
  3. I'm not too familiar with building llama.cpp from source, but ensure the instructions work. What is Ninja for Windows? Is that installed? Do we need build-essential for Linux?
  4. Some Windows vs. Linux instructions: .\rpc-server.exe vs ./rpc-server
  5. Any images/screenshots/gifs you can add would be helpful!

Other than that, pretty good and should be able to pass onto QA
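The Windows/Linux distinction in point 4 can be illustrated with a minimal launch sketch. This assumes llama.cpp's standard `rpc-server` options (`-H`/`--host`, `-p`/`--port`) and the `--rpc` client flag; the IP addresses, port, and model path are placeholders.

```shell
# On each worker node, start the RPC server.
# Linux:
./rpc-server -H 0.0.0.0 -p 50052
# Windows (note the backslash and .exe suffix):
# .\rpc-server.exe -H 0.0.0.0 -p 50052

# On the head node, point the client at the workers so layers are
# offloaded across the combined VRAM of both machines.
./llama-cli -m model.gguf --rpc 192.168.1.10:50052,192.168.1.11:50052 -ngl 99
```

As point 2 notes, this pools VRAM across nodes rather than speeding up compute: the model's layers are split across the listed RPC servers.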

@abdmalik-amd
Collaborator Author

abdmalik-amd commented Mar 5, 2026


  1. Sure, I will update the memory text; the current text uses the memory macro.
  2. Correct.
  3. Ninja is included in the Visual Studio Build Tools.
  4. Will fix this typo in the Windows section.
  5. Sure, I will look into adding some pictures/gifs of usage.

@abdmalik-amd
Collaborator Author

@eddierichter-amd @adamlam2-amd

Images for llama-cli & llama-server interface have been added to the playbook

@danielholanda
Collaborator

@adamlam2-amd @eddierichter-amd Any additional comments or requirements before we merge this?

Collaborator

@adamlam2-amd left a comment

LGTM. One thing we should keep note of is to differentiate between RPC clustering and RCCL clustering, so that users know exactly which one to use.

@danielholanda danielholanda merged commit d5cddfd into main Mar 13, 2026
5 checks passed


5 participants