Playbook for 2 node RPC Clustering#79
Conversation
…o systems, including VRAM configuration, build instructions, and model deployment. update page.tsx to support h3 headers and in page section navigation
19adc3a to
e1b58a2
Compare
eddierichter-amd
left a comment
There was a problem hiding this comment.
This looks great @abdmalik-amd! I just had a couple small comments.
|
@eddierichter-amd were you able to actually reproduce the results here? |
You mean performance results? I don't have a 4-Strix Halo machine but I do infact have a 2-Strix Halo setup and following the same steps functionally works. |
|
Nice, I'm glad the functionality works. Some more minor UX considerations:
Other than that, pretty good and should be able to pass onto QA |
|
|
@eddierichter-amd @adamlam2-amd Images for llama-cli & llama-server interface have been added to the playbook |
|
@adamlam2-amd @eddierichter-amd Any additional comments or requirements before we merge this? |
adamlam2-amd
left a comment
There was a problem hiding this comment.
lgtm - one thing we should keep note of is to differentiate between RPC clustering and RCCL clustering so that users know exactly which one to use.
add playbook covering llama.cpp RPC-based distributed inference across two STX Halo systems, including VRAM configuration, build instructions, and model deployment.
update page.tsx to support h3 headers and in page section navigation