We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent c52dcc1 commit c54ba82Copy full SHA for c54ba82
Deployment/Kubernetes/README.md
@@ -1,4 +1,4 @@
1
# Kubernetes Deployment of Triton Server Guides
2
3
* [TensorRT-LLM Gen. AI Autoscaling & Load Balancing](./TensorRT-LLM_Autoscaling_and_Load_Balancing/README.md)
4
-* [Multi-Node Generative AI w/ Triton Server and TensorRT-LLM](./TensorRT-LLM_Multi-Node_Distributed_Models/README.md)
+* [EKS Multinode Triton TRT-LLM](https://github.com/triton-inference-server/tutorials/tree/main/Deployment/Kubernetes/EKS_Multinode_Triton_TRTLLM)
0 commit comments