Skip to content

Commit 1145f10

Browse files
committed
Update AutoScaling Blog to 24.07
1 parent 5f85241 commit 1145f10

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

Deployment/Kubernetes/TensorRT-LLM_Autoscaling_and_Load_Balancing/containers/triton_trt-llm.containerfile

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
# See the License for the specific language governing permissions and
1313
# limitations under the License.
1414

15-
ARG BASE_CONTAINER_IMAGE=nvcr.io/nvidia/tritonserver:24.04-trtllm-python-py3
15+
ARG BASE_CONTAINER_IMAGE=nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3
1616
ARG ENGINE_DEST_PATH=/var/run/engines
1717
ARG HF_HOME=/var/run/cache
1818

@@ -42,7 +42,7 @@ RUN pip --verbose install \
4242
--no-cache-dir \
4343
--no-color \
4444
--no-input \
45-
git+https://github.com/triton-inference-server/triton_cli.git@jwyman/aslb-mn
45+
git+https://github.com/triton-inference-server/triton_cli.git@ibhosale/aslb-mn
4646

4747
# Copy the server script.
4848
COPY server.py .

0 commit comments

Comments
 (0)