We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e2801ae commit 594d4f5Copy full SHA for 594d4f5
Popular_Models_Guide/Hermes-2-Pro-Llama-3-8B/README.md
@@ -202,7 +202,7 @@ First, let's start Triton SDK container:
202
docker run --rm -it --net host --shm-size=2g \
203
--ulimit memlock=-1 --ulimit stack=67108864 --gpus all \
204
-v /path/to/tensorrtllm_backend/inflight_batcher_llm/client:/tensorrtllm_client \
205
- -v /path/to/Hermes-2-Pro-Llama-3-8B/repo:/Llama-2-7b-hf \
+ -v /path/to/Hermes-2-Pro-Llama-3-8B/repo:/Hermes-2-Pro-Llama-3-8B \
206
nvcr.io/nvidia/tritonserver:<xx.yy>-py3-sdk
207
```
208
0 commit comments