Skip to content

Commit e23775d

Browse files
Update trtllm_guide.md (#112)
1 parent 33b005f commit e23775d

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

Popular_Models_Guide/Llama2/trtllm_guide.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -345,6 +345,7 @@ You can read more about Gen-AI Perf [here](https://docs.nvidia.com/deeplearning/
345345
To use Gen-AI Perf, run the following command in the same Triton docker container:
346346
```bash
347347
genai-perf \
348+
profile \
348349
-m ensemble \
349350
--service-kind triton \
350351
--backend tensorrtllm \
@@ -380,4 +381,4 @@ Request throughput (per sec): 0.61
380381
381382
## References
382383
383-
For more examples feel free to refer to [End to end workflow to run llama.](https://github.com/triton-inference-server/tensorrtllm_backend/blob/main/docs/llama.md)
384+
For more examples feel free to refer to [End to end workflow to run llama.](https://github.com/triton-inference-server/tensorrtllm_backend/blob/main/docs/llama.md)

0 commit comments

Comments
 (0)