1 parent 687e477 commit 0a54882
1 file changed
README.md
@@ -91,7 +91,7 @@ docker run --runtime nvidia --gpus all \
#### 2. Run a benchmark
```shell
-inference-benchmarker
+inference-benchmarker \
--tokenizer-name "meta-llama/Llama-3.1-8B-Instruct" \
--url http://localhost:8080 \
--profile chat