Commit 3a9c80d

committed
Updates
1 parent 8af0aa3 commit 3a9c80d

1 file changed: +2 -9 lines changed

docs/best_practices.md

Lines changed: 2 additions & 9 deletions
@@ -127,16 +127,9 @@ Pass: Cold: 0.007819ms GPU, 0.013864ms CPU, 0.50s total GPU, 3.59s total wall, 6
 | 63952x | 13.864 us | 432.95% | 7.819 us | 447.95% |
 ```
 
-By default, NVBench runs benchmarks on all available GPUs unless specified otherwise. On multi-GPU systems, this can unnecessarily increase runtime and resource usage. To target a specific GPU, saving both time and resources, you can set the `CUDA_VISIBLE_DEVICES` environment variable. In our case, we target the **RTX8000**:
-
-```bash
-user@nvbench-test:~/nvbench/build/bin$ export CUDA_VISIBLE_DEVICES=0
-```
-
-Now, if we rerun:
-
+By default, NVBench executes benchmarks on all available GPUs unless instructed otherwise. On multi-GPU systems, this can lead to longer runtimes and higher resource usage. To focus on a specific GPU and optimize both time and resources, users can use the `-d` CLI option to select the target GPU. In our example, we target the **RTX8000**:
 ```bash
-user@nvbench-test:~/nvbench/build/bin$ ./sequence_bench
+user@nvbench-test:~/nvbench/build/bin$ ./sequence_bench -d 0
 # Devices
 
 ## [0] `Quadro RTX 8000`
