|
83 | 83 | * Applies to the most recent `--benchmark`, or all benchmarks if specified |
84 | 84 | before any `--benchmark` arguments. |
85 | 85 |
|
86 | | -* `--min-samples <count>` |
87 | | - * Gather at least `<count>` samples per measurement. |
88 | | - * Default is 10 samples. |
89 | | - * Applies to the most recent `--benchmark`, or all benchmarks if specified |
90 | | - before any `--benchmark` arguments. |
91 | | - |
92 | | -* `--stopping-criterion <criterion>` |
93 | | - * After `--min-samples` is satisfied, use `<criterion>` to detect if enough |
94 | | - samples were collected. |
95 | | - * Only applies to Cold measurements. |
96 | | - * Default is stdrel (`--stopping-criterion stdrel`) |
97 | | - |
98 | | -* `--min-time <seconds>` |
99 | | - * Accumulate at least `<seconds>` of execution time per measurement. |
100 | | - * Only applies to `stdrel` stopping criterion. |
101 | | - * Default is 0.5 seconds. |
102 | | - * If both GPU and CPU times are gathered, this applies to GPU time only. |
103 | | - * Applies to the most recent `--benchmark`, or all benchmarks if specified |
104 | | - before any `--benchmark` arguments. |
105 | | - |
106 | | -* `--max-noise <value>` |
107 | | - * Gather samples until the error in the measurement drops below `<value>`. |
108 | | - * Noise is specified as the percent relative standard deviation. |
109 | | - * Default is 0.5% (`--max-noise 0.5`) |
110 | | - * Only applies to `stdrel` stopping criterion. |
111 | | - * Only applies to Cold measurements. |
112 | | - * If both GPU and CPU times are gathered, this applies to GPU noise only. |
113 | | - * Applies to the most recent `--benchmark`, or all benchmarks if specified |
114 | | - before any `--benchmark` arguments. |
115 | | - |
116 | 86 | * `--skip-time <seconds>` |
117 | 87 | * Skip a measurement when a warmup run executes in less than `<seconds>`. |
118 | 88 | * Default is -1 seconds (disabled). |
|
123 | 93 | * Applies to the most recent `--benchmark`, or all benchmarks if specified |
124 | 94 | before any `--benchmark` arguments. |
125 | 95 |
|
126 | | -* `--timeout <seconds>` |
127 | | - * Measurements will timeout after `<seconds>` have elapsed. |
128 | | - * Default is 15 seconds. |
129 | | - * `<seconds>` is walltime, not accumulated sample time. |
130 | | - * If a measurement times out, the default markdown log will print a warning to |
131 | | - report any outstanding termination criteria (min samples, min time, max |
132 | | - noise). |
133 | | - * Applies to the most recent `--benchmark`, or all benchmarks if specified |
134 | | - before any `--benchmark` arguments. |
135 | | - |
136 | 96 | * `--throttle-threshold <value>` |
137 | 97 | * Set the GPU throttle threshold as percentage of the device's default clock rate. |
138 | 98 | * Default is 75. |
|
166 | 126 | * Intended for use with external profiling tools. |
167 | 127 | * Applies to the most recent `--benchmark`, or all benchmarks if specified |
168 | 128 | before any `--benchmark` arguments. |
| 129 | + |
| 130 | +## Stopping Criteria |
| 131 | + |
| 132 | +* `--timeout <seconds>` |
| 133 | + * Measurements will timeout after `<seconds>` have elapsed. |
| 134 | + * Default is 15 seconds. |
| 135 | + * `<seconds>` is walltime, not accumulated sample time. |
| 136 | + * If a measurement times out, the default markdown log will print a warning to |
| 137 | + report any outstanding termination criteria (min samples, min time, max |
| 138 | + noise). |
| 139 | + * Applies to the most recent `--benchmark`, or all benchmarks if specified |
| 140 | + before any `--benchmark` arguments. |
| 141 | + |
| 142 | +* `--min-samples <count>` |
| 143 | + * Gather at least `<count>` samples per measurement before checking any |
| 144 | + other stopping criterion besides the timeout. |
| 145 | + * Default is 10 samples. |
| 146 | + * Applies to the most recent `--benchmark`, or all benchmarks if specified |
| 147 | + before any `--benchmark` arguments. |
| 148 | + |
| 149 | +* `--stopping-criterion <criterion>` |
| 150 | + * After `--min-samples` is satisfied, use `<criterion>` to detect if enough |
| 151 | + samples were collected. |
| 152 | + * Only applies to Cold and CPU-only measurements. |
| 153 | + * If both GPU and CPU times are gathered, GPU time is used for stopping |
| 154 | + analysis. |
| 155 | + * Stopping criteria provided by NVBench are: |
| 156 | + * "stdrel": (default) Converges to a minimal relative standard deviation, |
| 157 | + stdev / mean |
| 158 | + * "entropy": Converges based on the cumulative entropy of all samples. |
| 159 | + * Each stopping criterion may provide additional parameters to customize |
| 160 | + behavior, as detailed below: |
| 161 | + |
| 162 | +### "stdrel" Stopping Criterion Parameters |
| 163 | + |
| 164 | +* `--min-time <seconds>` |
| 165 | + * Accumulate at least `<seconds>` of execution time per measurement. |
| 166 | + * Only applies to `stdrel` stopping criterion. |
| 167 | + * Default is 0.5 seconds. |
| 168 | + * Applies to the most recent `--benchmark`, or all benchmarks if specified |
| 169 | + before any `--benchmark` arguments. |
| 170 | + |
| 171 | +* `--max-noise <value>` |
| 172 | + * Gather samples until the error in the measurement drops below `<value>`. |
| 173 | + * Noise is specified as the percent relative standard deviation (stdev/mean). |
| 174 | + * Default is 0.5% (`--max-noise 0.5`) |
| 175 | + * Applies to the most recent `--benchmark`, or all benchmarks if specified |
| 176 | + before any `--benchmark` arguments. |
| 177 | + |
| 178 | +### "entropy" Stopping Criterion Parameters |
| 179 | + |
| 180 | +* `--max-angle <value>` |
| 181 | + * Maximum linear regression angle of cumulative entropy. |
| 182 | + * Smaller values give more accurate results. |
| 183 | + * Default is 0.048. |
| 184 | + * Applies to the most recent `--benchmark`, or all benchmarks if specified |
| 185 | + before any `--benchmark` arguments. |
| 186 | + |
| 187 | +* `--min-r2 <value>` |
| 188 | + * Minimum coefficient of determination for linear regression of cumulative |
| 189 | + entropy. |
| 190 | + * Larger values give more accurate results. |
| 191 | + * Default is 0.36. |
| 192 | + * Applies to the most recent `--benchmark`, or all benchmarks if specified |
| 193 | + before any `--benchmark` arguments. |
0 commit comments