-- `2`: Double the number of unique prefix combinations
-- `3`: Triple the number of unique prefix combinations
+Distribute traces across N independent radix trees:
+- `1`: All traces share the same prefix tree (default)
+- `2`: Traces randomly assigned to 2 independent trees (50% each)
+- `3`: Traces randomly assigned to 3 independent trees (33% each)
 
-Example: Generate more diverse prefix patterns:
+Each tree has identical structure but different hash IDs, so traces in different trees cannot share prefixes. This reduces the effective cache hit rate by splitting the workload.
+
+Example: Simulate lower cache hit rates with more diverse prefix roots:
 ```bash
 aiperf profile \
 --input-file traces/production.jsonl \
@@ -171,11 +173,11 @@ aiperf profile \
 ```
 
 #### `--synthesis-max-isl` (optional)
-Cap the maximum input sequence length:
-- Not set: No cap
-- `4096`: Maximum 4,096 tokens per request
+Filter traces by maximum input sequence length. Traces with input_length > max_isl are skipped:
+- Not set: No filtering
+- `4096`: Skip traces with more than 4,096 input tokens
 
-Example: Test with bounded context:
+Example: Filter out long contexts:
 ```bash
 aiperf profile \
 --input-file traces/production.jsonl \
@@ -184,6 +186,20 @@ aiperf profile \
 ...
 ```
 
+#### `--synthesis-max-osl` (optional)
+Cap traces to a maximum output sequence length. Traces with output_length > max_osl are capped to max_osl:
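
The two length options described in this diff behave differently: `--synthesis-max-isl` drops a trace entirely, while `--synthesis-max-osl` keeps the trace and clamps its output length. A minimal sketch of that difference follows. It is not aiperf's implementation. The `Trace` class and `apply_length_limits` function are hypothetical names introduced for illustration, and only the field names `input_length` and `output_length` and the limits `max_isl` and `max_osl` come from the documentation text.

```python
from dataclasses import dataclass, replace
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Trace:
    # Hypothetical record; field names mirror the wording in the docs.
    input_length: int
    output_length: int


def apply_length_limits(
    traces: Iterable[Trace],
    max_isl: Optional[int] = None,
    max_osl: Optional[int] = None,
) -> List[Trace]:
    """Skip traces over max_isl and clamp output lengths to max_osl.

    A limit of None means the option is not set, so nothing is
    filtered or capped for that dimension.
    """
    kept: List[Trace] = []
    for trace in traces:
        # max_isl is a filter: traces that exceed it are dropped.
        if max_isl is not None and trace.input_length > max_isl:
            continue
        # max_osl is a cap: the trace is kept with a shortened output.
        if max_osl is not None and trace.output_length > max_osl:
            trace = replace(trace, output_length=max_osl)
        kept.append(trace)
    return kept


if __name__ == "__main__":
    sample = [Trace(1000, 200), Trace(5000, 100), Trace(4096, 900)]
    for trace in apply_length_limits(sample, max_isl=4096, max_osl=512):
        print(trace)
```

With `max_isl=4096`, a trace of exactly 4,096 input tokens is kept, because the documented condition is a strict `input_length > max_isl`.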