feat: trace-replay benchmark with aiperf_args passthrough (#18)
* feat: add aiperf_args passthrough and fix tokenizer for trace-replay
- Add --tokenizer-trust-remote-code to aiperf calls (fixes Kimi tokenizer)
- Install tiktoken if missing (required by Kimi's custom tokenizer)
- Add aiperf_args dict to BenchmarkConfig for passing extra aiperf CLI flags
- bench.sh accepts extra args after positional params
- Add production aiperf flags to kimi recipe (duration, timeout, workers, etc.)
- Increase file descriptor limit and add PYTHONUNBUFFERED for real-time logging
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: direct warmup artifacts to ARTIFACT_DIR instead of cwd
The aiperf warmup call was missing --artifact-dir, causing it to write
artifacts to the working directory (creating artifacts/ in repo root).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* test: add tests for aiperf_args passthrough in trace-replay
Verify that key-value args are passed as --key value flags and that
boolean args are passed as --flag (true) or omitted (false).
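
The mapping these tests check can be sketched as follows. This is a minimal illustration, not the code in this change: the function name build_extra_flags is hypothetical, the sketch assumes dict keys are used verbatim as flag names with no underscore-to-hyphen conversion, and placeholder-flag is a made-up key, not a real aiperf option.

```python
from typing import List, Mapping, Union

ArgValue = Union[str, int, float, bool]


def build_extra_flags(aiperf_args: Mapping[str, ArgValue]) -> List[str]:
    """Turn an aiperf_args-style dict into a flat list of CLI tokens.

    Hypothetical helper: key-value entries become "--key value",
    True becomes a bare "--key", and False is dropped entirely.
    """
    flags: List[str] = []
    for key, value in aiperf_args.items():
        flag = f"--{key}"  # assumption: keys are already spelled as the CLI expects
        # Check bool before anything else, since bool is a subclass of int.
        if isinstance(value, bool):
            if value:
                flags.append(flag)
        else:
            flags.extend([flag, str(value)])
    return flags


example = build_extra_flags(
    {
        "artifact-dir": "/tmp/artifacts",
        "tokenizer-trust-remote-code": True,
        "placeholder-flag": False,  # made-up key, omitted from the output
    }
)
```

Returning a list of tokens rather than a joined string avoids quoting problems if a value contains spaces and the result is handed to subprocess.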
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add aiperf_package config for controlling aiperf version
Add a benchmark.aiperf_package field that specifies the pip install spec
for aiperf (e.g., "aiperf>=0.7.0"). It is passed to bench.sh as the
AIPERF_PACKAGE env var, and bench.sh runs pip install --upgrade with it.
Defaults to "aiperf" if not set. tiktoken is always installed alongside.
Needed because container-bundled aiperf may predate fixes like
trust-remote-code propagation to pool workers (aiperf PR #744).
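
The install behaviour described above can be sketched as follows. bench.sh is a shell script, so this Python version only illustrates the logic: the function name aiperf_install_command is hypothetical, and the exact pip invocation (argument order, and treating an empty AIPERF_PACKAGE as unset) is an assumption.

```python
import os
from typing import List, Mapping, Optional


def aiperf_install_command(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Build the pip command implied by the AIPERF_PACKAGE env var.

    Hypothetical helper: falls back to plain "aiperf" when the variable
    is unset or empty, and always adds tiktoken.
    """
    env = os.environ if env is None else env
    package = env.get("AIPERF_PACKAGE") or "aiperf"
    return ["pip", "install", "--upgrade", package, "tiktoken"]


default_cmd = aiperf_install_command({})
pinned_cmd = aiperf_install_command({"AIPERF_PACKAGE": "aiperf>=0.7.0"})
```

Keeping the spec as a single list element means a version constraint such as "aiperf>=0.7.0" reaches pip intact, without the shell interpreting ">" as a redirect.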
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>