Command-line tool for distributed LLM inference benchmarks on SLURM clusters using SGLang. Replace complex shell scripts and 50+ CLI flags with declarative YAML configuration.
# Clone and install
git clone https://github.com/your-org/srtctl.git
cd srtctl
pip install -e .
# One-time setup (downloads NATS/ETCD, creates srtslurm.yaml)
make setup ARCH=aarch64 # or ARCH=x86_64

Full documentation: https://srtctl.gitbook.io/srtctl-docs/
- Installation - Setup and configuration
- Monitoring - Job logs and debugging
- Parameter Sweeps - Grid searches
- Profiling - Torch/nsys profiling
- Analyzing Results - Dashboard and visualization
- Accuracy Benchmarks - Running accuracy benchmarks
# Submit job(s)
srtctl apply -f config.yaml
# Submit with custom setup script
srtctl apply -f config.yaml --setup-script custom-setup.sh
# Submit with tags for filtering
srtctl apply -f config.yaml --tags experiment,baseline
# Dry-run (validate without submitting)
srtctl dry-run -f config.yaml
# Launch analysis dashboard
make dashboard
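
The `dry-run` and `apply` commands above can be chained so that a config is only submitted after it validates. The following is a minimal sketch, not part of srtctl itself: the helper name `validate_then_apply` and the `SRTCTL` override variable are hypothetical, and the sketch assumes that `srtctl dry-run` exits with a non-zero status when validation fails.

```shell
# Hypothetical helper: validate a config with dry-run, then submit it.
# SRTCTL can be overridden (e.g. for testing); it defaults to the srtctl binary.
SRTCTL="${SRTCTL:-srtctl}"

validate_then_apply() {
  # $1: path to the YAML config file
  config="$1"

  # Assumption: dry-run returns non-zero when the config is invalid.
  if ! "$SRTCTL" dry-run -f "$config"; then
    echo "validation failed, not submitting: $config" >&2
    return 1
  fi

  # Only reached when the dry-run succeeded.
  "$SRTCTL" apply -f "$config"
}
```

After sourcing the snippet, `validate_then_apply config.yaml` would run both steps in order and stop before submission if the dry-run reports an error.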