|
| 1 | +# benchmark_compiled report |
| 2 | + |
| 3 | +## Run |
| 4 | + |
| 5 | +``` |
| 6 | +run_gcs_dir=gs://grouping-data/runs/2026-04-10-12-39-45-large-no-prefix |
| 7 | +df_path=final_csvs/test_full2.csv |
| 8 | +stamp=2026-04-24-23-44-59 |
| 9 | +sample_size=66753 |
| 10 | +text_prefix='' |
| 11 | +model_kwargs={'dtype': torch.bfloat16, 'attn_implementation': 'sdpa'} |
| 12 | +``` |
| 13 | + |
| 14 | +- Token bucket boundaries used for analysis: `(64, 128, 256, 512, 1024)` |
| 15 | +- Rows: 66,753 |
| 16 | + |
| 17 | +## Headline |
| 18 | + |
| 19 | +- Median compiled: **14.7 ms** |
| 20 | +- Median base: **36.8 ms** |
| 21 | +- Per-row speedup p10/p50/p90: **1.01x / 2.51x / 3.45x** |
| 22 | +- Compiled wins on **92.6%** of rows |
| 23 | + |
| 24 | +## Per-bucket |
| 25 | + |
| 26 | +| bucket | n | tok_p50 | compiled_ms_p50 | base_ms_p50 | compiled_ms_p90 | base_ms_p90 | speedup_p50 | |
| 27 | +|----------|-------|---------|-----------------|-------------|-----------------|-------------|-------------| |
| 28 | +| <=64 | 8301 | 35.0 | 10.58 | 35.17 | 11.72 | 35.99 | 3.32 | |
| 29 | +| 65-128 | 7628 | 93.0 | 10.68 | 35.4 | 11.65 | 36.21 | 3.31 | |
| 30 | +| 129-256 | 13930 | 194.0 | 11.62 | 36.05 | 12.85 | 37.02 | 3.1 | |
| 31 | +| 257-512 | 14804 | 369.0 | 15.54 | 36.97 | 17.3 | 38.1 | 2.38 | |
| 32 | +| 513-1024 | 13858 | 682.0 | 26.9 | 38.86 | 28.97 | 40.85 | 1.44 | |
| 33 | +| >1024 | 8232 | 1494.5 | 48.06 | 47.04 | 115.66 | 115.89 | 0.98 | |
| 34 | + |
| 35 | +## Worst 5 rows for compiled |
| 36 | + |
| 37 | +| num_tokens | compiled_ms | base_ms | speedup | |
| 38 | +|------------|-------------|---------|---------| |
| 39 | +| 1252 | 54.8 | 43.26 | 0.789 | |
| 40 | +| 1043 | 50.24 | 42.14 | 0.839 | |
| 41 | +| 2020 | 82.6 | 70.98 | 0.859 | |
| 42 | +| 1122 | 49.67 | 42.91 | 0.864 | |
| 43 | +| 2040 | 79.23 | 69.04 | 0.871 | |
0 commit comments