For ultra-long input sequence lengths, GuideLLM can take long times to prepare the samples for the benchmark. The default multiplier of 1.5x of test duration is insufficient in this case and we will need to provide the ability to tweak the timeout duration here.