
Throughput statistics for min/max/stdev/percentiles do not give a good representation of the benchmark #602

@DenDiv

Bug Description

I am benchmarking Llama 3.1 8B on an NVIDIA H100 GPU. When running in concurrent mode, I observed highly inconsistent metrics:

  • A significant discrepancy between Mean and Median RPS and TPUT metrics.
  • The Standard Deviation (std) for the RPS metric reached ~900, even though the actual maximum RPS was around 50.
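The scale of that inflation is easy to reproduce with plain statistics (a toy illustration, not GuideLLM's actual samples): a single spurious ~10,000 RPS reading, as produced by a 1e-4 s interval, drags the mean and standard deviation far from the median, while the median stays at the true rate.

```python
import statistics

# Toy data: 99 honest per-interval RPS readings of 50, plus one spike of
# 10,000 RPS produced by a near-zero interval duration (1 / 1e-4).
samples = [50.0] * 99 + [10000.0]

print(statistics.mean(samples))    # 149.5 -- mean dragged far above the median
print(statistics.median(samples))  # 50.0  -- median stays at the true rate
print(statistics.stdev(samples))   # ~995  -- std near 1000, like the ~900 observed
```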

Observations

  • Console output: [screenshot]

Debug

I suspect the problem is caused by a very low merge threshold. In concurrent mode, many requests can finish almost simultaneously, so the measured durations can be as low as 1e-4 s in this line, which produces an inflated instantaneous rate. After setting the threshold to 1.0 I got much more stable results.
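A minimal sketch of that mechanism (the `interval_rates` function and `min_duration` parameter are hypothetical names, not GuideLLM's actual implementation): with no merging, two completions 1e-4 s apart produce a single ~10,000 RPS interval that dominates the statistics; merging intervals shorter than 1.0 s collapses the spike into its neighbors, and the distribution settles at ~50 RPS.

```python
import statistics

def interval_rates(timestamps, min_duration=0.0):
    """Per-interval completion rates between consecutive timestamps,
    merging any interval shorter than min_duration into the next one."""
    rates = []
    pending = 1                # completions accumulated in the open interval
    start = timestamps[0]
    for t in timestamps[1:]:
        gap = t - start
        if gap < min_duration:
            pending += 1       # too close together: merge into this interval
            continue
        rates.append(pending / gap)
        pending, start = 1, t
    return rates

# A steady ~50 RPS stream over 4 s, with one extra completion landing
# 1e-4 s after another (two requests finishing almost simultaneously).
ts = [i * 0.02 for i in range(201)]
ts.insert(51, ts[50] + 1e-4)

raw = interval_rates(ts)                       # threshold ~0: one ~10,000 RPS spike
merged = interval_rates(ts, min_duration=1.0)  # threshold 1.0: stable intervals

print(max(raw), statistics.stdev(raw))         # spike inflates std into the hundreds
print(max(merged), statistics.stdev(merged))   # ~50 RPS with a small std
```

Note that the rates themselves barely change under merging (pending count and gap grow together); only the near-zero-duration outlier disappears.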

Expected Behavior

Stable quantile metrics and low variance: Mean and Median RPS should agree and stay near the actual throughput (~50 RPS).

Steps to Reproduce

Below are the commands to reproduce the problem: start the model in SGLang, then run the benchmark scenario.

model start:

python3 -m sglang.launch_server \
  --model-path /models/Llama/Llama3.1-8B-Instruct/ \
  --tp=1 \
  --dp=1 \
  --enable-metrics \
  --disable-radix-cache

scenario.json:

{
  "profile": "concurrent",
  "rate": 50,
  "max_seconds": 10,
  "target": "http://localhost:30000",
  "data": "prompt_tokens=128,output_tokens=128",
  "processor": "/models/Llama/Llama3.1-8B-Instruct/"
}

Operating System

Ubuntu 22.04

Python Version

Python 3.12.12

GuideLLM Version

guidellm version: 0.6.0.dev75

Installation Method

pip install guidellm

Installation Details

No response

Error Messages or Stack Traces

No response

Additional Context

No response

Labels

community contribution (An opportunity for contribution from the GuideLLM community already invested in this area), priority-low
