Description
In miles/rollout/sglang_rollout.py:571, the example logging assumes sample is a single Sample:
if do_print:
logger.info(
"eval_rollout_single_dataset example data: "
f"{[str(sample.prompt) + sample.response]} "
f"reward={sample.reward}"
)
However, the isinstance(sample, list) branch a few lines below shows that generate_and_rm can return list[Sample] (multi-turn / multi-agent case). When it does, eval crashes.
Reproduction
bash examples/multi_agent/run-qwen3-30B-A3B-multi-agent.sh
Stack trace
File "miles/rollout/sglang_rollout.py", line 571, in eval_rollout_single_dataset
f"{[str(sample.prompt) + sample.response]} "
AttributeError: 'list' object has no attribute 'prompt'
Suggested fix
example = sample[0] if isinstance(sample, list) else sample
Description
In
miles/rollout/sglang_rollout.py:571, the example logging assumessampleis a singleSample:However, the
isinstance(sample, list)branch a few lines below shows thatgenerate_and_rmcan returnlist[Sample](multi-turn / multi-agent case). When it does, eval crashes.Reproduction
Stack trace
Suggested fix