Commit f5ed4bf

pbielak and Piotr Bielak authored
Fix run_clm.py for streaming datasets (#2309)
Co-authored-by: Piotr Bielak <pbielak@habana.ai>
1 parent bdd3879 commit f5ed4bf

1 file changed: +1 -1 lines changed

examples/language-modeling/run_clm.py

Lines changed: 1 addition & 1 deletion
@@ -766,7 +766,7 @@ def compute_metrics(eval_preds):
         metrics = trainer.evaluate()
 
         if data_args.streaming:
-            metrics["eval_samples"] = max_eval_samples
+            metrics["eval_samples"] = training_args.max_steps * training_args.per_device_eval_batch_size
         else:
             max_eval_samples = (
                 data_args.max_eval_samples if data_args.max_eval_samples is not None else len(eval_dataset)
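
For illustration only, here is a minimal standalone sketch of the sample-count logic this hunk touches. As far as the hunk shows, the streaming branch previously read max_eval_samples, which is only assigned in the else branch; the change derives the count from max_steps and per_device_eval_batch_size instead. The helper name eval_sample_count and its plain-argument signature are hypothetical and not part of run_clm.py. The hunk is also cut off before it shows what the non-streaming branch does after computing max_eval_samples, so the sketch simply returns that value.

```python
from typing import Optional, Sized


def eval_sample_count(
    streaming: bool,
    max_steps: int,
    per_device_eval_batch_size: int,
    max_eval_samples: Optional[int] = None,
    eval_dataset: Optional[Sized] = None,
) -> int:
    """Hypothetical helper mirroring the branch changed in this commit."""
    if streaming:
        # A streaming dataset is not assumed to support len(), so the count
        # is derived from the step budget and batch size, as in the added line.
        return max_steps * per_device_eval_batch_size
    if eval_dataset is None:
        raise ValueError("eval_dataset is required when streaming is False")
    # Non-streaming branch: use the explicit cap if set, else the dataset size.
    return max_eval_samples if max_eval_samples is not None else len(eval_dataset)


if __name__ == "__main__":
    print(eval_sample_count(True, max_steps=100, per_device_eval_batch_size=8))
    print(eval_sample_count(False, 100, 8, eval_dataset=[0, 1, 2]))
```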
