This repository was archived by the owner on Apr 30, 2026. It is now read-only.

Commit 728b858

Generate mmlu bench data with the original samples
The generated samples contain excess data that isn't required and can clash with variables created in pipelines and data mixing.

Fixes #202

Co-authored-by: Ben Browning <bbrownin@redhat.com>
Signed-off-by: Derek Higgins <derekh@redhat.com>
1 parent 3904fef commit 728b858

1 file changed

Lines changed: 1 addition & 1 deletion

src/instructlab/sdg/generate_data.py

@@ -384,7 +384,7 @@ def generate_data(
 generate_eval_task_data(
     mmlu_bench_pipe,
     leaf_node_path,
-    new_generated_data,
+    ds,
     output_dir,
     date_suffix,
 )
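
The clash described in the commit message can be illustrated with a toy sketch. Everything here is hypothetical: the `run_step` helper, the column names (`document`, `domain`, `question`, `response`), and the choice to raise on a collision are assumptions made for illustration and are not taken from `instructlab.sdg`. The sketch only shows why feeding a pipeline the original samples, rather than samples that already carry columns from an earlier generation pass, avoids name collisions.

```python
# Hypothetical illustration; none of these names come from instructlab.sdg.

def run_step(sample: dict, output_column: str, value: str) -> dict:
    """Toy pipeline step: adds one output column and refuses to overwrite."""
    if output_column in sample:
        raise ValueError(f"column {output_column!r} already exists")
    return {**sample, output_column: value}


# Original seed sample: only the fields the benchmark pipeline needs (assumed).
original = {"document": "some text", "domain": "science"}

# The same sample after an earlier pipeline ran: extra columns were added (assumed).
generated = {**original, "question": "Q?", "response": "A."}

# Running the step on the original sample works.
ok = run_step(original, "question", "What is ...?")

# Running it on the already-generated sample collides with the existing column.
try:
    run_step(generated, "question", "What is ...?")
    clashed = False
except ValueError:
    clashed = True
```

In this sketch, passing `original` plays the role of passing `ds` in the diff above, while passing `generated` plays the role of `new_generated_data`. How the real pipeline behaves on a collision (error, silent overwrite, or something else) is not stated in the commit.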
