Commit 3560ae9

Use more shards.
1 parent d708e77 commit 3560ae9

1 file changed

Lines changed: 1 addition & 1 deletion

experiments/dedup/fineweb_10bt_exact.py

@@ -36,7 +36,7 @@ def build_steps() -> list[StepSpec]:
         fn=lambda op: dedup_exact_paragraph(
             input_paths=os.path.join(download.output_path, "sample/10BT"),
             output_path=op,
-            max_parallelism=4,
+            max_parallelism=128,
         ),
     )
     return [download, dedup_step]
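
Note on the change: the diff does not show how `dedup_exact_paragraph` uses `max_parallelism`, so the following is only a minimal sketch of one common way a shard count can enter exact paragraph deduplication. It assumes paragraphs are routed to shards by a hash of their exact text. The names `shard_of` and `dedup_exact_sharded` are hypothetical and are not taken from this repository.

```python
import hashlib
from collections import defaultdict


def shard_of(paragraph: str, num_shards: int) -> int:
    """Map a paragraph to a shard index by hashing its exact text (assumed scheme)."""
    digest = hashlib.sha256(paragraph.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def dedup_exact_sharded(paragraphs: list[str], num_shards: int) -> list[str]:
    """Keep the first occurrence of each paragraph, deduplicating shard by shard."""
    # Identical paragraphs hash to the same shard, so shards never need to
    # communicate with each other.
    shards = defaultdict(list)
    for index, text in enumerate(paragraphs):
        shards[shard_of(text, num_shards)].append((index, text))

    kept = []
    for items in shards.values():  # each shard could be given to its own worker
        seen = set()
        for index, text in items:
            if text not in seen:
                seen.add(text)
                kept.append((index, text))

    kept.sort()  # restore the original input order
    return [text for _, text in kept]


if __name__ == "__main__":
    sample = ["a", "b", "a", "c", "b"]
    # The shard count changes how the work is split, not the result.
    print(dedup_exact_sharded(sample, 4) == dedup_exact_sharded(sample, 128))
```

Under this assumed scheme, going from 4 to 128 shards would leave the deduplicated output unchanged while making each shard smaller and allowing more workers to run at once.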
