Skip to content

Commit 063692b

Browse files
committed
prepare for release
1 parent e357ecb commit 063692b

File tree

2 files changed

+4
-12
lines changed

2 files changed

+4
-12
lines changed

src/examples/bolmo/compute_entropies.py

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -12,10 +12,6 @@
1212
--batch-size 32 \
1313
--sequence-length 2048
1414
15-
Environment variables:
16-
DATA_SOURCE: One of 'dclm', 'dolmino', 'dolma2_code_string', 'dolmino_code_string', 'tulu3'
17-
HAS_WEKA: Set to '1' if running on Weka filesystem
18-
1915
For each input shard like "part-0-00000.npy", this will create:
2016
- output_dir/entropy/path/to/part-0-00000.npy (entropy values as raw binary float16)
2117
- output_dir/cross_entropy/path/to/part-0-00000.npy (cross-entropy values as raw binary float16)

src/examples/bolmo/instructify.py

Lines changed: 4 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -33,14 +33,10 @@
3333

3434
def parse_args():
3535
parser = argparse.ArgumentParser(description="Instructify OLMo model merging script")
36-
parser.add_argument("--checkpoint-dir", type=str, default="/weka/oe-training-default/benjaminm/runs/stage2n_hnet_v8-e1d4-w-ee-no-sm-150k-half-local-lr_wd01gc05/step150000",
37-
help="Path to the main checkpoint directory")
38-
parser.add_argument("--output", type=str, default="/weka/oe-training-default/benjaminm/merges/stage2n_hnet_v8-e1d4-w-ee-no-sm-150k-half-local-lr_wd01gc05_zeroshot_instruct_alpha1",
39-
help="Path to output directory")
40-
parser.add_argument("--base-checkpoint-dir", type=str, default="/weka/oe-training-default/benjaminm/checkpoints/olmo2_1b",
41-
help="Path to base checkpoint directory")
42-
parser.add_argument("--instruct-checkpoint-dir", type=str, default="/weka/oe-training-default/benjaminm/checkpoints/olmo2_1b_instruct",
43-
help="Path to instruct checkpoint directory")
36+
parser.add_argument("--checkpoint-dir", type=str, help="Path to the main checkpoint directory")
37+
parser.add_argument("--output", type=str, help="Path to output directory")
38+
parser.add_argument("--base-checkpoint-dir", type=str, help="Path to base checkpoint directory")
39+
parser.add_argument("--instruct-checkpoint-dir", type=str, help="Path to instruct checkpoint directory")
4440
parser.add_argument("--include-instruct-teacher", action="store_true",
4541
default=True,
4642
help="Include instruct teacher in the merged model")

0 commit comments

Comments
 (0)