1 parent 86f75c7 commit 6ce5dd2
1 file changed
pretrain.py
@@ -1,9 +1,9 @@
"""
-Continues MLM pretraining of a base encoder (default: ModernBERT-large) on Sentry-grouping LLM analyses
-(per-row `prompt` + `thinking_output` + `response_output`, joined with the tokenizer's sep_token).
+Continues MLM pretraining of a base encoder on Sentry-grouping LLM prompts and completions:
+`prompt[SEP]thinking_output[SEP]response_output`

-Logs to wandb. Writes checkpoints + the final model to GCS. Unlike `train.py`, there's no async eval — the MLM loss
-in wandb is the only training-time signal.
+Logs to wandb. Writes checkpoints + the final model to GCS. Unlike `train.py`, there's no async eval. Just MLM loss on a
+subsample of val data run sync.

import logging
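
For readers of this diff, the row format named in the docstring (`prompt[SEP]thinking_output[SEP]response_output`) can be illustrated with a minimal sketch. This is not code from `pretrain.py`: the helper name `build_mlm_text` and the hard-coded `"[SEP]"` default are assumptions made for illustration. The previous docstring says the fields are joined with the tokenizer's sep_token, so a real script would presumably pass that value in rather than a literal.

```python
# Hypothetical helper, not taken from pretrain.py: joins the three per-row
# fields named in the docstring into a single string for MLM pretraining.
FIELDS = ("prompt", "thinking_output", "response_output")


def build_mlm_text(row: dict, sep_token: str = "[SEP]") -> str:
    """Join prompt, thinking_output, and response_output with sep_token.

    The "[SEP]" default is an assumption; the docstring in the diff implies
    the separator comes from the tokenizer's sep_token.
    """
    return sep_token.join(str(row[name]) for name in FIELDS)


# Made-up example row, for illustration only.
example_row = {
    "prompt": "Are these two stack traces the same issue?",
    "thinking_output": "Both fail in the same frame.",
    "response_output": "Yes",
}
text = build_mlm_text(example_row)
```

Keeping the separator as a parameter means the same helper would work for any base encoder whose tokenizer defines a different separator string.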