Commit b891f0d

Delaunay and pierre.delaunay authored
Update RLHF name (#275)
* new RLHF benchmark
* Add RLHF config to standard
* Fix

---------

Co-authored-by: pierre.delaunay <[email protected]>
1 parent 5672f16 commit b891f0d

4 files changed, +7 -14 lines changed

benchmarks/rlhf/dev.yaml

Lines changed: 3 additions & 4 deletions
@@ -1,5 +1,5 @@
 
-rlhf_:
+_rlhf:
   inherits: _defaults
   definition: .
   install-variant: unpinned
@@ -17,13 +17,12 @@ rlhf_:
 
 
 rlhf-single:
-  inherits: rlhf_
+  inherits: _rlhf
   plan:
     method: per_gpu
 
-
 rlhf-gpus:
-  inherits: rlhf_
+  inherits: _rlhf
   plan:
     method: njobs
     n: 1
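
Review note: the rename only works if every `inherits:` reference is updated with it, which is why this hunk touches three entries at once. The resolver that milabench actually uses is not part of this commit, so the following is only a minimal sketch under the assumption that `inherits` is resolved by recursively merging a parent entry into its child. The helper names `deep_merge` and `resolve` are hypothetical, and the empty `_defaults` entry is a placeholder because its contents are not shown in the diff.

```python
from copy import deepcopy


def deep_merge(base, override):
    """Return a new dict in which values from `override` win over `base`."""
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Merge nested mappings key by key instead of replacing them.
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def resolve(name, entries):
    """Follow the `inherits` chain of `name` (hypothetical resolver, no cycle check)."""
    # A stale parent name (for example `rlhf_` after the rename) raises KeyError here.
    entry = entries[name]
    parent = entry.get("inherits")
    own = {k: v for k, v in entry.items() if k != "inherits"}
    if parent is None:
        return deepcopy(own)
    return deep_merge(resolve(parent, entries), own)


# Entries mirroring the post-commit dev.yaml; `_defaults` is a placeholder.
entries = {
    "_defaults": {},
    "_rlhf": {
        "inherits": "_defaults",
        "definition": ".",
        "install-variant": "unpinned",
    },
    "rlhf-single": {"inherits": "_rlhf", "plan": {"method": "per_gpu"}},
    "rlhf-gpus": {"inherits": "_rlhf", "plan": {"method": "njobs", "n": 1}},
}

resolved = resolve("rlhf-gpus", entries)
print(resolved)
```

Under this assumed model, leaving a single `inherits: rlhf_` behind would fail at lookup time rather than silently fall back to `_defaults`.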

benchmarks/rlhf/main.py

Lines changed: 1 addition & 4 deletions
@@ -118,11 +118,8 @@ def tokenize(element):
         train_dataset=train_dataset,
         eval_dataset=eval_dataset,
     )
+
     trainer.train()
-    trainer.save_model(config.output_dir)
-    if config.push_to_hub:
-        trainer.push_to_hub()
-    trainer.generate_completions()
 
 
 if __name__ == "__main__":

config/base.yaml

Lines changed: 3 additions & 3 deletions
@@ -777,7 +777,7 @@ llava-gpus:
     --gradient_accumulation_steps: 1
 
 
-rlhf_:
+_rlhf:
   inherits: _defaults
   definition: ../benchmarks/rlhf
   install-variant: unpinned
@@ -796,12 +796,12 @@ rlhf_:
     --bf16: true
 
 rlhf-single:
-  inherits: rlhf_
+  inherits: _rlhf
   plan:
     method: per_gpu
 
 rlhf-gpus:
-  inherits: rlhf_
+  inherits: _rlhf
   plan:
     method: njobs
     n: 1

scripts/article/run_cuda.sh

Lines changed: 0 additions & 3 deletions
@@ -76,9 +76,6 @@ else
     . $MILABENCH_WORDIR/env/bin/activate
 fi
 
-
-export MILABENCH_CONFIG=/home/mila/d/delaunap/milabench/benchmarks/rlhf/dev.yaml
-
 if [ "$MILABENCH_PREPARE" -eq 0 ]; then
     cd $MILABENCH_WORDIR
 
