-
Notifications
You must be signed in to change notification settings - Fork 20
Description
Hello, I have faced a problem and have not seen it in the issues.
(llm_unlearn) root@5e0bce17bb64:/home/chixu/ELEC631/llm_unlearn# python unlearn_harm.py --model_name=facebook/opt-1.3b --model_save_dir=models/opt1.3b_unlearned --log_file=logs/opt-1.3b-unlearn.log
Traceback (most recent call last):
File "/home/chixu/ELEC631/llm_unlearn/unlearn_harm.py", line 228, in
main(args)
File "/home/chixu/ELEC631/llm_unlearn/unlearn_harm.py", line 56, in main
train_dataset = load_dataset("PKU-Alignment/PKU-SafeRLHF", split="330k_train")
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/load.py", line 2166, in load_dataset
ds = builder_instance.as_dataset(split=split, verification_mode=verification_mode, in_memory=keep_in_memory)
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/builder.py", line 1190, in as_dataset
datasets = map_nested(
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/utils/py_utils.py", line 456, in map_nested
return function(data_struct)
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/builder.py", line 1220, in _build_single_dataset
ds = self._as_dataset(
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/builder.py", line 1294, in _as_dataset
dataset_kwargs = ArrowReader(cache_dir, self.info).read(
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/arrow_reader.py", line 240, in read
files = self.get_file_instructions(name, instructions, split_infos)
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/arrow_reader.py", line 213, in get_file_instructions
file_instructions = make_file_instructions(
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/arrow_reader.py", line 130, in make_file_instructions
absolute_instructions = instruction.to_absolute(name2len)
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/arrow_reader.py", line 653, in to_absolute
return [_rel_to_abs_instr(rel_instr, name2len) for rel_instr in self._relative_instructions]
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/arrow_reader.py", line 653, in
return [_rel_to_abs_instr(rel_instr, name2len) for rel_instr in self._relative_instructions]
File "/home/chixu/anaconda3/envs/llm_unlearn/lib/python3.10/site-packages/datasets/arrow_reader.py", line 465, in _rel_to_abs_instr
raise ValueError(f'Unknown split "{split}". Should be one of {list(name2len)}.')
ValueError: Unknown split "330k_train". Should be one of ['train', 'test'].