-
Notifications
You must be signed in to change notification settings - Fork 13
Description
I‘m confused about the " Couldn't reach 'videofolder' on the Hub (ConnectionError)".
[rank0]:[W1118 08:51:00.973446097 ProcessGroupNCCL.cpp:4115] [PG ID 0 PG GUID 0 Rank 0] using GPU 0 to perform barrier as devices used by this process are currently unknown. This can potentially cause a hang if this rank to GPU mapping is incorrect.Specify device_ids in barrier() to force use of a particular device,or call init_process_group() with a device_id.
You set add_prefix_space. The tokenizer needs to be converted from the slow tokenizers
torch_dtype is deprecated! Use dtype instead!
Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:20<00:00, 5.11s/it]
Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:00<00:00, 5.77it/s]
[rank0]: 2025-11-18 08:51:24 | i2v-train | INFO | Checking settings...
[rank0]: 2025-11-18 08:51:24 | i2v-train | INFO | Initializing models...
[rank0]: 2025-11-18 08:51:24 | i2v-train | INFO | Initializing dataset and dataloader...
[rank0]: Traceback (most recent call last):
[rank0]: File "/mnt/afs/250010063/Codes/SurVGen/train.py", line 35, in
[rank0]: main()
[rank0]: File "/mnt/afs/250010063/Codes/SurVGen/train.py", line 29, in main
[rank0]: trainer.fit()
[rank0]: File "/mnt/afs/250010063/Codes/SurVGen/cogkit/finetune/base/base_trainer.py", line 131, in fit
[rank0]: self.prepare_dataset()
[rank0]: File "/mnt/afs/250010063/Codes/SurVGen/cogkit/finetune/diffusion/trainer.py", line 103, in prepare_dataset
[rank0]: self.train_dataset = dataset_cls(
[rank0]: File "/mnt/afs/250010063/Codes/SurVGen/cogkit/finetune/datasets/i2v_dataset.py", line 252, in init
[rank0]: super().init(*args, **kwargs)
[rank0]: File "/mnt/afs/250010063/Codes/SurVGen/cogkit/finetune/datasets/i2v_dataset.py", line 66, in init
[rank0]: video_data = load_dataset("videofolder", data_dir=video_path, split="train")
[rank0]: File "/mnt/afs/250010063/anconda/envs/Hal_Med310/lib/python3.10/site-packages/datasets/load.py", line 2587, in load_dataset
[rank0]: builder_instance = load_dataset_builder(
[rank0]: File "/mnt/afs/250010063/anconda/envs/Hal_Med310/lib/python3.10/site-packages/datasets/load.py", line 2259, in load_dataset_builder
[rank0]: dataset_module = dataset_module_factory(
[rank0]: File "/mnt/afs/250010063/anconda/envs/Hal_Med310/lib/python3.10/site-packages/datasets/load.py", line 1910, in dataset_module_factory
[rank0]: raise e1 from None
[rank0]: File "/mnt/afs/250010063/anconda/envs/Hal_Med310/lib/python3.10/site-packages/datasets/load.py", line 1843, in dataset_module_factory
[rank0]: raise ConnectionError(f"Couldn't reach '{path}' on the Hub ({type(e).name})")
[rank0]: ConnectionError: Couldn't reach 'videofolder' on the Hub (ConnectionError)