Hi, I installed everything with the following commands:
git clone https://github.com/mosaicml/diffusion.git
cd diffusion
pip install -e .
I then tried to run fid-clip-evaluation.py and got the following error:
Traceback (most recent call last):
  File "diffusion/scripts/fid-clip-evaluation.py", line 39, in <module>
    coco_val_dataloader = build_streaming_cocoval_dataloader(
  File "diffusion/diffusion/datasets/coco/coco_captions.py", line 110, in build_streaming_cocoval_dataloader
    dataset = StreamingCOCOCaption(
  File "diffusion/diffusion/datasets/coco/coco_captions.py", line 60, in __init__
    super().__init__(
  File "python3.9/site-packages/streaming/base/dataset.py", line 496, in __init__
    self._shm_prefix_int, self._locals_shm = get_shm_prefix(streams_local, streams_remote,
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 189, in get_shm_prefix
    prefix_int = _check_and_find_retrying(streams_local, streams_remote, retry)
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 162, in _check_and_find_retrying
    raise errs[-1]
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 158, in _check_and_find_retrying
    return _check_and_find(streams_local, streams_remote)
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 115, in _check_and_find
    their_locals, _ = _unpack_locals(bytes(shm.buf))
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 75, in _unpack_locals
    return text[:-1], int(text[-1] or 0)
ValueError: invalid literal for int() with base 10: '/tmp/mds-cache/mds-coco-2014-val-fid-clip-17'
ERROR:composer.cli.launcher:Rank 0 crashed with exit code 1.
Waiting up to 30 seconds for all training processes to terminate. Press Ctrl-C to exit immediately.
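For anyone reading the traceback: the last frame calls int() on the final field of a list unpacked from shared memory, and here that field is a cache path rather than a number. Below is a minimal sketch of that failure mode. The function name unpack_locals_sketch and the NUL-delimited buffer layout are assumptions made for illustration, not the actual streaming implementation; only the return expression is copied from the traceback.

```python
def unpack_locals_sketch(data: bytes):
    """Hypothetical stand-in for the failing helper.

    Assumption: the buffer holds NUL-delimited UTF-8 fields, and the last
    field is expected to be an integer (or empty, which is treated as 0).
    """
    text = data.decode("utf-8").split("\0")
    # Same expression as the last frame of the traceback.
    return text[:-1], int(text[-1] or 0)


# A buffer in the assumed layout: two paths followed by an integer field.
good = b"/tmp/a\0/tmp/b\0" + b"17"
print(unpack_locals_sketch(good))

# A buffer whose last field is a path instead of an integer fails the
# same way as in the report.
bad = b"/tmp/a\0/tmp/mds-cache/mds-coco-2014-val-fid-clip-17"
try:
    unpack_locals_sketch(bad)
except ValueError as err:
    print(f"ValueError: {err}")
```

If this reading is right, the buffer being parsed does not have the layout the installed streaming version expects, which would be consistent with the error disappearing once the package versions are changed.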
The error goes away after installing older versions of these packages:
pip install mosaicml==0.14.1
pip install mosaicml-streaming==0.5.0
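To confirm which versions are actually active in the environment before and after the downgrade, a small check with the standard library's importlib.metadata can help. This is only a sketch: the pinned versions are the ones from this report, and the helper name installed_versions is illustrative.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions that made the error go away in this report.
PINNED = {"mosaicml": "0.14.1", "mosaicml-streaming": "0.5.0"}


def installed_versions(packages):
    """Return {package name: installed version, or None if not installed}."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, have in installed_versions(PINNED).items():
        want = PINNED[name]
        status = "ok" if have == want else "differs"
        print(f"{name}: installed={have} pinned={want} ({status})")
```

Running it in the same environment that launches the evaluation script shows whether the pinned versions are the ones being imported.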