diffusion icon indicating copy to clipboard operation
diffusion copied to clipboard

Bug report: ValueError: invalid literal for int() with base 10: '/tmp/mds-cache/mds-coco-2014-val-fid-clip-17'

Open viyjy opened this issue 2 years ago • 0 comments

Hi, after installing everything by following these commands

git clone https://github.com/mosaicml/diffusion.git
cd diffusion
pip install -e .

I was trying to run fid-clip-evaluation.py but got the following error:

Traceback (most recent call last):
  File "diffusion/scripts/fid-clip-evaluation.py", line 39, in <module>
    coco_val_dataloader = build_streaming_cocoval_dataloader(
  File "diffusion/diffusion/datasets/coco/coco_captions.py", line 110, in build_streaming_cocoval_dataloader
    dataset = StreamingCOCOCaption(
  File "diffusion/diffusion/datasets/coco/coco_captions.py", line 60, in __init__
    super().__init__(
  File "python3.9/site-packages/streaming/base/dataset.py", line 496, in __init__
    self._shm_prefix_int, self._locals_shm = get_shm_prefix(streams_local, streams_remote,
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 189, in get_shm_prefix
    prefix_int = _check_and_find_retrying(streams_local, streams_remote, retry)
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 162, in _check_and_find_retrying
    raise errs[-1]
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 158, in _check_and_find_retrying
    return _check_and_find(streams_local, streams_remote)
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 115, in _check_and_find
    their_locals, _ = _unpack_locals(bytes(shm.buf))
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 75, in _unpack_locals
    return text[:-1], int(text[-1] or 0)
ValueError: invalid literal for int() with base 10: '/tmp/mds-cache/mds-coco-2014-val-fid-clip-17'
ERROR:composer.cli.launcher:Rank 0 crashed with exit code 1.
Waiting up to 30 seconds for all training processes to terminate. Press Ctrl-C to exit immediately.

This error is fixed after installing old packages:

pip install mosaicml==0.14.1
pip install mosaicml-streaming==0.5.0

viyjy avatar Oct 19 '23 19:10 viyjy