Read timed out error when downloading dataset #9

@GeetikaSi

Hi, I attempted to download version 0_1_0a2 of workflow-calcium-imaging-test-set on macOS Big Sur (version 11.5.2) using the following command:
djarchive download workflow-calcium-imaging-test-set 0_1_0a2 ~/workflow-calcium-imaging-test-set\ 0_1_0a2
After a subset of the data downloaded successfully, I ran into the following error:

Error:

Traceback (most recent call last):
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/urllib3/response.py", line 438, in _error_catcher
    yield
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/urllib3/response.py", line 515, in read
    data = self._fp.read() if not fp_closed else b""
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/http/client.py", line 472, in read
    s = self._safe_read(self.length)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/http/client.py", line 613, in _safe_read
    data = self.fp.read(amt)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/socket.py", line 704, in readinto
    return self._sock.recv_into(b)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/ssl.py", line 1241, in recv_into
    return self.read(nbytes, buffer)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/ssl.py", line 1099, in read
    return self._sslobj.read(len, buffer)
socket.timeout: The read operation timed out

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/opt/anaconda3/envs/djarchive/bin/djarchive", line 197, in <module>
    actions[action][0](*sys.argv[2:])
  File "/opt/anaconda3/envs/djarchive/bin/djarchive", line 115, in download
    nfound, nerr = client().download(
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/djarchive_client/__init__.py", line 520, in download
    self.fget_object(spath, lpath, display_progress)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/djarchive_client/__init__.py", line 565, in fget_object
    fh.write(dat.data)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/urllib3/response.py", line 300, in data
    return self.read(cache_content=True)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/urllib3/response.py", line 541, in read
    raise IncompleteRead(self._fp_bytes_read, self.length_remaining)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/contextlib.py", line 135, in __exit__
    self.gen.throw(type, value, traceback)
  File "/opt/anaconda3/envs/djarchive/lib/python3.9/site-packages/urllib3/response.py", line 443, in _error_catcher
    raise ReadTimeoutError(self._pool, None, "Read timed out.")
urllib3.exceptions.ReadTimeoutError: HTTPSConnectionPool(host='s3.djhub.io', port=443): Read timed out.

When I rerun the command, the download resumes but hits the same error after a couple of hours. Is there a different way I should approach the download? Thank you!
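
One possible workaround, offered only as a sketch and not as a djarchive feature: because rerunning the command resumes the download, the rerun can be automated with a small retry loop. The helper names `run_until_success` and `djarchive_download` are hypothetical, and the backoff values are arbitrary. The sketch assumes the `djarchive` CLI exits with a nonzero status when it ends in a traceback like the one above (as an uncaught Python exception normally does). It has not been checked whether the CLI also reports per-file errors through its exit status.

```python
import os
import subprocess
import time
from typing import Callable


def run_until_success(
    run_once: Callable[[], int],
    max_attempts: int = 10,
    base_delay: float = 30.0,
    max_delay: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call run_once until it returns 0; return the number of attempts used.

    Raises RuntimeError if every attempt returns a nonzero status.
    """
    for attempt in range(1, max_attempts + 1):
        if run_once() == 0:
            return attempt
        if attempt < max_attempts:
            # Exponential backoff between attempts, capped at max_delay seconds.
            sleep(min(base_delay * 2 ** (attempt - 1), max_delay))
    raise RuntimeError(f"command still failing after {max_attempts} attempts")


def djarchive_download(dataset: str, version: str, target: str) -> int:
    """Run the same CLI command as in the report and return its exit status."""
    command = ["djarchive", "download", dataset, version, target]
    return subprocess.run(command).returncode


if __name__ == "__main__":
    # Same dataset, version, and target directory as in the report above.
    target_dir = os.path.expanduser("~/workflow-calcium-imaging-test-set 0_1_0a2")
    attempts = run_until_success(
        lambda: djarchive_download(
            "workflow-calcium-imaging-test-set", "0_1_0a2", target_dir
        )
    )
    print(f"download finished after {attempts} attempt(s)")
```

This only works around the symptom: each timeout still interrupts the transfer, so the underlying read timeout against s3.djhub.io would still need to be addressed in the client or on the server.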
