Get resource wait configuration

Open mihail-ivanov opened this issue 9 years ago • 2 comments

  • Add configuration for DOWNLOAD_TIMEOUT
  • Add configuration and retry logic for get_resource in the push_to_datastore job. If the uploaded resource is too big or the server is under heavy load, this call fails, which produces a lot of errors in the log.
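
For illustration, the retry idea for get_resource might look roughly like the sketch below. This is not the actual patch: `call_with_retries`, its `retries` and `wait` parameters, and the `sleep` hook are hypothetical names chosen for the example, and the real change may be structured differently.

```python
import time


def call_with_retries(func, retries=3, wait=0.0, sleep=time.sleep):
    """Call func() up to `retries` times, pausing `wait` seconds between attempts.

    Hypothetical helper: the names and defaults are assumptions for this sketch.
    """
    if retries < 1:
        raise ValueError("retries must be at least 1")
    last_error = None
    for attempt in range(1, retries + 1):
        try:
            return func()
        except Exception as error:  # broad on purpose; narrow this in real code
            last_error = error
            if attempt < retries:
                sleep(wait)  # give a loaded server time to recover
    # Every attempt failed: surface the last error to the caller.
    raise last_error
```

In the job, the metadata call would then be wrapped along the lines of `call_with_retries(lambda: get_resource(...), retries=..., wait=...)`, with both values read from configuration.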

mihail-ivanov avatar Sep 14 '16 05:09 mihail-ivanov

get_resource just gets the resource metadata, not the actual file that resource['url'] points to, which is the one that could potentially be big. So your improvements (which make sense) should apply to this request, unless I'm missing something?

amercader avatar Feb 24 '17 12:02 amercader

In our case the code was failing in both places. If the uploaded file was too big, get_resource needed more time to return the metadata, so we added several retries for that call. After that, downloading the data for this file was taking more than 30 seconds, so we added a configuration option for DOWNLOAD_TIMEOUT.
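
As a rough sketch of the second part, a DOWNLOAD_TIMEOUT setting could be read like this. The `get_download_timeout` helper and the dict-style `config` are assumptions made for the example, and the 30-second default simply mirrors the limit mentioned above. It does not show how the actual change reads its settings.

```python
DEFAULT_DOWNLOAD_TIMEOUT = 30  # seconds; assumed default, taken from the limit above


def get_download_timeout(config, default=DEFAULT_DOWNLOAD_TIMEOUT):
    """Return the download timeout in seconds from a dict-like config.

    Hypothetical helper: only the DOWNLOAD_TIMEOUT key comes from the issue.
    """
    timeout = int(config.get("DOWNLOAD_TIMEOUT", default))
    if timeout <= 0:
        raise ValueError("DOWNLOAD_TIMEOUT must be a positive number of seconds")
    return timeout
```

The returned value would then be passed as the timeout of whatever HTTP call downloads the file behind resource['url'].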

mihail-ivanov avatar Feb 24 '17 15:02 mihail-ivanov