weather-tools icon indicating copy to clipboard operation
weather-tools copied to clipboard

Retry downloading a shard when file sizes are zero.

Open alxmrs opened this issue 2 years ago • 3 comments

Sometimes, files downloaded by weather-dl are created in GCS, but contain zero bytes. To address this, we could:

  • Add a check after fetch is complete that checks the file size in the bucket. If zero, retry the DL.
  • When we check if the file exists or needs to be skipped, we should also check the file size.

Workarounds: Users can make use of gsutil or equivalent tools to find all empty files. Then, they can delete these and re-download (with a separate invocation).

alxmrs avatar Nov 23 '22 20:11 alxmrs

@alxmrs I would like to work on this, could you please assign this to me

nagavenkateshgavini avatar Mar 03 '24 00:03 nagavenkateshgavini

I think @fredzyda can help you with the assignment. Though, I don’t think any one else is working on it right now, feel free to take a crack at the implementation. :)

Before you do, check with @mahrsee to see if this is still an issue — it may have been fixed in wdl2.

alxmrs avatar Mar 03 '24 01:03 alxmrs

We haven't observed this issue recently, and it's not implemented in WDL and WDLv2. However, implementing this would serve as a great safety check, so we should proceed with it.

@nagavenkateshgavini,

Pointers for WDL:

Pointers for WDLv2:

  • We can incorporate validation logic in main().

mahrsee1997 avatar Mar 08 '24 09:03 mahrsee1997