
Unexpected EOF Errors During Actinopterygii Genomes Download in RefSeq

Open mkrg01 opened this issue 7 months ago • 16 comments

Background:

I encountered multiple "unexpected EOF" errors while attempting to download the RefSeq genomes of Actinopterygii (taxon ID: 7898) using datasets version 15.30.0.

Steps to Reproduce:

  1. Initial Download Command:

    datasets download genome taxon 7898 --dehydrated --reference --annotated --include gbff --assembly-source RefSeq --filename data/raw_data/Actinopterygii_dataset.zip
    
  2. Unzipping the Package:

    unzip data/raw_data/Actinopterygii_dataset.zip -d data/raw_data/Actinopterygii_dataset
    
  3. Rehydration Process (Error Occurs Here):

    datasets rehydrate --directory data/raw_data/Actinopterygii_dataset/ --no-progressbar
    

Observed Error Messages:

During the rehydration step, the process repeatedly fails with unexpected EOF errors. The error log is as follows:

Collecting 177 genome records [------------------------------------------------]   0% 0/177
Downloading: data/raw_data/Actinopterygii_dataset.zip    167kB valid zip structure -- files not checked
Validating package [================================================] 100% 4/4
Error: 
unexpected EOF
[repeated multiple times]

Use datasets rehydrate <command> --help for detailed help about a command.
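
To help tell a transient network failure from a reproducible one, the rehydration step can be wrapped in a small retry loop. The following is only a sketch: `retry` is a hypothetical helper (not part of the datasets CLI), and it assumes that `datasets rehydrate` exits with a non-zero status when it prints "unexpected EOF", which the log above does not confirm.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run a command up to MAX_ATTEMPTS times,
# sleeping DELAY_SECONDS between failed attempts.
# Usage: retry MAX_ATTEMPTS DELAY_SECONDS command [args...]
retry() {
    local max_attempts="$1"
    local delay_seconds="$2"
    shift 2
    local attempt=1
    while true; do
        # Run the wrapped command; stop as soon as it succeeds.
        if "$@"; then
            return 0
        fi
        if [ "$attempt" -ge "$max_attempts" ]; then
            echo "giving up after ${attempt} attempts: $*" >&2
            return 1
        fi
        echo "attempt ${attempt} failed; retrying in ${delay_seconds}s" >&2
        attempt=$((attempt + 1))
        sleep "$delay_seconds"
    done
}

# Example invocation (assumes the non-zero exit status noted above):
# retry 5 30 datasets rehydrate --directory data/raw_data/Actinopterygii_dataset/ --no-progressbar
```

If every attempt fails in the same way, the problem is less likely to be a momentary connection drop.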

I would greatly appreciate your assistance in addressing this matter.

mkrg01, Dec 03 '23 05:12