datasets icon indicating copy to clipboard operation
datasets copied to clipboard

--geo-location flag runs into invalid zip archive error

Open joverlee521 opened this issue 4 months ago • 3 comments

Hi NCBI Datasets team,

Today I've tried a couple geolocations with the --geo-location flag and have run into the invalid zip archive error every time.

My attempt with state level "WA"
$ ./datasets download virus genome taxon sars-cov-2             --geo-location WA             --filename data/ncbi_dataset.zip --debug
2024/02/28 19:03:50 
GET /datasets/v2alpha/taxonomy/taxon_suggest/sars-cov-2?exact_match=true&tax_rank_filter=higher_taxon&taxon_resource_filter=TAXON_RESOURCE_FILTER_ALL HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Accept: application/json
Ncbi-Phid: 55DD9889E6F9F0E2D8D045A9
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location WA --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip


2024/02/28 19:03:51 
HTTP/2.0 200 OK
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/json
Date: Wed, 28 Feb 2024 19:03:51 GMT
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: 55DD9889E6F9F0E2D8D045A9.1.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


2024/02/28 19:03:51 
POST /datasets/v2alpha/taxonomy HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Content-Length: 53
Accept: application/json
Content-Type: application/json
Ncbi-Phid: 55DD9889E6F9F0E2D8D045A9
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location WA --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip

{"returned_content":"METADATA","taxons":["2697049"]}

2024/02/28 19:03:51 
HTTP/2.0 200 OK
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/json
Date: Wed, 28 Feb 2024 19:03:51 GMT
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: 55DD9889E6F9F0E2D8D045A9.2.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


2024/02/28 19:03:51 
POST /datasets/v2alpha/virus/genome/download HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Content-Length: 189
Accept: application/zip
Accept: application/json
Content-Type: application/json
Ncbi-Phid: 55DD9889E6F9F0E2D8D045A9
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location WA --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip

{"annotated_only":false,"complete_only":false,"format":"tsv","geo_location":"WA","host":"","include_sequence":["GENOME"],"pangolin_classification":"","refseq_only":false,"taxon":"2697049"}

2024/02/28 19:03:51 
HTTP/2.0 200 OK
Content-Disposition: attachment; filename=ncbi_dataset.zip
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/zip
Date: Wed, 28 Feb 2024 19:03:51 GMT
Grpc-Metadata-Logging-Activity: download
Grpc-Metadata-Logging-Annotated_only: False
Grpc-Metadata-Logging-Refseq_only: False
Grpc-Metadata-Logging-Service: virus
Grpc-Metadata-Logging-Taxon: 2697049
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: 55DD9889E6F9F0E2D8D045A9.3.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
Vary: Accept-Encoding
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


Downloading: data/ncbi_dataset.zip    112kB done
Downloading: data/ncbi_dataset.zip    112kB invalid zip archive
Validating package []

Use datasets download virus genome taxon <command> --help for detailed help about a command.
My attempt with country level "USA"
$ ./datasets download virus genome taxon sars-cov-2 --geo-location USA --filename data/ncbi_dataset.zip --debug
2024/02/28 18:50:55 
GET /datasets/v2alpha/taxonomy/taxon_suggest/sars-cov-2?exact_match=true&tax_rank_filter=higher_taxon&taxon_resource_filter=TAXON_RESOURCE_FILTER_ALL HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Accept: application/json
Ncbi-Phid: 76BF10892A975A708F9C4692
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location USA --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip


2024/02/28 18:50:56 
HTTP/2.0 200 OK
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/json
Date: Wed, 28 Feb 2024 18:50:56 GMT
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: 76BF10892A975A708F9C4692.1.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


2024/02/28 18:50:56 
POST /datasets/v2alpha/taxonomy HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Content-Length: 53
Accept: application/json
Content-Type: application/json
Ncbi-Phid: 76BF10892A975A708F9C4692
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location USA --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip

{"returned_content":"METADATA","taxons":["2697049"]}

2024/02/28 18:50:56 
HTTP/2.0 200 OK
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/json
Date: Wed, 28 Feb 2024 18:50:56 GMT
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: 76BF10892A975A708F9C4692.2.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


2024/02/28 18:50:56 
POST /datasets/v2alpha/virus/genome/download HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Content-Length: 190
Accept: application/zip
Accept: application/json
Content-Type: application/json
Ncbi-Phid: 76BF10892A975A708F9C4692
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location USA --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip

{"annotated_only":false,"complete_only":false,"format":"tsv","geo_location":"USA","host":"","include_sequence":["GENOME"],"pangolin_classification":"","refseq_only":false,"taxon":"2697049"}

2024/02/28 18:50:56 
HTTP/2.0 200 OK
Content-Disposition: attachment; filename=ncbi_dataset.zip
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/zip
Date: Wed, 28 Feb 2024 18:50:56 GMT
Grpc-Metadata-Logging-Activity: download
Grpc-Metadata-Logging-Annotated_only: False
Grpc-Metadata-Logging-Refseq_only: False
Grpc-Metadata-Logging-Service: virus
Grpc-Metadata-Logging-Taxon: 2697049
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: 76BF10892A975A708F9C4692.3.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
Vary: Accept-Encoding
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


Downloading: data/ncbi_dataset.zip    8.25MB done
Downloading: data/ncbi_dataset.zip    8.25MB invalid zip archive
Validating package []

Use datasets download virus genome taxon <command> --help for detailed help about a command.
My attempt with continent level "Africa"
$ ./datasets download virus genome taxon sars-cov-2             --geo-location Africa             --filename data/ncbi_dataset.zip --debug
2024/02/28 19:02:41 
GET /datasets/v2alpha/taxonomy/taxon_suggest/sars-cov-2?exact_match=true&tax_rank_filter=higher_taxon&taxon_resource_filter=TAXON_RESOURCE_FILTER_ALL HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Accept: application/json
Ncbi-Phid: E35746682FB5DDAAA893F10F
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location Africa --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip


2024/02/28 19:02:42 
HTTP/2.0 200 OK
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/json
Date: Wed, 28 Feb 2024 19:02:42 GMT
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: E35746682FB5DDAAA893F10F.1.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


2024/02/28 19:02:42 
POST /datasets/v2alpha/taxonomy HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Content-Length: 53
Accept: application/json
Content-Type: application/json
Ncbi-Phid: E35746682FB5DDAAA893F10F
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location Africa --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip

{"returned_content":"METADATA","taxons":["2697049"]}

2024/02/28 19:02:42 
HTTP/2.0 200 OK
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/json
Date: Wed, 28 Feb 2024 19:02:42 GMT
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: E35746682FB5DDAAA893F10F.2.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


2024/02/28 19:02:42 
POST /datasets/v2alpha/virus/genome/download HTTP/1.1
Host: api.ncbi.nlm.nih.gov
User-Agent: OpenAPI-Generator/16.6.0/go
Content-Length: 193
Accept: application/zip
Accept: application/json
Content-Type: application/json
Ncbi-Phid: E35746682FB5DDAAA893F10F
X-Datasets-Client: datasets-cli
X-Datasets-Client-Arch: amd64
X-Datasets-Client-Cmd: download virus genome taxon sars-cov-2 --geo-location Africa --filename data/ncbi_dataset.zip --debug
X-Datasets-Client-Os: linux
X-Datasets-Client-Version: 16.6.0
Accept-Encoding: gzip

{"annotated_only":false,"complete_only":false,"format":"tsv","geo_location":"Africa","host":"","include_sequence":["GENOME"],"pangolin_classification":"","refseq_only":false,"taxon":"2697049"}

2024/02/28 19:02:42 
HTTP/2.0 200 OK
Content-Disposition: attachment; filename=ncbi_dataset.zip
Content-Security-Policy: upgrade-insecure-requests
Content-Type: application/zip
Date: Wed, 28 Feb 2024 19:02:42 GMT
Grpc-Metadata-Logging-Activity: download
Grpc-Metadata-Logging-Annotated_only: False
Grpc-Metadata-Logging-Refseq_only: False
Grpc-Metadata-Logging-Service: virus
Grpc-Metadata-Logging-Taxon: 2697049
Grpc-Metadata-Via: h2 linkerd
Ncbi-Phid: E35746682FB5DDAAA893F10F.3.1
Server: Apache
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
Vary: Accept-Encoding
X-Datasets-Version: 16.6.0
X-Ua-Compatible: IE=Edge
X-Xss-Protection: 1; mode=block


Downloading: data/ncbi_dataset.zip    855B invalid zip archive
Downloading: data/ncbi_dataset.zip    855B invalid zip archive
Validating package []

Use datasets download virus genome taxon <command> --help for detailed help about a command.

joverlee521 avatar Feb 28 '24 19:02 joverlee521