kingfisher-download
kingfisher-download copied to clipboard
Feature request: allow downloading from GSA
Since there are data deposited in GSA but not in SRA/ENA, it would be useful to add the option to download from GSA.
I was able to use Aspera for downloading from GSA using a command like:
./ascp -i [/path/to/the/key/file] -P33001 -QT -l100m -k1 [email protected]:/gsa3/<data set ID>/<run ID> [/path/to/your/local/directory/]
Hi,
Interesting feature request. I'm not very familiar with GSA, though, and couldn't find any (English) documentation that gives a guide as to how to interact programmatically. Implementing this in kingfisher would require some way to:
- Given a run ID, query for the data set ID (or better yet for the entire file download path)
- Since kingfisher can make use of a bioproject ID to search, some analogous way to search with a GSA bioproject ID
Any ideas? How did you know how to construct your example aspera command line? Thanks.
Yes, there is indeed a problem of documentation there. I was only able to construct this command after consulting with the GSA support team. I should say that they were pretty quick and helpful. They can be contacted at: [email protected] Maybe adding the feature to Kingfisher will make GSA more accessible to users (there's lots of data there).
Thanks. I've contacted them.