pysradb
[BUG] aspera
Following up on my previous issue: I still don't get the fastq files with Aspera, only empty folders, using the following code:
from pysradb.sraweb import SRAweb
SRA_OUR_DIR = "/data/NCBI_data/"
db = SRAweb()
gse_to_srp = db.gse_to_srp("GSE226189")
print("gse_to_srp shape:", gse_to_srp.shape)
display(gse_to_srp.head(2))
metadata = db.sra_metadata(gse_to_srp["study_accession"].to_list(), detailed=True)
print(metadata.shape)
display(metadata.head(2))
db.download(df=metadata.head(1),
            url_col="ena_fastq_http_1",
            use_ascp=True,
            #threads=8,
            skip_confirmation=True,  # don't ask for permission to download
            out_dir=SRA_OUR_DIR)
OS: AWS EC2, Ubuntu 22.04.2 LTS, anaconda3, Python 3.11.5
When url_col is left at its default, I do get the .sra files. The link in the "ena_fastq_http_1" column seems fine (http://ftp.sra.ebi.ac.uk/vol1/fastq/SRR236/077/SRR23630177/SRR23630177_1.fastq.gz)
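One thing that may be worth checking is whether an HTTP link is a usable source for ascp at all. Below is a minimal sketch of how the HTTP link above would translate to an Aspera source, assuming the era-fasp@fasp.sra.ebi.ac.uk convention from ENA's download documentation. `http_to_aspera` is a hypothetical helper for illustration and is not part of pysradb; whether pysradb performs a similar conversion internally is not something this sketch establishes.

```python
from urllib.parse import urlparse

# Assumption: ENA's Aspera user and host, as described in ENA's download docs.
ENA_ASPERA_PREFIX = "era-fasp@fasp.sra.ebi.ac.uk:"


def http_to_aspera(url: str) -> str:
    """Hypothetical helper (not part of pysradb).

    Map an ENA HTTP/FTP fastq URL to an ascp-style source string.
    """
    parsed = urlparse(url)
    if parsed.netloc != "ftp.sra.ebi.ac.uk":
        raise ValueError(f"not an ENA fastq URL: {url}")
    # Drop the leading slash so the path is relative to the Aspera root.
    return ENA_ASPERA_PREFIX + parsed.path.lstrip("/")


url = "http://ftp.sra.ebi.ac.uk/vol1/fastq/SRR236/077/SRR23630177/SRR23630177_1.fastq.gz"
print(http_to_aspera(url))
```

If the string printed here differs from what ends up being passed to ascp, that could explain the empty folders, though this is only a guess.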