ncbi-genome-download
ncbi-genome-download copied to clipboard
anyway to exclude plasmid sequences from genomes downloaded?
Hi there,
I wonder if there is a way to download only chromosome sequences in genomes containing both plasmids and chromosomes? Or, is it able to exclude plasmid sequences? For example, I only need chromosome sequences in .faa files wich contained sequences from plasmids as well. I guess it is difficult to do that using the current script. Any solution? Thank you!
Li
Hi, as the plasmids and the chromosomes are contained in the same assembly and thus the same file, it's not possible to exclude contents during the download. You could possibly do this in some sort of post-processing script that removes all but the largest record from a file, assuming you're dealing with fully assembled genomes and not draft genomes.