ncbi-genome-download icon indicating copy to clipboard operation
ncbi-genome-download copied to clipboard

anyway to exclude plasmid sequences from genomes downloaded?

Open Liao-PRIC opened this issue 4 years ago • 1 comments

Hi there,

I wonder if there is a way to download only chromosome sequences in genomes containing both plasmids and chromosomes? Or, is it able to exclude plasmid sequences? For example, I only need chromosome sequences in .faa files wich contained sequences from plasmids as well. I guess it is difficult to do that using the current script. Any solution? Thank you!

Li

Liao-PRIC avatar Feb 26 '20 05:02 Liao-PRIC

Hi, as the plasmids and the chromosomes are contained in the same assembly and thus the same file, it's not possible to exclude contents during the download. You could possibly do this in some sort of post-processing script that removes all but the largest record from a file, assuming you're dealing with fully assembled genomes and not draft genomes.

kblin avatar Feb 27 '20 07:02 kblin