staramr icon indicating copy to clipboard operation
staramr copied to clipboard

Add support for multiple types of biological data files

Open apetkau opened this issue 5 years ago • 1 comments

It would be nice if staramr could support multiple types of input files (such as Genbank) and also compressed versions of each of this files (e.g., gzipped fasta). As an example, see the description of input for Abricate.

Conversion between different formats can likely use BioPython's SeqIO functionality.

Detection of file formats should also not depend on the extension (e.g., .fasta for fasta, .gz for gzipped) since this tool is integrated into Galaxy, which internally names all input files as .dat. Ideally, the file contents should be used to detect the type of file passed to staramr instead of the extension.

apetkau avatar Mar 08 '19 16:03 apetkau

We can leave this for a later release (as it's not as high of priority) so I'm switching back to unassigned.

apetkau avatar Apr 01 '19 15:04 apetkau