diamond icon indicating copy to clipboard operation
diamond copied to clipboard

blast tab output: stitle should not include the sseqid

Open deprekate opened this issue 4 years ago • 4 comments

Currently the 'stitle' option for blast tab output is also including the subject id. For proper output, the subject id should be trimmed off subject_name

$ blastx -query seq1.fna -db nr  -outfmt '6 qseqid sseqid pident stitle'
seq1	gb|KRM34755.1|	        90.000	hypothetical protein FC44_GL000800 [Lactobacillus intestinalis DSM 6629]
seq1	ref|WP_057808743.1|	90.000	hypothetical protein [Lactobacillus intestinalis]
seq1	ref|WP_135960259.1|	90.000	hypothetical protein [Lactobacillus intestinalis]
seq1	ref|WP_154881438.1|	90.000	hypothetical protein [Lactobacillaceae bacterium]

$ diamond  blastx -q seq1.fna -d nr  --outfmt 6 qseqid sseqid pident stitle
seq1	WP_154881438.1	        90.0	WP_154881438.1 hypothetical protein [Lactobacillaceae bacterium]
seq1	WP_057808743.1	        90.0	WP_057808743.1 hypothetical protein [Lactobacillus intestinalis]
seq1	WP_135960259.1	        90.0	WP_135960259.1 hypothetical protein [Lactobacillus intestinalis] >TGY16829.1 hypothetical protein E5351_02205 [Lactobacillus intestinalis]
seq1	KRM34755.1	        90.0	KRM34755.1 hypothetical protein FC44_GL000800 [Lactobacillus intestinalis DSM 6629]

deprekate avatar Jun 12 '20 01:06 deprekate

Ok, I will provide an option to trim the title accordingly.

bbuchfink avatar Jun 15 '20 09:06 bbuchfink

Awesome, I have been using diamond for years, and I just recently realized there was an stitle option (I had been using blastdbcmd to get the stitle, and merging into the diamond output). Thanks for the great tool : )

deprekate avatar Jun 15 '20 20:06 deprekate

Should default behavior be to only show the stitle and not require a seperate flag/arguement?

If a user wants both the qseqid (a second time) and stitle they would use: --outfmt 6 qseqid sseqid pident qseqid stitle

deprekate avatar Jun 19 '20 21:06 deprekate

Yes, it probably should. But I don't like to break compatibility between versions, which is why adding an option came to mind.

bbuchfink avatar Jun 22 '20 08:06 bbuchfink