dammit
dammit copied to clipboard
include BUSCO hits in gff3
Camille,
I want to retain all contigs for which any annotation data exists, including BUSCO hits. This would be easy if the BUSCO results were including in the final gff3 file in some way. What do you think about including them?
I definitely support it. Do you have ideas on how to represent these in the gff3? The standard really demands having start/end coordinates for each feature; I suppose I could just have the features span the entire transcript (which seems kludgy/inaccurate), or pull coordinates from the BLAST or hmmer outputs. What do you think?
Been looking into this. The output from BUSCO is very raw -- many hits per BUSCO, sometimes multiple per transcript, combined from both hmmer and tblastn. I think the short term solution for 1.0 will just be to set the start and end as the start and end coordinates of the transript, and think about dealing with it more intelligently later.