SeqFindR icon indicating copy to clipboard operation
SeqFindR copied to clipboard

Investigate the effects of using hsp.identities/float(record.query_length) >= tol vs max_identity >= tol

Open mscook opened this issue 11 years ago • 2 comments

mscook avatar Jun 11 '13 02:06 mscook

I use the following in Dryad to allow users to specify a minimum % identity over a minimum % of gene length. (where lenCutOff is the percentage of original gene length that must be covered by the match & identCutoff is the percentage identity across the match itself)

if ( float(hsp.identities) / float(hsp.align_length) * float(100) ) > float(identCutoff)
and ( float(hsp.align_length) / float(alignment.length) * float(100) > lenCutoff):

happykhan avatar Jul 16 '13 05:07 happykhan

Thanks Nabil. Discussed with Brian & Nouri today and am going to implement some additional scoring schemes. I like the one above.

mscook avatar Mar 06 '14 14:03 mscook