dfast_core icon indicating copy to clipboard operation
dfast_core copied to clipboard

pseudogene prediction

Open eddykay310 opened this issue 4 years ago • 1 comments

Hi

Please does DFAST handle replicate pseudogenes in the case of stop codons that fragment genes and therefore generate multiple hits for a single gene.

Thanks

eddykay310 avatar Aug 22 '21 21:08 eddykay310

Sorry for the late response. Yes, CDSs are fragmented when there is a stop codon inside of it. DFAST first try to find such fragmented CDSs based on the coverage against the reference sequence. Then, they are re-aligned to the same reference protein after extending the both 5'- and 3'-ends of the CDSs. When the stop codon or frameshift is found in the extended region, the CDS is annotated as pseudogene.

nigyta avatar Nov 19 '21 00:11 nigyta