gffread icon indicating copy to clipboard operation
gffread copied to clipboard

An error was found in the generated protein sequence

Open Tang-pro opened this issue 10 months ago • 2 comments

Hi @alephreish @gpertea

I extracted CDS sequences and protein sequences based on the CDS contained in the GTF. gffread Y2.gtf -g Genome.fa -x cds.fa -y pep.fa

But I found the cds sequence in this ID is normal, but the protein sequence contains some . Image

Image

Tang-pro avatar Feb 08 '25 02:02 Tang-pro

(For the record: I do not maintain gffread).

The "CDS" sequence you pasted (I had to OCR it - pls do not paste sequence screenshots!!) has multiple stop-codons. Stops are indicated by dots by default in gffread (check out the -S argument).

alephreish avatar Feb 13 '25 17:02 alephreish

@alephreish

Okay, thanks.

Tang-pro avatar Feb 14 '25 01:02 Tang-pro