How to run genomad on only complete sequences (DTR or ITR)?

Open ZongzhiWu opened this issue 1 year ago • 1 comments

How can I run geNomad on only complete sequences (those with DTRs or ITRs), and not on fragments?

ZongzhiWu avatar Jan 25 '24 06:01 ZongzhiWu

You can filter those after the run using the topology column of the summary file. If you want to process only the putatively complete genomes, you'd need to filter out the fragments before running geNomad. If you are comfortable with Python, you can use the Sequence class within geNomad to achieve that. It has properties that indicate whether a sequence has DTRs or ITRs (see here).
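For the pre-filtering route, here is a minimal sketch of the idea in plain Python. It deliberately does not import geNomad: `has_dtr`, `has_itr`, `read_fasta`, and `filter_complete` are illustrative helpers written for this example, not geNomad's own API, and the 21 bp minimum repeat length is an assumed threshold. The checks are simplified (exact matches only), so treat the result as a rough filter rather than a reproduction of what geNomad reports in its topology column.

```python
# Sketch only: plain-Python stand-ins, not geNomad's Sequence class.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_dtr(seq, min_length=21):
    """True if the sequence ends with an exact copy of its start
    (direct terminal repeat) at least min_length bases long."""
    seq = seq.upper()
    if len(seq) < 2 * min_length:
        return False
    seed = seq[:min_length]
    # Look for the seed in the second half of the sequence.
    pos = seq.find(seed, len(seq) // 2)
    while pos != -1:
        # The whole suffix from pos must equal the prefix of the same length.
        if seq[pos:] == seq[: len(seq) - pos]:
            return True
        pos = seq.find(seed, pos + 1)
    return False


def has_itr(seq, min_length=21):
    """True if the first min_length bases equal the reverse complement
    of the last min_length bases (inverted terminal repeat)."""
    seq = seq.upper()
    if len(seq) < 2 * min_length:
        return False
    return seq[:min_length] == reverse_complement(seq[-min_length:])


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_complete(in_path, out_path, min_length=21):
    """Write only sequences with a DTR or ITR to out_path; return the count kept."""
    kept = 0
    with open(in_path) as fin, open(out_path, "w") as fout:
        for header, seq in read_fasta(fin):
            if has_dtr(seq, min_length) or has_itr(seq, min_length):
                fout.write(f">{header}\n{seq}\n")
                kept += 1
    return kept
```

You could then call something like `filter_complete("contigs.fna", "complete_only.fna")` (both file names are placeholders) and give the filtered FASTA to geNomad as its input.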

apcamargo avatar Jan 25 '24 22:01 apcamargo