
roary and cd-hit

Open zhichusun opened this issue 2 years ago • 1 comments

I want to know how to remove redundant genes from a pan-genome. I analyzed 100 genomes with Roary and got the pan_genome_reference.fa file, which contains all core and accessory genes. I then used cd-hit-est to remove the redundant genes from this file. How can I afterwards remove those same redundant genes from gene_presence_absence.csv? Any help is greatly appreciated.

zhichusun, Apr 06 '22 14:04
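
One possible way to carry the cd-hit-est result over to gene_presence_absence.csv is sketched below. It is only a rough sketch under several assumptions, not a method confirmed by the Roary or CD-HIT authors: it assumes each header in pan_genome_reference.fa looks like `>sequence_id gene_name`, that the `.clstr` file written by cd-hit-est marks each cluster representative with a trailing `*`, and that the first column of gene_presence_absence.csv is named `Gene`. The function names (`redundant_ids`, `id_to_gene`, `filter_presence_absence`) are made up for illustration.

```python
import csv
import re

# Matches the sequence name in a .clstr member line such as
# "1\t890nt, >b_002... at +/98.00%"
MEMBER_RE = re.compile(r">(\S+?)\.\.\.")


def redundant_ids(clstr_lines):
    """Collect sequence IDs that cd-hit-est clustered under another
    representative (member lines that do NOT end with '*')."""
    redundant = set()
    for line in clstr_lines:
        line = line.strip()
        if not line or line.startswith(">Cluster"):
            continue
        match = MEMBER_RE.search(line)
        if match and not line.endswith("*"):
            redundant.add(match.group(1))
    return redundant


def id_to_gene(fasta_lines):
    """Map sequence ID -> gene name, assuming headers of the form
    '>sequence_id gene_name' (falls back to the ID if no name is given)."""
    mapping = {}
    for line in fasta_lines:
        if line.startswith(">"):
            parts = line[1:].split()
            if parts:
                mapping[parts[0]] = parts[1] if len(parts) > 1 else parts[0]
    return mapping


def filter_presence_absence(csv_in, csv_out, drop_genes):
    """Copy the CSV, skipping rows whose 'Gene' value is in drop_genes.
    Returns the number of rows kept."""
    reader = csv.DictReader(csv_in)
    writer = csv.DictWriter(
        csv_out, fieldnames=reader.fieldnames, quoting=csv.QUOTE_ALL
    )
    writer.writeheader()
    kept = 0
    for row in reader:
        if row["Gene"] not in drop_genes:
            writer.writerow(row)
            kept += 1
    return kept
```

To use it, one would open the `.clstr` file and pan_genome_reference.fa, build `drop = {id_to_gene(fasta)[i] for i in redundant_ids(clstr)}`, and pass `drop` to `filter_presence_absence` together with the opened input and output CSV files. Two caveats: by default CD-HIT truncates names in the `.clstr` file to 20 characters (running it with `-d 0` keeps the name up to the first space), and simply dropping rows discards the presence/absence pattern of the removed genes, so merging those rows into the representative's row may be preferable depending on the analysis.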

I need to understand more about CD-HIT and Roary. My question is whether CD-HIT generates the annotation of the protein clusters and, if so, how it does that. What database is used to annotate the protein clusters? It produces a lot of hypothetical proteins, yet if I BLAST them against the NCBI RefSeq database, it shows me what each protein is.

gapabh, May 25 '22 12:05