PhyloFisher

Paralogs added have the exact same name as ortholog

Open matiasWanntorp opened this issue 1 year ago • 4 comments

Hello!

I've run "working_dataset_constructor.py" and now have all of the multi-FASTA files containing the orthologs, the paralogs, and the newly extracted sequences. I noticed that the paralog sequences in my .fas files have names identical to those of the corresponding ortholog sequences. Is this intended, or is something perhaps wrong with my (custom) database or metadata?

Thank you in advance for any insight!

Edit: After looking at "build_database.py", it seems the paralogs should be renamed by appending ".." plus a random five-digit number, but this doesn't seem to have happened when the database was constructed. At least, this is my interpretation of the function "paralog_name(abbrev, keys)" in "build_database.py". Could I perhaps do this renaming after database construction and before running "working_dataset_constructor.py"?
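
A minimal sketch of such a post-hoc rename, written under stated assumptions rather than taken from PhyloFisher itself: it assumes the paralogs sit in their own FASTA text, that each header is just the sequence name, and that the intended form is the name followed by ".." and a random five-digit number, as in my reading above. The function name `rename_paralog_headers` is hypothetical, and whether the downstream scripts accept names produced this way is untested.

```python
import random


def rename_paralog_headers(fasta_text, rng=None):
    """Append '..' plus a random five-digit number to every FASTA header.

    Hypothetical helper: it only mimics the naming scheme described in
    this post and does not call or reproduce any PhyloFisher code.
    """
    rng = rng or random.Random()
    used = set()   # renamed headers seen so far, to keep them unique
    renamed = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            name = line[1:].strip()
            # Redraw until the new name has not been used in this file.
            while True:
                candidate = f"{name}..{rng.randint(10000, 99999)}"
                if candidate not in used:
                    break
            used.add(candidate)
            renamed.append(">" + candidate)
        else:
            # Sequence lines are passed through unchanged.
            renamed.append(line)
    return "\n".join(renamed) + "\n"


if __name__ == "__main__":
    example = ">Abcdefgh\nMKVLA\n>Abcdefgh\nMKILA\n"
    print(rename_paralog_headers(example), end="")
```

Running this on a copy of the paralog files first, and checking that the metadata still matches the part of each name before "..", would be the cautious way to try it.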

matiasWanntorp, Jan 08 '24 20:01