FastOMA
FastOMA copied to clipboard
Avoid length limit for input fasta ids
The enforced length limit on the fasta IDs is annoying.
https://github.com/DessimozLab/FastOMA/blob/38b299167435f16c781dfe065d96e184d6decf8d/FastOMA/check_input.py#L50
we could replace all the IDs with a lookup table. however, there are a few things to consider:
- doing it before OMAmer mapping means the results cannot easily be reused / existing omamer placements are invalid.
- debugging will involve an extra (mental) mapping step.
- generally requires all reporting steps to replace the mapping again.
One option could be to store the mapping in a sqlite database. comments wellcome.