Augustus icon indicating copy to clipboard operation
Augustus copied to clipboard

Script join_aug_pred.pl incompatible with option --uniqueGeneId

Open marchoeppner opened this issue 3 years ago • 1 comments

As the title states, the script for joining Augustus predictions across split jobs does not work if the option "--uniqueGeneId=true" is invoked.

This is because of the regexp used to determine a gene start:

https://github.com/Gaius-Augustus/Augustus/blob/5a1c7baef970d0a27a32866168c25d05cd387ac7/scripts/join_aug_pred.pl#L103

I suppose the trailing 'g' could simply be ommitted to fix this. It's just a bit of an odd behavior that took me a few minutes to track down. I imagine others may run into this as well.

Cheers, Marc

marchoeppner avatar Apr 09 '21 12:04 marchoeppner

I agree, it's safe to ommitt the trailing g since we are matching the hash marks and the term start gene. That is true for the current Augustus. @MarioStanke is it true older Augustus versions, too?

I made a commit to fix this for current and old Augustus at https://github.com/Gaius-Augustus/Augustus/commit/af437758b8210355321d4d6362f3dec44355b392 and https://github.com/Gaius-Augustus/Augustus/commit/25c266dac5e99d02875cf981eca0c2688af9a1cc but we might have to revert it for old Augustus versions if the string "### gene" is not unique for gene starts in the old gff output format.

KatharinaHoff avatar Apr 14 '21 07:04 KatharinaHoff