funannotate update with mysql

Open Nitin123-4 opened this issue 5 months ago • 6 comments

Hi Team,

I configured funannotate to use mysql by following https://github.com/PASApipeline/PASApipeline/wiki/setting-up-pasa-mysql.

It is running fine with 12 cpus.

funannotate update -i Genome_funannotate_train/ --cpus 12 --pasa_db mysql

Any idea how long it will take to complete?

Previous annotation consists of: 28,846 protein coding gene models and 683 non-coding gene models.

Nitin123-4 avatar Jan 23 '24 18:01 Nitin123-4

Hi Team, any idea about this?

Nitin123-4 avatar Jan 30 '24 23:01 Nitin123-4

I guess it depends on how many transcripts you had in the initial round. It should be faster than when it ran train. I'm assuming that you ran train with the same mysql setup. The update step runs PASA genome comparison, so it takes your gene models from predict and looks at the alignments in the PASA database -- it can predict some new genes and/or alter some coding sequences. It runs this step two times iteratively. If it is still running after 7 days then I'd think something is wrong. I would guess several hours would be more typical.

nextgenusfs avatar Jan 31 '24 00:01 nextgenusfs

Hi, thanks for your reply. I ran funannotate train with sqlite. I am running update with mysql.

Nitin123-4 avatar Jan 31 '24 01:01 Nitin123-4

That won't work.

nextgenusfs avatar Jan 31 '24 01:01 nextgenusfs

Oh, okay. So funannotate train should also be run with mysql, and then update with mysql should be faster?

Nitin123-4 avatar Jan 31 '24 18:01 Nitin123-4

Yeah, it tries to reuse the database. You could just run a fresh update, but you need to pass all the reads, etc., and direct the output to a new output folder. Update is capable of running the data from scratch as well.

nextgenusfs avatar Jan 31 '24 20:01 nextgenusfs
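
For readers landing on this thread, the "fresh update" suggested above might look roughly like the sketch below. It is not from the thread: the read file names and the new output folder are placeholders, and the flags (-i, -o, -l, -r, --cpus, --pasa_db) are written as recalled from the funannotate update help, so check them against `funannotate update --help` for your installed version. The script only prints the command unless RUN=1 is set.

```shell
#!/bin/sh
# Sketch only. Placeholder values: the read files and FRESH_DIR are
# hypothetical and do not come from the thread.
PASA_DB=mysql                               # keep the same backend for every step
CPUS=12
READS_LEFT=reads_R1.fastq.gz                # hypothetical paired-end reads
READS_RIGHT=reads_R2.fastq.gz
PREDICT_DIR=Genome_funannotate_train        # input folder from the original command
FRESH_DIR=Genome_funannotate_update_mysql   # hypothetical new output folder

# Fresh update: pass the reads again and write to a new folder, so the
# sqlite PASA database created during train is not reused.
UPDATE_CMD="funannotate update -i $PREDICT_DIR -o $FRESH_DIR -l $READS_LEFT -r $READS_RIGHT --cpus $CPUS --pasa_db $PASA_DB"

# Dry run by default: print the command. Set RUN=1 to execute it.
if [ "${RUN:-0}" = "1" ]; then
    $UPDATE_CMD
else
    echo "$UPDATE_CMD"
fi
```

The other route discussed above is to rerun funannotate train with --pasa_db mysql, so that update can reuse that database.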