TALON icon indicating copy to clipboard operation
TALON copied to clipboard

Issues in identification of genes/isoforms based on the reference GTF in Talon:

Open aramaiah opened this issue 4 years ago • 3 comments

I used the Talon in Bioconda. I ran the Talon using blank GTF as reference for de novo genes identification. It was successful. However, when I used the output of Talon run-1 as input GTF reference, for identifying the genes using subreads data, it was keeping giving me the errors during 'talon_abundance' and 'talon_create_GTF' steps. I have used variety of names for -a and -g, but could not able to succeed. Could you help me to solve this issue?

aramaiah avatar Mar 25 '20 01:03 aramaiah

Hi, When you initialize a TALON database with a blank GTF, you essentially get an empty database, which leads to problems downstream as you noticed because no genome build or annotation names got added. What I would recommend to get around this is to initialize with a dummy GTF like the one I've uploaded here- even having one mock GTF entry in the file means that the annotation name and build will be added to the database, where the downstream scripts should be able to find them. It is possible you will hit some other unusual behaviors since we didn't develop TALON for de novo annotation from the outset, but I'm happy to answer more questions as they come up. Best, Dana

dewyman avatar Mar 25 '20 16:03 dewyman

Hi, Thank you for your response. I have used an actual GTF (output of first Talon run) as input and I am doing a new run with isoseq data for novel transcript discovery. I had used the test gtf that you had shared for the previous run. I am happy to share the data if you wants to look at them.

Best, Arun

On Wed, Mar 25, 2020 at 9:52 AM Dana Wyman [email protected] wrote:

Hi, When you initialize a TALON database with a blank GTF, you essentially get an empty database, which leads to problems downstream as you noticed because no genome build or annotation names got added. What I would recommend to get around this is to initialize with a dummy GTF like the one I've uploaded here https://github.com/mortazavilab/TALON/blob/master/src/talon/blank.gtf- even having one mock GTF entry in the file means that the annotation name and build will be added to the database, where the downstream scripts should be able to find them. It is possible you will hit some other unusual behaviors since we didn't develop TALON for de novo annotation from the outset, but I'm happy to answer more questions as they come up. Best, Dana

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/mortazavilab/TALON/issues/55#issuecomment-603956845, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALXZBMG3N5W4UGMHT4FREB3RJIZEJANCNFSM4LTDQMIA .

aramaiah avatar Mar 25 '20 21:03 aramaiah

Hi Arun, I'd be happy to take a look. You can send the GTF to me at [email protected]. Best, Dana

dewyman avatar Mar 25 '20 21:03 dewyman