CAFE icon indicating copy to clipboard operation
CAFE copied to clipboard

CAFE Report File Analysis error

Open CLAIRE-cuhk opened this issue 3 years ago • 2 comments

Hello,

I encountered an error when I try to parse the cafe report. I found a similar issue posted before and saying this error below is caused by the additional internal node label. I tried to delete the internal node labels in the input tree to fix the problem but didn't work. Can anyone tell me which exact part should I delete?

Here is my original tree: (Dro:155.769,(Mus:140.192,(((Aedi:41.5384,Cul:41.5384)1:41.5384,(Ang:41.5384,Ans:41.5384)1:41.5384)1:41.5384,(Tri:109.038,((((Ped:31.1538,Fra:31.1538)1:31.1538,(Bla:46.7307,((Zoo:15.5769,Cry:15.5769)1:15.5769,Per:31.1538)1:15.5769)1:15.5769)1:15.5769,((Cim:25.9615,Cin25.9615)83.6:25.9615,Lao:51.923)1:25.9615)1:15.5769,(Apia:74.7691,(Lin:56.0768,((Nyl:18.6923,Cam:18.6923)1:18.6923,(Mon:18.6923,Cyp:18.6923)1:18.6923)1:18.6923)1:18.6923)1:18.6923)1:15.5769)1:15.5769)1:15.5769)1:15.5769)

	CAFE Report File Analysis
	09.21.2020 | 10:24:20

Parsing format information...


Initializing output structures...


Counting changes per branch... [*-------------------------------------------------] 0.061% complete.Traceback (most recent call last): File "./cafetutorial_report_analysis.py", line 371, in results_main, node_fams_main = cra(inlines_main, results_main, node_fams_main, linestart_main, ancfilename, sorted_nodes, 1); File "./cafetutorial_report_analysis.py", line 193, in cra anccount = int(tlinfo[curanc][4]); ValueError: invalid literal for int() with base 10: '4_11'

CLAIRE-cuhk avatar Sep 21 '20 02:09 CLAIRE-cuhk

Hi, I think this is happening because the report_analysis script unfortunately can't handle node labels on the tree. All of your internal nodes seem to have the label '1' (except for one that is '83.6'). These labels get carried over to each gene family tree in the CAFE output. To fix, you can either re-run CAFE without the internal labels on the original tree, or remove the labels from all trees in the CAFE report file with some regex. Let us know if you need any more help! -Gregg Thomas

On Sun, Sep 20, 2020 at 8:43 PM CLAIRE-cuhk [email protected] wrote:

Hello,

I encountered an error when I try to parse the cafe report. I found a similar issue posted before and saying this error below is caused by the additional internal node label. I tried to delete the internal node labels in the input tree to fix the problem but didn't work. Can anyone tell me which exact part should I delete? Here is my original tree:

(Drosophilamelanogaster:155.769,(Muscadomestica:140.192,(((Aedesaegypti:41.5384,Culexquinquefasciatus:41.5384)1:41.5384,(Anophelesgambiae:41.5384,Anophelesstephensi:41.5384)1:41.5384)1:41.5384,(Triboliumcastaneum:109.038,((((Pediculushumanuscorporis:31.1538,Frankliniellaoccidentalis:31.1538)1:31.1538,(Blattellagermanica:46.7307,((Zootermopsisnevadensis:15.5769,Cryptotermessecundus:15.5769)1:15.5769,Periplanetaamericana:31.1538)1:15.5769)1:15.5769)1:15.5769,((Cimexlectularius:25.9615,Cinaracedri:25.9615)83.6:25.9615,Laodelphaxstriatellus:51.923)1:25.9615)1:15.5769,(Apismellifera:74.7691,(Linepithemahumil:56.0768,((Nylanderiafulva:18.6923,Camponotusfloridanus:18.6923)1:18.6923,(Monomoriumpharaonis:18.6923,Cyphomyrmexcostatus:18.6923)1:18.6923)1:18.6923)1:18.6923)1:18.6923)1:15.5769)1:15.5769)1:15.5769)1:15.5769)

CAFE Report File Analysis 09.21.2020 | 10:24:20


Parsing format information...

Initializing output structures...

Counting changes per branch... [*-------------------------------------------------] 0.061% complete.Traceback (most recent call last): File "./cafetutorial_report_analysis.py", line 371, in results_main, node_fams_main = cra(inlines_main, results_main, node_fams_main, linestart_main, ancfilename, sorted_nodes, 1); File "./cafetutorial_report_analysis.py", line 193, in cra anccount = int(tlinfo[curanc][4]); ValueError: invalid literal for int() with base 10: '4_11'

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/hahnlab/CAFE/issues/85, or unsubscribe https://github.com/notifications/unsubscribe-auth/AC7RJCJADRNXFVYKPZS452DSG24U7ANCNFSM4RUANXTA .

gwct avatar Sep 21 '20 03:09 gwct

Hi, I think this is happening because the report_analysis script unfortunately can't handle node labels on the tree. All of your internal nodes seem to have the label '1' (except for one that is '83.6'). These labels get carried over to each gene family tree in the CAFE output. To fix, you can either re-run CAFE without the internal labels on the original tree, or remove the labels from all trees in the CAFE report file with some regex. Let us know if you need any more help! -Gregg Thomas

Many thanks for your help! I was able to finish the run after removing all internal node labels.

CLAIRE-cuhk avatar Sep 28 '20 06:09 CLAIRE-cuhk