Freequant: corrupted pepXML file. iTraq4 experiment

Open wyu opened this issue 1 year ago • 3 comments

Hi, we're encountering an error while searching an iTraq4 dataset from CPTAC. Judging by the file sizes, one of the pepXML files looks corrupted. How should we work around the problem in order to complete the analysis?

Thanks! Wen

Directory of G:\contrib\A_36_GC_early_onset_PDC000214\results_isoform\N5381T5760

10/17/2022 09:38 PM 129,792,842 FN01_N5381T5760_180min_10ug_C1_053015.pepXML
10/17/2022 09:38 PM 130,252,487 FN02_N5381T5760_180min_10ug_C2_053015.pepXML
10/17/2022 09:39 PM 130,096,557 FN03_N5381T5760_180min_10ug_C1_053015.pepXML
10/17/2022 09:39 PM 131,206,680 FN04_N5381T5760_180min_10ug_C2_053015.pepXML
10/17/2022 09:40 PM 133,732,627 FN05_N5381T5760_180min_10ug_C1_053015.pepXML
10/17/2022 09:41 PM 89,938,341 FN06_N5381T5760_180min_10ug_C2_053015.pepXML
10/17/2022 09:41 PM 136,035,527 FN07_N5381T5760_180min_10ug_C1_053015.pepXML
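The size comparison above can be automated. The following is only a rough sketch: the function names and the 80% threshold are assumptions made for illustration, not part of FragPipe or Philosopher. It flags files that are much smaller than the median size in the folder. A small file is only a hint, not proof of corruption.

```python
from pathlib import Path
from statistics import median


def flag_small_files(sizes, ratio=0.8):
    """Return the names whose size is below ratio * median size.

    `sizes` maps file name -> size in bytes. The 0.8 ratio is an
    arbitrary illustrative threshold, not a FragPipe setting.
    """
    if not sizes:
        return []
    cutoff = ratio * median(sizes.values())
    return sorted(name for name, size in sizes.items() if size < cutoff)


def pepxml_sizes(directory):
    """Collect the sizes of the *.pepXML files in one result folder."""
    return {p.name: p.stat().st_size for p in Path(directory).glob("*.pepXML")}


# Sizes copied from the directory listing in this post.
listed = {
    "FN01_N5381T5760_180min_10ug_C1_053015.pepXML": 129_792_842,
    "FN02_N5381T5760_180min_10ug_C2_053015.pepXML": 130_252_487,
    "FN03_N5381T5760_180min_10ug_C1_053015.pepXML": 130_096_557,
    "FN04_N5381T5760_180min_10ug_C2_053015.pepXML": 131_206_680,
    "FN05_N5381T5760_180min_10ug_C1_053015.pepXML": 133_732_627,
    "FN06_N5381T5760_180min_10ug_C2_053015.pepXML": 89_938_341,
    "FN07_N5381T5760_180min_10ug_C1_053015.pepXML": 136_035_527,
}
print(flag_small_files(listed))
```

On a real folder, `flag_small_files(pepxml_sizes(r"G:\path\to\results"))` would do the same check; the path here is a placeholder.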

log_2022-10-20_09-49-13.txt

wyu avatar Oct 20 '22 14:10 wyu

The pepXML is the output from MSFragger, right? If the file is corrupted, why don't you run MSFragger again?

prvst avatar Oct 20 '22 18:10 prvst

FN06_N5381T5760_180min_10ug_C2_053015.zip

Hi, I took a closer look at the offending pepXML file (attached). It has the proper ending tags, but it is quite a bit smaller than the other files in the folder. I'm not sure that re-running the search would help.

...
</search_hit>
</search_result>
</spectrum_query>
</msms_run_summary>
</msms_pipeline_analysis>

Thanks! Wen

Process 'FreeQuant' finished, exit code: 0
FreeQuant [Work dir: G:\contrib\A_36_GC_early_onset_PDC000214\results_isoform\N5381T5760]
E:\Tools\FragPipe\FragPipe-jpe-18\fragpipe\tools\philosopher_v4.4.0_windows_amd64\philosopher.exe freequant --ptw 0.4 --tol 10 --dir G:\contrib\A_36_GC_early_onset_PDC000214\mzML\N5381T5760
INFO[04:29:36] Executing Label-free quantification v4.4.0
INFO[04:30:06] Indexing PSM information
INFO[04:30:11] Reading spectra and tracing peaks
INFO[04:30:11] Processing FN01_N5381T5760_180min_10ug_C1_053015
INFO[04:30:47] Processing FN02_N5381T5760_180min_10ug_C2_053015
INFO[04:31:25] Processing FN03_N5381T5760_180min_10ug_C1_053015
INFO[04:32:01] Processing FN04_N5381T5760_180min_10ug_C2_053015
INFO[04:32:39] Processing FN05_N5381T5760_180min_10ug_C1_053015
INFO[04:33:18] Processing FN06_N5381T5760_180min_10ug_C2_053015
panic: Cannot decode packed binary. XML syntax error on line 3802344: unexpected EOF

goroutine 1 [running]:
philosopher/lib/msg.callLogrus({0xc0c1c69e50, 0x4d}, {0xa912c0, 0x1?})
    /workspace/philosopher/lib/msg/msg.go:319 +0x354
philosopher/lib/msg.DecodeMsgPck({0xbe0040?, 0xc034ff8240?}, {0xa912c0, 0x5})
    /workspace/philosopher/lib/msg/msg.go:120 +0x6e
philosopher/lib/psi.(*IndexedMzML).Parse(0xc1235b3ed0, {0xc104648000, 0x63})
    /workspace/philosopher/lib/psi/psi.go:115 +0x17b
philosopher/lib/mzn.(*MsData).Read(0xc1235b45d0, {0xc104648000, 0x63})
    /workspace/philosopher/lib/mzn/mzn.go:194 +0x66
philosopher/lib/qua.peakIntensity({0x0, {{0x0, 0x0}, {0x0, 0x0}, {0x0, 0x0}, {0x0, 0x0}, {0x0, ...}, ...}, ...}, ...)
    /workspace/philosopher/lib/qua/lfq.go:92 +0xd5d
philosopher/lib/qua.RunLabelFreeQuantification({{0x0, 0x0}, {0x0, 0x0}, {0xa90d40, 0x4}, {0xc00013a0c0, 0x38}, {0x0, 0x0}, ...})
    /workspace/philosopher/lib/qua/qua.go:46 +0xed
philosopher/cmd.glob..func6(0x20342e0?, {0xa90ca0?, 0x6?, 0x6?})
    /workspace/philosopher/cmd/freequant.go:50 +0x29a
github.com/spf13/cobra.(*Command).execute(0x20342e0, {0xc000166060, 0x6, 0x6})
    /home/prvst/go/pkg/mod/github.com/spf13/[email protected]/command.go:844 +0x663
github.com/spf13/cobra.(*Command).ExecuteC(0x2032b40)
    /home/prvst/go/pkg/mod/github.com/spf13/[email protected]/command.go:945 +0x37c
github.com/spf13/cobra.(*Command).Execute(...)
    /home/prvst/go/pkg/mod/github.com/spf13/[email protected]/command.go:885
philosopher/cmd.Execute()
    /workspace/philosopher/cmd/root.go:35 +0x25
main.main()
    /workspace/philosopher/main.go:25 +0x90
Process 'FreeQuant' finished, exit code: 2
Process returned non-zero exit code, stopping
ERRO[04:33:37] Cannot decode packed binary. XML syntax error on line 3802344: unexpected EOF

Cancelling 207 remaining tasks

wyu avatar Oct 20 '22 19:10 wyu

Can you convert your raw files to mzML again using the latest msconvert?

prvst avatar Oct 21 '22 15:10 prvst

Thanks for your suggestion!

I repeated the mzML conversion with the latest "msconvert" from "ProteoWizard3.0.22293.189308e". Before launching the search on the complete dataset, I was able to successfully search one arbitrary iTraq4 plex (N13T236). However, the plex "N5381T5760" that gave me the error before still yielded the same error as shown in the log file.

Do you think that something is silently wrong with the "FN06_N5381T5760_180min_10ug_C2_053015" run? If so, is there a way to bypass it so that we can complete the overall search? Missing one run from a 24-fraction experiment is probably OK.

Thanks! Wen

log_2022-10-26_12-35-23_failed_plex.txt

wyu avatar Oct 26 '22 16:10 wyu

Can you share the file?

prvst avatar Oct 26 '22 16:10 prvst

Hi, here is the pepXML file I was referring to. Let me know if you need the whole plex.

Wen

FN06_N5381T5760_180min_10ug_C2_053015.zip

wyu avatar Oct 26 '22 16:10 wyu

Sorry, I was referring to the raw and mzML files.

prvst avatar Oct 26 '22 16:10 prvst

Hi, can you see if you can access the Box folder via the link? The upload will take some time, but the folder should be there.

https://az.box.com/s/1ar441189nakjlvdoe5kcn7vgboigzsl

Wen

wyu avatar Oct 26 '22 17:10 wyu

Hi, the Box folder containing the raw, mzML and MSFragger output is now ready. I've also placed the sample manifest in the folder for your reference.

https://az.box.com/s/1ar441189nakjlvdoe5kcn7vgboigzsl

Thanks! Wen

wyu avatar Oct 26 '22 20:10 wyu

You forgot to send the database.

prvst avatar Oct 27 '22 16:10 prvst

Actually, that's unnecessary. I converted the faulty run from the raw file to mzML again myself and got the same problem: the file does not seem to convert all the way to the end. The most likely reason is that the raw file itself is bad. My suggestion is to copy it from the source again, if possible. If that doesn't work, just set the run aside.
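A truncated conversion like this can be spotted before running quantification. Here is a minimal sketch using only the Python standard library. The function name `find_xml_error` is made up for illustration, and this is not how Philosopher reads mzML. It only checks that the XML parses to the end, which is the failure reported in the log ("unexpected EOF"). It does not validate the mzML schema or decode the binary arrays.

```python
import io
import xml.etree.ElementTree as ET


def find_xml_error(source):
    """Stream through an XML file and report the first syntax error.

    `source` may be a file path or a binary file object. Returns None
    when the document parses to the end, otherwise the parser's error
    message (which includes the line and column of the failure).
    """
    try:
        for _event, elem in ET.iterparse(source, events=("end",)):
            elem.clear()  # drop element content to keep memory use modest
    except ET.ParseError as exc:
        return str(exc)
    return None


# Tiny in-memory demonstration; the tag names are only placeholders.
complete = io.BytesIO(b"<mzML><run><spectrum/></run></mzML>")
truncated = io.BytesIO(b"<mzML><run><spectrum>")
print(find_xml_error(complete))   # None: parsed to the end
print(find_xml_error(truncated))  # a parser error message
```

For real data, pass the path of each mzML file, e.g. `find_xml_error("run.mzML")` with a placeholder file name. Any file that returns a message is a candidate for re-copying from the source or exclusion.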

prvst avatar Oct 27 '22 16:10 prvst

Thanks for looking into the issue! I was able to complete the search once I removed the offending pepXML and interact* files from this folder.
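For anyone who wants to script this workaround, below is a rough sketch. It assumes that every file belonging to the bad run carries the run name in its file name (both the pepXML and the interact* files). That naming is an assumption and should be checked against your own folder. The function names are hypothetical. Files are moved to a separate folder rather than deleted, so the step can be undone.

```python
from pathlib import Path
import shutil


def select_run_files(names, run_name):
    """Return the file names that contain the given run name."""
    return sorted(name for name in names if run_name in name)


def set_run_aside(result_dir, run_name, target_dir):
    """Move all files of one run from result_dir into target_dir.

    target_dir should sit outside the results folder so that later
    pipeline steps do not pick the files up again (an assumption about
    how the folder is scanned, not something confirmed in this thread).
    """
    result_dir, target_dir = Path(result_dir), Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    names = [p.name for p in result_dir.iterdir() if p.is_file()]
    moved = select_run_files(names, run_name)
    for name in moved:
        shutil.move(str(result_dir / name), str(target_dir / name))
    return moved


# Dry run on made-up file names to show which files would be selected.
example = ["FN05_x.pepXML", "FN06_x.pepXML", "interact-FN06_x.pep.xml"]
print(select_run_files(example, "FN06_x"))
```

Running `select_run_files` on the real folder listing first is a cheap way to confirm that only the intended run is matched before moving anything.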

Wen

wyu avatar Nov 04 '22 16:11 wyu