
TMT-integrator feature-request/discussion: summarize/group intensities by modified sequence (Nesvilab/FragPipe issue #801)

Open MiguelCos opened this issue 1 year ago • 4 comments

Dear Fragpipe team,

As always, many thanks for the amazing work you do developing and maintaining FragPipe and all its tools.

Would it be possible to offer intensity grouping by modified sequence?

We have noticed that, when grouping by peptide, we are not able to quantitatively differentiate between different modification versions of the same peptide sequence after processing with TMT-integrator.

I would like to illustrate why we think this is essential with an example from our applications:

We are interested in N-terminomics/analysis of proteolytic processing, and it is crucial for us to differentiate between N-terminally acetylated peptides and N-terminally TMT-tagged peptides. This is not possible with the current sequence-focused approach.

I know that it is possible to summarize by PTM site, but I believe that we then lose the information for non-modified peptides; please correct me if this is not right.

I would really appreciate your feedback on this front.

Best wishes, Miguel

MiguelCos avatar Aug 17 '22 15:08 MiguelCos

Hi Miguel,

You would need to discuss with Hui-yin, who is the lead TMT-Integrator developer, if she can add another Index (peptide+modification) to TMT-Integrator.

The problem is: do you need to distinguish just some modifications, or all of them? For example:

PNSTM[+16]EMK
PNSTMEM[+16]K
PNSTM[+16]EM[+16]K
Acetyl-PNSTM[+16]EMK
Acetyl-PNSTMEM[+16]K
…

There are many possible ways to group (e.g. ignoring common mods like M+16 or not). Also, for TMT, what would one do with the TMT modifications (there could be partial labeling and full labeling)?

So it gets complicated in the general case. Perhaps something can be done specifically for terminomics data.
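To make the grouping question concrete, here is a minimal Python sketch showing how the choice of ignored modifications changes the number of groups for the peptide forms listed above. The function names `group_key` and `group_forms` are hypothetical, and the bracket notation is simply the one used in this comment; this is not how TMT-Integrator works internally.

```python
import re
from collections import defaultdict

# Peptide forms from the comment above, in its bracket notation.
FORMS = [
    "PNSTM[+16]EMK",
    "PNSTMEM[+16]K",
    "PNSTM[+16]EM[+16]K",
    "Acetyl-PNSTM[+16]EMK",
    "Acetyl-PNSTMEM[+16]K",
]

def group_key(form, ignore=()):
    """Remove bracketed tags listed in `ignore` (e.g. "+16") from a peptide form."""
    def drop_if_ignored(match):
        return "" if match.group(1) in ignore else match.group(0)
    return re.sub(r"\[([^\]]+)\]", drop_if_ignored, form)

def group_forms(forms, ignore=()):
    """Collect peptide forms that share the same grouping key."""
    groups = defaultdict(list)
    for form in forms:
        groups[group_key(form, ignore)].append(form)
    return dict(groups)

all_mods = group_forms(FORMS)                    # every form is its own group
no_oxidation = group_forms(FORMS, ignore={"+16"})  # only the N-terminal state differs
```

With nothing ignored, each of the five forms is its own group. Ignoring `+16` collapses them into two groups that differ only in the N-terminal acetylation.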

Best Alexey

anesvi avatar Aug 17 '22 17:08 anesvi

Hello Alexey,

Many thanks for your answer and your input.

It is true that it is not necessary to differentiate between every modification, but some are particularly interesting biologically.

In terms of N-terminomics + TMT experiments: N-terminal acetylation vs N-terminal TMT helps us determine whether a truncation comes from potential proteolytic processing or from something like a shifted translation initiation site.

I am working on some attempts to summarize the psm.tsv based on modified sequences in this repo (https://github.com/MiguelCos/summarize_psm_tsv_fragpipe). It seems to work well for single-mixture experiments, but it would get trickier for multi-mixture experiments.

Maybe Hui-yin would have an idea of how complicated it would be to implement some kind of selective peptide+modification based summarization (i.e. defining interesting modifications), and/or if it is worth the effort.
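As a rough illustration of what such a selective summarization could look like, the sketch below sums reporter intensities per mixture, peptide sequence and N-terminal state. Everything in it is an assumption made for illustration: the field names (`mixture`, `peptide`, `n_term`), the channel labels and the intensity values are made up, and a plain sum stands in for whatever summarization TMT-Integrator actually applies.

```python
from collections import defaultdict

def summarize(psms, channels):
    """Sum channel intensities per (mixture, peptide, n_term) group."""
    totals = defaultdict(lambda: dict.fromkeys(channels, 0.0))
    for psm in psms:
        key = (psm["mixture"], psm["peptide"], psm["n_term"])
        for channel in channels:
            totals[key][channel] += psm[channel]
    return dict(totals)

# Made-up PSM rows; a real psm.tsv has different columns and many more fields.
psms = [
    {"mixture": "mix1", "peptide": "PNSTMEMK", "n_term": "TMT", "126": 10.0, "127N": 20.0},
    {"mixture": "mix1", "peptide": "PNSTMEMK", "n_term": "TMT", "126": 5.0, "127N": 5.0},
    {"mixture": "mix1", "peptide": "PNSTMEMK", "n_term": "Acetyl", "126": 1.0, "127N": 2.0},
    {"mixture": "mix2", "peptide": "PNSTMEMK", "n_term": "TMT", "126": 7.0, "127N": 9.0},
]

result = summarize(psms, ["126", "127N"])
```

Keeping the mixture in the key means each mixture is summarized separately. Combining mixtures afterwards would need some form of cross-mixture normalization, which this sketch does not attempt.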

Best wishes, Miguel

MiguelCos avatar Aug 19 '22 09:08 MiguelCos

Hi Miguel,

Sorry for the late reply. It took me some time to think about how to properly define a new index that fits your needs.

Based on your description, I think you would like to compare peptide sequences with and without a specified modification at the same time, right? If so, the new index should also cover peptides that do not carry the modification.

Another question is how to distinguish modified peptides. For example, the peptide VETGVLKPGMVVTFAPVNVTTEVK is assigned two different sets of modifications in the PSMs listed below. Would you consider them the same modified peptide or different ones?

Spectrum | Peptide | Assigned modifications
01CPTAC_CCRCC_P_JHU_20171106_LUMOS_f01.41377.41377.4 | VETGVLKPGMVVTFAPVNVTTEVK | 10M(15.9949), 24K(229.1629), 7K(229.1629), N-term(229.1629)
01CPTAC_CCRCC_P_JHU_20171106_LUMOS_f01.45754.45754.4 | VETGVLKPGMVVTFAPVNVTTEVK | 24K(229.1629), 7K(229.1629), N-term(229.1629)
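The question can be made concrete with a small Python sketch that parses the assigned-modification strings shown above and builds an index key, with an optional set of masses to ignore. The names `parse_mods` and `index_key` and the `ignore_masses` parameter are hypothetical, and the parser only handles the residue and `N-term` tokens that appear in this example.

```python
import re

PEPTIDE = "VETGVLKPGMVVTFAPVNVTTEVK"
WITH_OX = "10M(15.9949), 24K(229.1629), 7K(229.1629), N-term(229.1629)"
WITHOUT_OX = "24K(229.1629), 7K(229.1629), N-term(229.1629)"

# Matches "N-term(229.1629)" or a residue token such as "10M(15.9949)".
MOD_RE = re.compile(r"^(?:(N-term)|(\d+)([A-Z]))\((-?[\d.]+)\)$")

def parse_mods(assigned):
    """Return sorted (position, site, mass) tuples; N-term gets position 0."""
    mods = []
    for token in (t.strip() for t in assigned.split(",")):
        if not token:
            continue
        match = MOD_RE.match(token)
        if match is None:
            raise ValueError(f"unrecognized modification: {token!r}")
        n_term, position, residue, mass = match.groups()
        if n_term:
            mods.append((0, "N-term", mass))
        else:
            mods.append((int(position), residue, mass))
    return sorted(mods)

def index_key(peptide, assigned, ignore_masses=()):
    """Build a (peptide, modifications) key, skipping masses in `ignore_masses`."""
    kept = tuple(m for m in parse_mods(assigned) if m[2] not in ignore_masses)
    return (peptide, kept)

strict_a = index_key(PEPTIDE, WITH_OX)
strict_b = index_key(PEPTIDE, WITHOUT_OX)
relaxed_a = index_key(PEPTIDE, WITH_OX, ignore_masses={"15.9949"})
relaxed_b = index_key(PEPTIDE, WITHOUT_OX, ignore_masses={"15.9949"})
```

Under the strict key the two PSMs are different modified peptides. Once the 15.9949 mass is ignored they share one key, so the answer depends entirely on which modifications are declared interesting.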

Maybe we can have this discussion in private? My email address is: @.*** Thanks.

Huiyin

--
Hui-Yin Chang, 張彙音
Assistant Professor
Department of Biomedical Sciences and Engineering
National Central University, Taiwan

huiyinc avatar Aug 20 '22 05:08 huiyinc