pyopenms-docs Potential useful information to review and add to the readthedocs page

Potential useful information to review and add to the readthedocs page - from learning unit 4

Open timosachsenberg opened this issue 1 year ago • 2 comments

Quantitative Behavior of Mass Spectrometers To apply the theory we learnt in the last chapters of this unit to quantitative analyses with mass spectrometers, we need to think about the limitations that come from the physical processes behind this method first. The parts crucial for quantification are the separation, ionization and the detection. After eluting from the column, the analyte is ionized so that the number of ions for this analyte should be proportional to the concentration of it in the input sample. At the detector, a signal (ion current) is measured that is proportional to the number of ions arriving at it. However, this process has some limitations that make quantification a hard task: Saturation: The detector has an upper limit for an ion current to be reported. Too many ions hitting the detector at the same time result in a saturated signal. This limits the linear/dynamic range of this method. Ionization efficiency: Different species are ionized more easily than others. That means, that the response factors are different among species and signal intensities for different species can not be compared absolutely without any correction. Ionization efficiency of a molecule in ESI for example, depends on factors like (non-)polarity and surface activity of the molecule. Noise: The matrix competes with the analyte for ionization. We can not be sure that the signal we are measuring comes from the analyte. Another problem is that the signal for an analyte will be "split" and recorded at different retention times and different m/z values: Elution profiles: Not all of the analyte comes out of the column at the same time. Due to peak broadening, we will record bell shaped elution profiles instead of sharp peaks. Isotope profiles: Because of different isotopic compositions, the analyte actually occurs as several peaks at different m/z values (so called isotope profiles or ladders). Charge states: During ionization, some analyte molecules might be charged differently than others. Their isotope ladders will occur at proportions of the actual mass on the m/z axis. That means, we have to sum the signals of different regions on an LCMS map.

Deisotoping Along the m/z axis, there is a similar problem as on the last page. Due to charge states and isotopes in the composition of the analytes, we will not observe a single peak in the MS spectrum. However, we can infer the charge, the mass, the average isotope composition and eventually the isotope profile step by step in the following way: The distance between neighbouring (non-noise) peaks corresponds to one atom substituted with its heavier isotope. This results in an increased mass by one neutron, represented as a distance of roughly 1/z on the m/z axis. So if we observe distances of around 0.33 Thompson between neighbouring isotopic peaks, the charge of this measured ion was most likely 3. After the charge was determined, the mass can easily be calculated by multiplying the observed m/z values with the charge. When the mass is known, we can calculate the average isotope composition of an average amino acid with this mass and model the (still discretized) isotope pattern using a binomial distribution. If we now convolve these peaks with Gaussians (whose standard deviations depend on the resolution of the instrument), we will obtain the expected continuous isotope pattern and can select the relevant peaks that we have to sum to get the overall intensity for this particular species. After finding an isotope profile, one has to keep in mind that other isotope profiles might correspond to the same species at different m/z values due to different ionization. However, because of a relationship between intensities and the charge of an ion on some devices, one has to be careful when combining them for absolute quantification. The last pages have shown, that we have to look in both retention time and m/z dimension for traces of intensities for a particular species. These two-dimensional patterns are called features. The use and a more detailed explanation of how to find and evaluate features, will be given in learning units 5A and 5B.

Quantitative Data – MS1 Spectra The way of using MS1 spectra for quantitative proteomics is simply to load a peptide sample onto the LC column coupled to an MS instrument. For simplicity, assume that every sample (a patient, a given experimental condition, a time point in a time series, etc.) is run as only one LC-MS experiment, thus no pre-analysis separation is performed. In MS1 Spectra, different ionized species in the same spectrum result in different peaks. The mass of a peptide (peak) is usually found during several consecutive MS scans, depending on how much time it takes the analyte (peptides) to elute from the column (corresponding to the width of its chromatographic peak).

Comparing intensities of different analytes in the same spectrum is not possible because they have different response factors. Peptides/metabolites that differ only by a stable isotope label will have identical response factors – their intensities can be compared within the same spectrum. This is the basis for isotopic labels.

Quantitative Data – LC-MS Maps Numerous spectra are acquired with rates up to dozens per second over the course of an LC-MS run. Ideally, it should be easy to identify corresponding spectra from different (sub)samples based on their retention time. However, the exact retention time of an analyte (and thus the occurrence of its chromatographic peak) may shift from run to run. An analyte can also occur in several spectra from the same sub-sample as outlined above. The relative simplicity of comparing only one spectrum from each sample is therefore lost in this approach. A useful way to visualize a quantitative LC-MS experiment is to stack the spectra, yielding maps.

DDA To produce tandem mass spectra two common modes are established: the data dependent acquisition and the data independent acquisition In DDA, from one survey scan several ion species (i.e. m/z values) are picked (most commonly the top abundant ones) and further analyzed. For some time after the survey scan, these ions are selectively collected and subjected each to the collision chamber and the product ions analyzed.

With DDA, a broad coverage on MS2 can be accomplished, though the variability may be high. The number of ion species picked also coins the commonly used name for the setting, generalized with the number n: Top-n acquisition. The higher n, the more fragmentations have to be conducted and time consumed, in which newly eluting analytes might get missed. The lower n, the more low abundance ions might get unfragmented.

DIA

In DIA, the tandem spectra from the complete mass range of the analyte is collected. In practice, the system is set to sequentially isolate and fragment subsequent mass windows of certain width (say 10 Th). The overlapping of fragment spectra and the unknown precursor mass of the fragments pose a nontrivial challenge to data analysis for identification.

Suppose we have a set of n samples, each containing a set of molecular components. The majority of the components are the same in each sample, and in our context the components are mainly peptides, but in some cases they are proteins. In quantitative proteomics, the task is to explore how the abundance of the corresponding peptides varies from sample to sample. Due to the way the instruments detect the ions, relative quantification is the easiest form of quantification, meaning that, instead of the absolute concentration, we measure fold changes in the molecules between samples. The relative measurements are performed either within a sample or across the samples, and both labeled methods and label-free methods exist.

We will divide the discussion into methods related to label-free peptide methods and label-based peptide methods. In this context, a label is simply something attached to the peptides of a sample to enable the distinction of this sample from a differently labeled or unlabeled sample. The labelling technologies can be naturally grouped into in vivo labelling and in vitro labelling. Isobaric labelling strategy such as iTRAQ and TMT, will be introduced in LU5C in details.

Label-free quantification is a method that aims to determine the relative amount of proteins in two or more biological samples. It may be based on precursor signal intensity or on spectral counting. The first method is useful when applied to high precision mass spectra. In contrast, spectral counting simply counts the number of spectra identified for a given peptide in different biological samples and then integrates the results for all measured peptides of the protein(s) that are quantified. The computational framework includes detecting peptides, matching the corresponding peptides across multiple maps, selecting discriminatory peptides. MS1 or MS2? Using label-free MS methods to quantify peptide samples will not give any indication of the identity of the components under analysis, but has a greater potential for discovering low-abundance molecules compared to MS/MS-based methods, because the instrument does not need to spend time in MS/MS mode. If interesting candidates are discovered, these can be selected for subsequent identification by MS/MS if suitable spectrometers are used.

Labeling techniques The idea of labelling techniques is to introduce a label in one sample and a different (or no label) in another. The mixing of labeled samples allows a relative quantification between two (or more) samples. Many labeling techniques exploit stable isotope labeling. Different isotopes of the same element behave chemically basically identically (Following isotopes are often used: .1/2H,.12/13C,14/15N,16/18O ). Their masses differ, however, so the MS can distinguish them. Advantages Both samples are treated identically, systematic errors affect them in the same way. It can be easily annotated manually (e.g., by looking for pairs of peaks). Disadvantages Labels can be expensive, difficult, unreliable to introduce. Labeling in vivo is not always possible, not all techniques support in vitro labeling. Chemical labeling and Metabolic labeling 1, Chemical labeling means that peptides are modified chemically after extraction. The label is usually attached covalently at specific functional groups (e.g., N-terminus, specific side chains). It does not involve a perturbation of the in vivo system. Labeling occurs late (during sample preparation) and thus does not account for variance introduced in the early steps. e.g. iTRAQ, TMT 2, Stable isotope labels can also be integrated by ‘feeding’ the organism with labeled metabolites, e.g., amino acids, nitrogen sources, glucose. Full incorporation of the label can take a while. It requires perturbation of the in vivo system, depending on the size. It's quite expensive. Labeling occurs early in the study, results in higher reproducibility.

Applications Quantitative proteome analysis, the global analysis of protein expression, is increasingly being used as a method to study steady-state and perturbation-induced changes in protein profiles. It helps to better understand the structure, function, and control of biologic systems and processes. Applications to systems biology

"Quantitative proteomics can be successfully used for characterizing alterations in protein abundance, finding novel protein-protein and protein-peptide interactions. Further, it can directly compare activation of entire signaling networks in response to individual stimuli and discover critical differences in their circuits that account for alterations of cell response." Aebersold, Ruedi, Beate Rist, and Steven P. Gygi. "Quantitative proteome analysis: methods and applications." Annals of the New York Academy of Sciences 919.1 (2000): 33-47.

SWATH (Sequential Windowed data independent Acquisition of the Total High-resolution Mass Spectra) acquisition was not listed in the graph above. It is a global quantitative strategy that is usually compared with SRM/MRM mentioned on previous page. The idea is to collect a MS and MS/MS spectrum at high resolution on every analyte for the quantitation of everything in the sample. Unlike SRM that each MS2 series is a record of one peptide across LC, the MS scan data of SWATH acquisition is independent and complete fragment ion map of sample is recorded by cycle.

A special variety of DIA is SWATH (Sequential window acquisition of all theoretical mass spectra). It was first introduced in use with a Triple-TOF system. Here, the quadrupole isolates sequentially 25 Th precursor windows across a mass range of interest during the complete elution time of the coupled LC. The ions in these windows are fragmented and analysed.

Mar 09 '23 14:03 timosachsenberg

pyopenms-docs pyopenms-docs copied to clipboard

Potential useful information to review and add to the readthedocs page - from learning unit 4

pyopenms-docs
pyopenms-docs copied to clipboard