Full-Spectrum Prediction of Peptides Tandem Mass Spectra using Deep Neural Network.

07:00 EST 13th February 2020 | BioPortfolio

Summary of "Full-Spectrum Prediction of Peptides Tandem Mass Spectra using Deep Neural Network."

The ability to predict tandem mass (MS/MS) spectra from peptide sequences can significantly enhance our understanding of the peptide fragmentation process and could improve peptide identification in proteomics. However, current approaches for predicting high-energy collisional dissociation (HCD) spectra are limited to predict the intensities of expected ion types, i.e., the a/b/c/x/y/z ions and their neutral loss derivatives (referred to as backbone ions). In practice, backbone ions only account for <70% of total ion intensities in HCD spectra, indicating many intense ions are ignored by current predictors. In this paper, we present a deep learning approach that can predict the complete spectra (both backbone and non-backbone ions) directly from peptide sequences. We made no assumptions or expectations on which kind of ions to predict but instead predicting the intensities for all possible m/z. Training this model needs no annotations of fragment ion nor any prior knowledge of the fragmentation rules. Our analyses show that the predicted 2+ and 3+ HCD spectra are highly similar to the experimental spectra, with average full-spectrum cosine similarities of 0.820 (+/- 0.088) and 0.786 (+/- 0.085), respectively, very close to the similarities between the experimental replicated spectra. In contrast, the best-performed backbone only models can only achieve an average similarity below 0.75 and 0.70 for 2+ and 3+ spectra, respectively. Furthermore, we developed a multi-task learning (MTL) approach for predicting spectra of insufficient training samples, which allows our model to make accurate predictions for electron transfer dissociation (ETD) spectra and HCD spectra of less abundant charges (1+ and 4+).


Journal Details

This article was published in the following journal.

Name: Analytical chemistry
ISSN: 1520-6882


DeepDyve research library

PubMed Articles [15729 Associated PubMed Articles listed on BioPortfolio]

A combined strategy of neuropeptide prediction and tandem mass spectrometry identifies evolutionarily conserved ancient neuropeptides in the sea anemone Nematostella vectensis.

Neuropeptides are a class of bioactive peptides shown to be involved in various physiological processes, including metabolism, development, and reproduction. Although neuropeptide candidates have been...

MESSAR: Automated recommendation of metabolite substructures from tandem mass spectra.

Despite the increasing importance of non-targeted metabolomics to answer various life science questions, extracting biochemically relevant information from metabolomics spectral data is still an incom...

ChimST: An Efficient Spectral Library Search Tool for Peptide Identification from Chimeric Spectra in Data-dependent Acquisition.

Accurate and sensitive identification of peptides from MS/MS spectra is a very challenging problem in computational shotgun proteomics. To tackle this problem, spectral library search has been one of ...

Determination of 10-Hydroxy-2-Decenoic Acid of Royal Jelly Using Near-Infrared Spectroscopy Combined with Chemometrics.

A rapid quantitative analysis model for determining the hydroxy-2-decenoic acid (10-HDA) content of royal jelly based on near-infrared spectroscopy combining with PLS has been developed. Firstly, near...

Assessment method for deamidation in proteins using carboxylic acid derivatization-liquid chromatography-tandem mass spectrometry.

An analytical method for the degree of protein deamidation has been developed by using carboxy group derivatization and liquid chromatography-tandem mass spectrometry (LCMS/MS). The fragment peptides ...

Clinical Trials [5543 Associated Clinical Trials listed on BioPortfolio]

Hyperspectral Imaging for Neoplasm Early Stage Detection

Background Information With the advance in cancer biology, we realize that malignant neoplasm is related with different biological patterns in metabolome and microbiota. Because of the pro...

Amniotic Fluid Tandem Mass Spectrometry for Pregnancies Complicated by NIH and Severe Symmetrical IUGR

The objective of this pilot study is to prospectively evaluate amniotic fluid of pregnancies complicated by non-immune hydrops and severe symmetrical intrauterine growth restriction by tan...

A Study to Look at Performance of MICRUSFRAME and GALAXY Coils for the Treatment of Intracranial Aneurysms

A post-market registry evaluating ruptured/unruptured aneurysms treated exclusively with Spectra Galaxy and Spectra Micrusframe coils

Feasibility of Multi-Spectral Endoscopic Imaging for Detection of Early Neoplasia in Barrett's Oesophagus

Multispectral imaging represents an exciting new field of investigation in endoscopic research. Multispectral imaging uses a specialised camera to detect multiple colours, allowing us to b...

An Evaluation of the Spectra Optia CMNC Collection Procedure

The purpose of this prospective, randomized, cross-over, multi-center study is to evaluate the performance of the Spectra Optia Apheresis System's CMNC Collection Procedure, compared to th...

Medical and Biotech [MESH] Definitions

A mass spectrometric technique that is used for the analysis of a wide range of biomolecules, such as glycoalkaloids, glycoproteins, polysaccharides, and peptides. Positive and negative fast atom bombardment spectra are recorded on a mass spectrometer fitted with an atom gun with xenon as the customary beam. The mass spectra obtained contain molecular weight recognition as well as sequence information.

A mass spectrometry technique using two (MS/MS) or more mass analyzers. With two in tandem, the precursor ions are mass-selected by a first mass analyzer, and focused into a collision region where they are then fragmented into product ions which are then characterized by a second mass analyzer. A variety of techniques are used to separate the compounds, ionize them, and introduce them to the first mass analyzer. For example, for in GC-MS/MS, GAS CHROMATOGRAPHY-MASS SPECTROMETRY is involved in separating relatively small compounds by GAS CHROMATOGRAPHY prior to injecting them into an ionization chamber for the mass selection.

Copies of DNA sequences which lie adjacent to each other in the same orientation (direct tandem repeats) or in the opposite direction to each other (INVERTED TANDEM REPEATS).

The full spectrum of FUNGI that exist within a particular biological niche such as an organism, soil, a body of water, etc.

An analytical method used in determining the identity of a chemical based on its mass using mass analyzers/mass spectrometers.

Quick Search

DeepDyve research library

Relevant Topics

Multiple Sclerosis MS
Multiple sclerosis (MS) is the most common disabling neurological condition affecting 100,000 young adults in the UK. The condition results from autoimmune damage to myelin, causing interference in nerve signaling. Symptoms experienced depend on the pa...

Bioinformatics is the application of computer software and hardware to the management of biological data to create useful information. Computers are used to gather, store, analyze and integrate biological and genetic information which can then be applied...

Searches Linking to this Article