dc.contributor.author
Altenburg, Tom
dc.contributor.author
Giese, Sven
dc.contributor.author
Wang, Shengbo
dc.contributor.author
Muth, Thilo
dc.contributor.author
Renard, Bernhard Y.
dc.date.accessioned
2022-04-22T14:44:10Z
dc.date.available
2022-04-22T14:44:10Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/34790
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-34509
dc.description.abstract
Mass spectrometry-based proteomics provides a holistic snapshot of the entire protein set of living cells on a molecular level. Currently, only a few deep learning approaches exist that involve peptide fragmentation spectra, which represent partial sequence information of proteins. Commonly, these approaches lack the ability to characterize less studied or even unknown patterns in spectra because of their use of explicit domain knowledge. Here, to elevate unrestricted learning from spectra, we introduce ‘ad hoc learning of fragmentation’ (AHLF), a deep learning model that is end-to-end trained on 19.2 million spectra from several phosphoproteomic datasets. AHLF is interpretable, and we show that peak-level feature importance values and pairwise interactions between peaks are in line with corresponding peptide fragments. We demonstrate our approach by detecting post-translational modifications, specifically protein phosphorylation based on only the fragmentation spectrum without a database search. AHLF increases the area under the receiver operating characteristic curve (AUC) by an average of 9.4% on recent phosphoproteomic data compared with the current state of the art on this task. Furthermore, use of AHLF in rescoring search results increases the number of phosphopeptide identifications by a margin of up to 15.1% at a constant false discovery rate. To show the broad applicability of AHLF, we use transfer learning to also detect cross-linked peptides, as used in protein structure analysis, with an AUC of up to 94%.
en
dc.format.extent
16 Seiten
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
Machine learning
en
dc.subject
Mass spectrometry
en
dc.subject
Artificial Intelligence
en
dc.subject.ddc
000 Informatik, Informationswissenschaft, allgemeine Werke::000 Informatik, Wissen, Systeme::004 Datenverarbeitung; Informatik
dc.title
Ad hoc learning of peptide fragmentation from mass spectra enables an interpretable detection of phosphorylated and cross-linked peptides
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.doi
10.1038/s42256-022-00467-7
dcterms.bibliographicCitation.journaltitle
Nature Machine Intelligence
dcterms.bibliographicCitation.pagestart
378
dcterms.bibliographicCitation.pageend
388
dcterms.bibliographicCitation.volume
4
dcterms.bibliographicCitation.url
https://doi.org/10.1038/s42256-022-00467-7
refubium.affiliation
Mathematik und Informatik
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.eissn
2522-5839
refubium.resourceType.provider
WoS-Alert