Title:
A proteomics sample metadata representation for multiomics integration and big data analysis
Author(s):
Dai, Chengxin; Füllgrabe, Anja; Pfeuffer, Julianus; Solovyeva, Elizaveta M.; Deng, Jingwen; Moreno, Pablo; Kamatchinathan, Selvakumar; Kundu, Deepti Jaiswal; George, Nancy; Fexova, Silvie
Year of publication:
2021
Available Date:
2021-12-20T10:12:12Z
Abstract:
The amount of public proteomics data is rapidly increasing but there is no standardized format to describe the sample metadata and their relationship with the dataset files in a way that fully supports their understanding or reanalysis. Here we propose to develop the transcriptomics data format MAGE-TAB into a standard representation for proteomics sample metadata. We implement MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets. We also describe tools and libraries to validate and submit sample metadata-related information to the PRIDE repository. We expect that these developments will improve the reproducibility and facilitate the reanalysis and integration of public proteomics datasets.
Part of Identifier:
e-ISSN (online): 2041-1723
Keywords:
Data publication and archiving
Proteome informatics
Proteomics
Standardization
DDC-Classification:
004 Datenverarbeitung; Informatik
Publication Type:
Wissenschaftlicher Artikel
URL of the Original Publication:
DOI of the Original Publication:
Journaltitle:
Nature Communications
Department/institution:
Mathematik und Informatik
Institut für Informatik