dc.contributor.author
Notaro, Marco
dc.contributor.author
Schubach, Max
dc.contributor.author
Robinson, Peter N.
dc.contributor.author
Valentini, Giorgio
dc.date.accessioned
2018-06-08T10:31:35Z
dc.date.available
2017-11-06T11:31:56.060Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/20591
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-23892
dc.description.abstract
Background The prediction of human gene–abnormal phenotype associations is a
fundamental step toward the discovery of novel genes associated with human
disorders, especially when no genes are known to be associated with a specific
disease. In this context the Human Phenotype Ontology (HPO) provides a
standard categorization of the abnormalities associated with human diseases.
While the problem of the prediction of gene–disease associations has been
widely investigated, the related problem of gene–phenotypic feature (i.e., HPO
term) associations has been largely overlooked, even if for most human genes
no HPO term associations are known and despite the increasing application of
the HPO to relevant medical problems. Moreover most of the methods proposed in
literature are not able to capture the hierarchical relationships between HPO
terms, thus resulting in inconsistent and relatively inaccurate predictions.
Results We present two hierarchical ensemble methods that we formally prove to
provide biologically consistent predictions according to the hierarchical
structure of the HPO. The modular structure of the proposed methods, that
consists in a “flat” learning first step and a hierarchical combination of the
predictions in the second step, allows the predictions of virtually any flat
learning method to be enhanced. The experimental results show that
hierarchical ensemble methods are able to predict novel associations between
genes and abnormal phenotypes with results that are competitive with state-of-
the-art algorithms and with a significant reduction of the computational
complexity. Conclusions Hierarchical ensembles are efficient computational
methods that guarantee biologically meaningful predictions that obey the true
path rule, and can be used as a tool to improve and make consistent the HPO
terms predictions starting from virtually any flat learning method. The
implementation of the proposed methods is available as an R package from the
CRAN repository.
en
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
dc.subject.ddc
600 Technik, Medizin, angewandte Wissenschaften::610 Medizin und Gesundheit
dc.title
Prediction of Human Phenotype Ontology terms by means of hierarchical ensemble
methods
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation
BMC Bioinformatics. - 18 (2017), Artikel Nr. 449
dcterms.bibliographicCitation.doi
10.1186/s12859-017-1854-y
dcterms.bibliographicCitation.url
http://doi.org/10.1186/s12859-017-1854-y
refubium.affiliation
Charité - Universitätsmedizin Berlin
de
refubium.mycore.fudocsId
FUDOCS_document_000000028445
refubium.note.author
Der Artikel wurde in einer reinen Open-Access-Zeitschrift publiziert.
refubium.resourceType.isindependentpub
no
refubium.mycore.derivateId
FUDOCS_derivate_000000009080
dcterms.accessRights.openaire
open access