dc.contributor.author
Holtkamp, Agnes
dc.contributor.author
Elhennawy, Karim
dc.contributor.author
Cejudo Grano de Oro, José E.
dc.contributor.author
Krois, Joachim
dc.contributor.author
Paris, Sebastian
dc.contributor.author
Schwendicke, Falk
dc.date.accessioned
2021-09-10T07:30:08Z
dc.date.available
2021-09-10T07:30:08Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/31914
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-31646
dc.description.abstract
Objectives: The present study aimed to train deep convolutional neural networks (CNNs) to detect caries lesions on Near-Infrared Light Transillumination (NILT) imagery obtained either in vitro or in vivo and to assess the models' generalizability. Methods: In vitro, 226 extracted posterior permanent human teeth were mounted in a diagnostic model in a dummy head. Then, NILT images were generated (DIAGNOcam, KaVo, Biberach), and images were segmented tooth-wise. In vivo, 1319 teeth from 56 patients were obtained and segmented similarly. Proximal caries lesions were annotated pixel-wise by three experienced dentists, reviewed by a fourth dentist, and then transformed into binary labels. We trained ResNet classification models on both in vivo and in vitro datasets and used 10-fold cross-validation for estimating the performance and generalizability of the models. We used GradCAM to increase explainability. Results: The tooth-level prevalence of caries lesions was 41% in vitro and 49% in vivo, respectively. Models trained and tested on in vivo data performed significantly better (mean ± SD accuracy: 0.78 ± 0.04) than those trained and tested on in vitro data (accuracy: 0.64 ± 0.15; p < 0.05). When tested in vitro, the models trained in vivo showed significantly lower accuracy (0.70 ± 0.01; p < 0.01). Similarly, when tested in vivo, models trained in vitro showed significantly lower accuracy (0.61 ± 0.04; p < 0.05). In both cases, this was due to decreases in sensitivity (by -27% for models trained in vivo and -10% for models trained in vitro). Conclusions: Using in vitro setups for generating NILT imagery and training CNNs comes with low accuracy and generalizability. Clinical significance: Studies employing in vitro imagery for developing deep learning models should be critically appraised for their generalizability. Applicable deep learning models for assessing NILT imagery should be trained on in vivo data.
en
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
artificial intelligence
en
dc.subject
digital imaging/radiology
en
dc.subject
mathematical modeling
en
dc.subject.ddc
600 Technik, Medizin, angewandte Wissenschaften::610 Medizin und Gesundheit::610 Medizin und Gesundheit
dc.title
Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.articlenumber
961
dcterms.bibliographicCitation.doi
10.3390/jcm10050961
dcterms.bibliographicCitation.journaltitle
Journal of Clinical Medicine
dcterms.bibliographicCitation.number
5
dcterms.bibliographicCitation.originalpublishername
MDPI AG
dcterms.bibliographicCitation.volume
10
refubium.affiliation
Charité - Universitätsmedizin Berlin
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.bibliographicCitation.pmid
33804562
dcterms.isPartOf.eissn
2077-0383