dc.contributor.author
Rogasch, Julian M. M.
dc.contributor.author
Michaels, Liza
dc.contributor.author
Baumgärtner, Georg L.
dc.contributor.author
Frost, Nikolaj
dc.contributor.author
Rückert, Jens-Carsten
dc.contributor.author
Neudecker, Jens
dc.contributor.author
Ochsenreither, Sebastian
dc.contributor.author
Gerhold, Manuela
dc.contributor.author
Schmidt, Bernd
dc.contributor.author
Schneider, Paul
dc.contributor.author
Amthauer, Holger
dc.contributor.author
Furth, Christian
dc.contributor.author
Penzkofer, Tobias
dc.date.accessioned
2025-10-02T16:27:43Z
dc.date.available
2025-10-02T16:27:43Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/49650
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-49373
dc.description.abstract
Background
In patients with non-small cell lung cancer (NSCLC), accuracy of [18F]FDG-PET/CT for pretherapeutic lymph node (LN) staging is limited by false positive findings. Our aim was to evaluate machine learning with routinely obtainable variables to improve accuracy over standard visual image assessment.
Methods
Monocentric retrospective analysis of pretherapeutic [18F]FDG-PET/CT in 491 consecutive patients with NSCLC using an analog PET/CT scanner (training + test cohort, n = 385) or digital scanner (validation, n = 106). Forty clinical variables, tumor characteristics, and image variables (e.g., primary tumor and LN SUVmax and size) were collected. Different combinations of machine learning methods for feature selection and classification of N0/1 vs. N2/3 disease were compared. Ten-fold nested cross-validation was used to derive the mean area under the ROC curve of the ten test folds (“test AUC”) and AUC in the validation cohort. Reference standard was the final N stage from interdisciplinary consensus (histological results for N2/3 LNs in 96%).
Results
N2/3 disease was present in 190 patients (39%; training + test, 37%; validation, 46%; p = 0.09). A gradient boosting classifier (GBM) with 10 features was selected as the final model based on test AUC of 0.91 (95% confidence interval, 0.87–0.94). Validation AUC was 0.94 (0.89–0.98). At a target sensitivity of approx. 90%, test/validation accuracy of the GBM was 0.78/0.87. This was significantly higher than the accuracy based on “mediastinal LN uptake > mediastinum” (0.7/0.75; each p < 0.05) or combined PET/CT criteria (PET positive and/or LN short axis diameter > 10 mm; 0.68/0.75; each p < 0.001). Harmonization of PET images between the two scanners affected SUVmax and visual assessment of the LNs but did not diminish the AUC of the GBM.
Conclusions
A machine learning model based on routinely available variables from [18F]FDG-PET/CT improved accuracy in mediastinal LN staging compared to established visual assessment criteria. A web application implementing this model was made available.
en
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
machine learning
en
dc.subject
lymph node staging
en
dc.subject
artificial intelligence
en
dc.subject.ddc
600 Technik, Medizin, angewandte Wissenschaften::610 Medizin und Gesundheit::610 Medizin und Gesundheit
dc.title
A machine learning tool to improve prediction of mediastinal lymph node metastases in non-small cell lung cancer using routinely obtainable [18F]FDG-PET/CT parameters
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.doi
10.1007/s00259-023-06145-z
dcterms.bibliographicCitation.journaltitle
European Journal of Nuclear Medicine and Molecular Imaging
dcterms.bibliographicCitation.number
7
dcterms.bibliographicCitation.originalpublishername
Springer Nature
dcterms.bibliographicCitation.pagestart
2140
dcterms.bibliographicCitation.pageend
2151
dcterms.bibliographicCitation.volume
50
refubium.affiliation
Charité - Universitätsmedizin Berlin
refubium.funding
Springer Nature DEAL
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.bibliographicCitation.pmid
36820890
dcterms.isPartOf.issn
1619-7070
dcterms.isPartOf.eissn
1619-7089