Exploring the Relevance of Full Lung CT Scans in Deep Learning for Classifying Nodule Malignancy in Lung Cancer Screening

Keywords

Loading...
Thumbnail Image

Issue Date

2024-06-14

Language

en

Document type

Journal Title

Journal ISSN

Volume Title

Publisher

Title

ISSN

Volume

Issue

Startpage

Endpage

DOI

Abstract

1) Background: Current screening guidelines for radiologists in lung cancer state the relevance of examining not only nodule features in a CT scan, but also other abnormalities before classifying a nodule on malignancy risk. In deep learning models in literature, some models focus only on nodule features while others also take larger parts of the CT scan into account. However, due to their difference in model characteristics, it is hard to know which features are important to analyse. 2) Purpose: To develop a deep learning algorithm that can analyse the added value of outside-the-nodule (global) features on top of a nodule (local) feature focused model. 3) Materials and Methods: A two-stream deep learning algorithm was trained, where one stream analyses nodule block features and the other stream analyses features outside the nodule block. These parts are then combined to create a final nodule malignancy risk estimation. The model was trained with nodules from National Lung Screening Trial and validated with screening data from Danish Lung Cancer Screening Trial, and Multicentric Italian Lung Detection trial, and clinical data from Radboud UMC. From each testset, a nodule size-matched subset was gathered. Furthermore, the impact of screening variables emphysema and nodule location were analysed. Other experiments for model interpretability were performed using uncertainty cross-examinations and pixel attribution maps. 4) Results: For the test-data from screening trial DLCST the global, the local, and the combined model reached an AUC of 0.85, 0.93, and 0.93 respectively. For the screening trial of MILD, the models reached an AUC of 0.81, 0.97, 0.96 AUC. For the clinical data, the models had an AUC of 0.68, 0.90 and 0.89. Size matching benign and malignant nodules led to an overall decrease of performance for all models across all datasets. The screening variable nodule location showed significant influence on the outside-the-nodule malignancy risk estimations in both DLCST and MILD (p <0.001). The screening variable emphysema, which was only available in DLCST, did not have a significant influence (p = 0.091). Examination of uncertainty showed a decrease of each model performance on its own uncertain subset. Cross-examination showed a decrease in global model performance, when the local model was uncertain. However, if the global model was uncertain, the local model still performed well. Finally, Pixel-attribution examination shows the global model focuses on lung structure and bone tissue. 5) Conclusion: We managed to build the two-stream model and it converged to a reasonable performance. The combined model did not improve upon the local model, yet the global model performed above expectations without access to nodule characteristics. Furthermore, the global model verified the significance of Brock model variable nodule location for screening. Finally, pixel-attribution maps show focus on bone-tissue features, which needs further investigation. Index Terms—Lung cancer screening, Pulmonary nodule, Lung CT Scans, Deep learning, Medical Imaging, Machine learning, Emphysema, Upper lobe nodule, Bone-tissue, Coronary calcification, Brock model, Explainable AI, Pixel-attribution maps, Uncertainty

Description

Citation

Faculty

Faculteit der Sociale Wetenschappen