Radiomic detection of EGFR mutations in NSCLC
Rossi, Giovanni;Fedeli, Alessandro;Zullo, Lodovica;Tagliamento, Marco;Genova, Carlo
2021-01-01
Abstract
Radiomics is defined as the use of automated or semi-automated post-processing and analysis of multiple features derived from imaging exams. Extracted features might generate models able to predict the molecular profile of solid tumors. The aim of this study was to develop a predictive algorithm to define the mutational status of epidermal growth factor receptor (EGFR) in treatment-naive patients with advanced non-small cell lung cancer (NSCLC). Computed tomography (CT) scans from 109 treatment-naive NSCLC patients (21 EGFR-mutant and 88 EGFR wild-type, WT) underwent radiomics analysis in order to develop a machine learning model able to distinguish EGFR-mutant from EGFR-WT patients via CT scans. A "test-retest" approach was used to identify stable radiomics features. The accuracy of the model was tested on an external validation set from another institution and on a dataset from The Cancer Imaging Archive (TCIA). The machine learning model that considered both radiomic and clinical features (gender and smoking status) reached a diagnostic accuracy of 88.1% in our dataset, with an area under the ROC curve (AUC) of 0.85, while the accuracies in the datasets from TCIA and the external institution were 76.6% and 83.3%, respectively. Furthermore, 17 distinct radiomics features detected at baseline CT scan were associated with subsequent development of the T790M mutation during treatment with an EGFR inhibitor. In conclusion, our machine learning model was able to identify EGFR-mutant patients in multiple validation sets with globally good accuracy, especially after data optimization. More comprehensive training sets might result in further improvement of radiomics-based algorithms.
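The "test-retest" selection of stable features mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which stability metric or cut-off the authors used; Lin's concordance correlation coefficient (CCC), the 0.85 threshold, and the names `concordance_ccc` and `stable_features` are assumptions made purely for illustration, not the study's method.

```python
from statistics import fmean


def concordance_ccc(test, retest):
    """Lin's concordance correlation coefficient between two paired series.

    Assumption: CCC is used here only as one common stability metric;
    the abstract does not name the metric actually used.
    """
    n = len(test)
    mean_t, mean_r = fmean(test), fmean(retest)
    # Population (divide-by-n) variances and covariance.
    var_t = sum((t - mean_t) ** 2 for t in test) / n
    var_r = sum((r - mean_r) ** 2 for r in retest) / n
    cov = sum((t - mean_t) * (r - mean_r) for t, r in zip(test, retest)) / n
    denom = var_t + var_r + (mean_t - mean_r) ** 2
    # Two identical constant series give denom == 0; treat them as not informative.
    return 2 * cov / denom if denom else 0.0


def stable_features(test_scan, retest_scan, threshold=0.85):
    """Return names of features whose test-retest CCC meets the threshold.

    test_scan / retest_scan: dict mapping feature name -> list of values,
    one value per patient, in the same patient order (hypothetical layout).
    threshold: hypothetical cut-off, not taken from the study.
    """
    return [
        name
        for name in test_scan
        if concordance_ccc(test_scan[name], retest_scan[name]) >= threshold
    ]


# Toy values: "feature_a" repeats exactly, "feature_b" is reversed on retest.
first = {"feature_a": [1.0, 2.0, 3.0, 4.0], "feature_b": [1.0, 2.0, 3.0, 4.0]}
second = {"feature_a": [1.0, 2.0, 3.0, 4.0], "feature_b": [4.0, 3.0, 2.0, 1.0]}
selected = stable_features(first, second)
```

In this toy input only `feature_a` survives the filter; in a pipeline of this kind, the retained features would then be passed to the classifier together with the clinical variables.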
Attached file: 22 oct 20 Main Manuscript radiomics EGFR.docx (journal article, pre-print version, open access; Microsoft Word XML, 99.56 kB)
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.