Hybrid Approach to Classifying Histological Subtypes of Non-small Cell Lung Cancer (NSCLC): Combining Radiomics and Deep Learning Features from CT Images.
Geon Oh, Yongha Gi, Jeongshim Lee, Hunjung Kim, Hong-Gyun Wu, Jong Min Park, Eunae Choi, Dongho Shin, Myonggeun Yoon, Boram Lee, Jaeman Son
{"title":"Hybrid Approach to Classifying Histological Subtypes of Non-small Cell Lung Cancer (NSCLC): Combining Radiomics and Deep Learning Features from CT Images.","authors":"Geon Oh, Yongha Gi, Jeongshim Lee, Hunjung Kim, Hong-Gyun Wu, Jong Min Park, Eunae Choi, Dongho Shin, Myonggeun Yoon, Boram Lee, Jaeman Son","doi":"10.1007/s10278-025-01442-5","DOIUrl":null,"url":null,"abstract":"<p><p>This study aimed to develop a hybrid model combining radiomics and deep learning features derived from computed tomography (CT) images to classify histological subtypes of non-small cell lung cancer (NSCLC). We analyzed CT images and radiomics features from 235 patients with NSCLC, including 110 with adenocarcinoma (ADC) and 112 with squamous cell carcinoma (SCC). The dataset was split into a training set (75%) and a test set (25%). External validation was conducted using the NSCLC-Radiomics database, comprising 24 patients each with ADC and SCC. A total of 1409 radiomics and 8192 deep features underwent principal component analysis (PCA) and ℓ2,1-norm minimization for feature reduction and selection. The optimal feature sets for classification included 27 radiomics features, 20 deep features, and 55 combined features (30 deep and 25 radiomics). The average area under the receiver operating characteristic curve (AUC) for radiomics, deep, and combined features were 0.6568, 0.6689, and 0.7209, respectively, across the internal and external test sets. Corresponding average accuracies were 0.6013, 0.6376, and 0.6564. The combined model demonstrated superior performance in classifying NSCLC subtypes, achieving higher AUC and accuracy in both test datasets. These results suggest that the proposed hybrid approach could enhance the accuracy and reliability of NSCLC subtype classification.</p>","PeriodicalId":516858,"journal":{"name":"Journal of imaging informatics in medicine","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of imaging informatics in medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10278-025-01442-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study aimed to develop a hybrid model combining radiomics and deep learning features derived from computed tomography (CT) images to classify histological subtypes of non-small cell lung cancer (NSCLC). We analyzed CT images and radiomics features from 235 patients with NSCLC, including 110 with adenocarcinoma (ADC) and 112 with squamous cell carcinoma (SCC). The dataset was split into a training set (75%) and a test set (25%). External validation was conducted using the NSCLC-Radiomics database, comprising 24 patients each with ADC and SCC. A total of 1409 radiomics and 8192 deep features underwent principal component analysis (PCA) and ℓ2,1-norm minimization for feature reduction and selection. The optimal feature sets for classification included 27 radiomics features, 20 deep features, and 55 combined features (30 deep and 25 radiomics). The average area under the receiver operating characteristic curve (AUC) for radiomics, deep, and combined features were 0.6568, 0.6689, and 0.7209, respectively, across the internal and external test sets. Corresponding average accuracies were 0.6013, 0.6376, and 0.6564. The combined model demonstrated superior performance in classifying NSCLC subtypes, achieving higher AUC and accuracy in both test datasets. These results suggest that the proposed hybrid approach could enhance the accuracy and reliability of NSCLC subtype classification.