Predictive radiomics based ensemble machine learning approach in CT lung nodule diagnosis.

IF 1.8 Q3 ONCOLOGY
Arooj Nissar, A H Mir
{"title":"Predictive radiomics based ensemble machine learning approach in CT lung nodule diagnosis.","authors":"Arooj Nissar, A H Mir","doi":"10.1186/s43046-025-00326-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Computed tomography imaging, a non-invasive tool, is used around the globe by medical professionals to identify and diagnose lung cancer; a lethal disease with high rates of occurrence and mortality globally. Radiomics extracted from medical images, including computed tomography, in tandem with machine learning frameworks has received considerable focus and research for lung nodule identification.This investigation can help out clinicians to reach radiomics-based better and quicker decision support system for treatments and early diagnosis. However, it is still foggy and unclear which radiomics feature(s) to use for the prediction of pulmonary nodule. Consequently, this work is offered with an endeavor to efficiently apply machine learning techniques and radiomics to classify CT pulmonary nodules.</p><p><strong>Methods: </strong>Lung Image Data Consortium (LIDC), containing 1018 CT cancer cases, is put to use. The Wavelet Packet Transform is used in conjunction with geometrical features, gray level run length matrix, gray level co-occurrence method and gray level difference method techniques to extract radiomics. Two techniques, boosted and bagged ensemble classification trees, are employed to choose an apposite set of features. The categorization of nodules as malignant or benign is assessed by the utilization of cutting-edge machine learning models: Support Vector Machines, Boosted Classification Ensemble Tree, Decision Trees, Bagged Classification Ensemble Tree, RUSBoosted Ensemble Trees, Subspace Discriminant Ensemble and Subspace KNN Ensemble.</p><p><strong>Results: </strong>The findings reveal that the Ensemble Subspace KNN gives best AUROC (93.4%), accuracy (88.3%) and F1-score (85.2%) using BACET feature selection method. The best sensitivity is produced by FGSVM (97.1%). RUSBOCET gives best precision and specificity of 93.4% and 83.1% respectively.</p><p><strong>Conclusion: </strong>Lung Cancer remains the most common and deadly type of cancer. Early detection of lung lesions and nodules is crucial in the fight against lung cancer. The purpose of this study was to investigate radiomics based on geometrical, texture, and Daubechies WPT texture features for quantitative CT image analysis. The LIDC database was used in this study. Geometrical features, texture features based on three statistical methodologies (GLCM, GLDM GLRLM) and Daubechies WPT texture features are retrieved from the nodules. Using the ensemble EFS, BOCET and BACET, pertinent features were identified. Lastly, various cutting-edge ML classifiers were used to classify LC as malignant or benign. The out-turn shows that, using BACET EFS, Ensemble Subspace KNN gives best AUROC (93.4%), accuracy (88.3%) and F1-score (85.2%). FGSVM yields the best sensitivity of 97.1%. RUSBOCET gives best precision and best specificity of 93.4% and 83.1% respectively. Therefore, the methodology can be applied with efficacy to the CT based PN classification. Thus, the result can assist medical professionals in making better decisions and interventions.</p>","PeriodicalId":17301,"journal":{"name":"Journal of the Egyptian National Cancer Institute","volume":"37 1","pages":"68"},"PeriodicalIF":1.8000,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Egyptian National Cancer Institute","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s43046-025-00326-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Computed tomography imaging, a non-invasive tool, is used around the globe by medical professionals to identify and diagnose lung cancer; a lethal disease with high rates of occurrence and mortality globally. Radiomics extracted from medical images, including computed tomography, in tandem with machine learning frameworks has received considerable focus and research for lung nodule identification.This investigation can help out clinicians to reach radiomics-based better and quicker decision support system for treatments and early diagnosis. However, it is still foggy and unclear which radiomics feature(s) to use for the prediction of pulmonary nodule. Consequently, this work is offered with an endeavor to efficiently apply machine learning techniques and radiomics to classify CT pulmonary nodules.

Methods: Lung Image Data Consortium (LIDC), containing 1018 CT cancer cases, is put to use. The Wavelet Packet Transform is used in conjunction with geometrical features, gray level run length matrix, gray level co-occurrence method and gray level difference method techniques to extract radiomics. Two techniques, boosted and bagged ensemble classification trees, are employed to choose an apposite set of features. The categorization of nodules as malignant or benign is assessed by the utilization of cutting-edge machine learning models: Support Vector Machines, Boosted Classification Ensemble Tree, Decision Trees, Bagged Classification Ensemble Tree, RUSBoosted Ensemble Trees, Subspace Discriminant Ensemble and Subspace KNN Ensemble.

Results: The findings reveal that the Ensemble Subspace KNN gives best AUROC (93.4%), accuracy (88.3%) and F1-score (85.2%) using BACET feature selection method. The best sensitivity is produced by FGSVM (97.1%). RUSBOCET gives best precision and specificity of 93.4% and 83.1% respectively.

Conclusion: Lung Cancer remains the most common and deadly type of cancer. Early detection of lung lesions and nodules is crucial in the fight against lung cancer. The purpose of this study was to investigate radiomics based on geometrical, texture, and Daubechies WPT texture features for quantitative CT image analysis. The LIDC database was used in this study. Geometrical features, texture features based on three statistical methodologies (GLCM, GLDM GLRLM) and Daubechies WPT texture features are retrieved from the nodules. Using the ensemble EFS, BOCET and BACET, pertinent features were identified. Lastly, various cutting-edge ML classifiers were used to classify LC as malignant or benign. The out-turn shows that, using BACET EFS, Ensemble Subspace KNN gives best AUROC (93.4%), accuracy (88.3%) and F1-score (85.2%). FGSVM yields the best sensitivity of 97.1%. RUSBOCET gives best precision and best specificity of 93.4% and 83.1% respectively. Therefore, the methodology can be applied with efficacy to the CT based PN classification. Thus, the result can assist medical professionals in making better decisions and interventions.

基于预测放射组学的集成机器学习方法在CT肺结节诊断中的应用。
背景:计算机断层扫描成像是一种非侵入性工具,在全球范围内被医疗专业人员用于识别和诊断肺癌;一种在全球范围内具有高发病率和高死亡率的致命疾病。从医学图像(包括计算机断层扫描)中提取的放射组学与机器学习框架相结合,在肺结节识别方面受到了相当大的关注和研究。这项研究可以帮助临床医生更好更快地获得基于放射学的治疗和早期诊断决策支持系统。然而,目前仍不清楚哪些放射组学特征可用于预测肺结节。因此,本工作旨在有效地应用机器学习技术和放射组学对CT肺结节进行分类。方法:利用肺影像资料联盟(LIDC)收录的1018例CT肿瘤病例。将小波包变换与几何特征、灰度行程矩阵、灰度共生法和灰度差分法等技术相结合,提取放射组学。采用两种技术,增强和袋装集成分类树,以选择合适的一组特征。利用尖端的机器学习模型来评估结节的恶性或良性分类:支持向量机,增强分类集成树,决策树,袋式分类集成树,rusboosting集成树,子空间判别集成和子空间KNN集成。结果:使用BACET特征选择方法,集成子空间KNN的AUROC(93.4%)、准确率(88.3%)和f1得分(85.2%)最佳。FGSVM的灵敏度最高(97.1%)。RUSBOCET的精密度和特异度分别为93.4%和83.1%。结论:肺癌仍然是最常见和最致命的癌症类型。早期发现肺部病变和结节对于抗击肺癌至关重要。本研究的目的是研究基于几何、纹理和Daubechies WPT纹理特征的放射组学,用于定量CT图像分析。本研究使用LIDC数据库。基于三种统计方法(GLCM、GLDM、GLRLM)提取结节的几何特征、纹理特征和Daubechies WPT纹理特征。使用集成的EFS、BOCET和BACET识别相关特征。最后,使用各种尖端的ML分类器对LC进行恶性或良性分类。结果表明,使用BACET EFS,集成子空间KNN给出了最好的AUROC(93.4%)、准确率(88.3%)和f1分数(85.2%)。FGSVM的灵敏度为97.1%。RUSBOCET的精密度和特异性分别为93.4%和83.1%。因此,该方法可以有效地应用于基于CT的PN分类。因此,该结果可以帮助医疗专业人员做出更好的决策和干预措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.50
自引率
0.00%
发文量
46
审稿时长
11 weeks
期刊介绍: As the official publication of the National Cancer Institute, Cairo University, the Journal of the Egyptian National Cancer Institute (JENCI) is an open access peer-reviewed journal that publishes on the latest innovations in oncology and thereby, providing academics and clinicians a leading research platform. JENCI welcomes submissions pertaining to all fields of basic, applied and clinical cancer research. Main topics of interest include: local and systemic anticancer therapy (with specific interest on applied cancer research from developing countries); experimental oncology; early cancer detection; randomized trials (including negatives ones); and key emerging fields of personalized medicine, such as molecular pathology, bioinformatics, and biotechnologies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信