用于无创预测弥漫性胶质瘤分子亚型的三分类机器学习模型:一项双中心研究。

IF 2.9 2区 医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Quantitative Imaging in Medicine and Surgery Pub Date : 2025-06-06 Epub Date: 2025-05-29 DOI:10.21037/qims-24-2461
Meilin Zhu, Weishu Hou, Jiahao Gao, Fang Han, Shanshan Huang, Xiaohu Li, Longlin Yin, Jiawen Zhang
{"title":"用于无创预测弥漫性胶质瘤分子亚型的三分类机器学习模型:一项双中心研究。","authors":"Meilin Zhu, Weishu Hou, Jiahao Gao, Fang Han, Shanshan Huang, Xiaohu Li, Longlin Yin, Jiawen Zhang","doi":"10.21037/qims-24-2461","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Determining the molecular status of gliomas is crucial for evaluating treatment efficacy and prognosis. However, this process currently requires the invasive and cumbersome method of histological analysis. We aimed to develop and validate a non-invasive three-classification machine learning (ML) model to predict the three molecular subtypes of adult-type diffuse gliomas according to the 2021 World Health Organization classification of tumors of the central nervous system 5<sup>th</sup> edition (WHO CNS 5).</p><p><strong>Methods: </strong>This retrospective study included a total of 306 glioma patients, among whom 258 were from Center 1 (Huashan Hospital; 180 for the training and 78 for the internal validation set) and 48 were from Center 2 (The First Affiliated Hospital of Anhui Medical University; external validation set). Conventional magnetic resonance imaging (MRI) features of tumors were assessed, and the radiomics and Swin Transformer-based deep learning (RSTD) features were respectively extracted from tumor segmentation on axial three-dimensional contrast-enhanced T1-weighted (3D T1C) and T2-fluid-attenuated inversion recovery (T2-FLAIR) sequences. Three types of prediction models: conventional MRI (CM) model, RSTD model, and combined model were respectively trained using six ML classifiers [k-nearest neighbor (kNN), light gradient-boosting machine (LightGBM), random forest (RF), support vector machine (SVM), stochastic gradient descent (SGD), and extreme gradient boosting (XGBoost)] to identify the three major molecular subtypes of adult-type diffuse gliomas. The performance of the models was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, specificity, accuracy, precision, and F1-score.</p><p><strong>Results: </strong>XGBoost classifier was chosen as our algorithm for model construction due to its superior performance in the training and internal validation cohorts. The combined model, which incorporates CM features, RSTD features, as well as demographic features, achieved best performance in the internal [micro-AUC (0.905) and macro-AUC (0.878)] and external validation sets [micro-AUC (0.911) and macro-AUC (0.891)]. The SHapley Additive explanations (SHAP) and gradient-weighted class activation mapping (Grad-CAM) were used to explain the model.</p><p><strong>Conclusions: </strong>Our study constructed a three-classification ML model that combined CM features, RSTD features, and demographic characteristics, achieved promising performance in predicting molecular subtypes of diffuse glioma. The combined model provided a non-invasive, timely, and accurate diagnostic approach prior to patient treatment to assist clinical decision-making.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 6","pages":"5752-5768"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12209611/pdf/","citationCount":"0","resultStr":"{\"title\":\"A three-classification machine learning model for non-invasive prediction of molecular subtypes in diffuse glioma: a two-center study.\",\"authors\":\"Meilin Zhu, Weishu Hou, Jiahao Gao, Fang Han, Shanshan Huang, Xiaohu Li, Longlin Yin, Jiawen Zhang\",\"doi\":\"10.21037/qims-24-2461\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Determining the molecular status of gliomas is crucial for evaluating treatment efficacy and prognosis. However, this process currently requires the invasive and cumbersome method of histological analysis. We aimed to develop and validate a non-invasive three-classification machine learning (ML) model to predict the three molecular subtypes of adult-type diffuse gliomas according to the 2021 World Health Organization classification of tumors of the central nervous system 5<sup>th</sup> edition (WHO CNS 5).</p><p><strong>Methods: </strong>This retrospective study included a total of 306 glioma patients, among whom 258 were from Center 1 (Huashan Hospital; 180 for the training and 78 for the internal validation set) and 48 were from Center 2 (The First Affiliated Hospital of Anhui Medical University; external validation set). Conventional magnetic resonance imaging (MRI) features of tumors were assessed, and the radiomics and Swin Transformer-based deep learning (RSTD) features were respectively extracted from tumor segmentation on axial three-dimensional contrast-enhanced T1-weighted (3D T1C) and T2-fluid-attenuated inversion recovery (T2-FLAIR) sequences. Three types of prediction models: conventional MRI (CM) model, RSTD model, and combined model were respectively trained using six ML classifiers [k-nearest neighbor (kNN), light gradient-boosting machine (LightGBM), random forest (RF), support vector machine (SVM), stochastic gradient descent (SGD), and extreme gradient boosting (XGBoost)] to identify the three major molecular subtypes of adult-type diffuse gliomas. The performance of the models was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, specificity, accuracy, precision, and F1-score.</p><p><strong>Results: </strong>XGBoost classifier was chosen as our algorithm for model construction due to its superior performance in the training and internal validation cohorts. The combined model, which incorporates CM features, RSTD features, as well as demographic features, achieved best performance in the internal [micro-AUC (0.905) and macro-AUC (0.878)] and external validation sets [micro-AUC (0.911) and macro-AUC (0.891)]. The SHapley Additive explanations (SHAP) and gradient-weighted class activation mapping (Grad-CAM) were used to explain the model.</p><p><strong>Conclusions: </strong>Our study constructed a three-classification ML model that combined CM features, RSTD features, and demographic characteristics, achieved promising performance in predicting molecular subtypes of diffuse glioma. The combined model provided a non-invasive, timely, and accurate diagnostic approach prior to patient treatment to assist clinical decision-making.</p>\",\"PeriodicalId\":54267,\"journal\":{\"name\":\"Quantitative Imaging in Medicine and Surgery\",\"volume\":\"15 6\",\"pages\":\"5752-5768\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12209611/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Imaging in Medicine and Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/qims-24-2461\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/5/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-24-2461","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/29 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

摘要

背景:确定胶质瘤的分子状态对评估治疗效果和预后至关重要。然而,这一过程目前需要侵入性和繁琐的组织学分析方法。我们旨在根据2021年世界卫生组织中枢神经系统肿瘤分类第5版(WHO CNS 5),开发并验证一种非侵入性三分类机器学习(ML)模型,以预测成人型弥漫性胶质瘤的三种分子亚型。方法:本回顾性研究共纳入306例胶质瘤患者,其中258例来自华山医院第一中心;培训组180例,内部验证组78例),第二中心48例(安徽医科大学第一附属医院;外部验证集)。评估肿瘤的常规磁共振成像(MRI)特征,分别在轴向三维增强t1加权(3D T1C)和t2 -流体衰减反转恢复(T2-FLAIR)序列上提取肿瘤分割的放射组学和基于Swin变压器的深度学习(RSTD)特征。采用6个ML分类器[k-nearest neighbor (kNN)、light gradient-boosting machine (LightGBM)、random forest (RF)、support vector machine (SVM)、stochastic gradient- descent (SGD)、extreme gradient-boost (XGBoost)]分别训练了常规MRI (CM)模型、RSTD模型和组合模型3种预测模型,以识别成人型弥漫性胶质瘤的3种主要分子亚型。采用受试者工作特征(ROC)曲线下面积(AUC)、敏感性、特异性、准确性、精密度和f1评分来评价模型的性能。结果:选择XGBoost分类器作为我们的模型构建算法,因为它在训练和内部验证队列中表现优异。结合CM特征、RSTD特征和人口统计学特征的组合模型在内部验证集[微观auc(0.905)和宏观auc(0.878)]和外部验证集[微观auc(0.911)和宏观auc(0.891)]上取得了最佳性能。采用SHapley加性解释(SHAP)和梯度加权类激活映射(Grad-CAM)对模型进行解释。结论:我们的研究构建了一个结合CM特征、RSTD特征和人口学特征的三分类ML模型,在预测弥漫性胶质瘤的分子亚型方面取得了很好的效果。联合模型为患者治疗前提供了一种无创、及时、准确的诊断方法,以辅助临床决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A three-classification machine learning model for non-invasive prediction of molecular subtypes in diffuse glioma: a two-center study.

Background: Determining the molecular status of gliomas is crucial for evaluating treatment efficacy and prognosis. However, this process currently requires the invasive and cumbersome method of histological analysis. We aimed to develop and validate a non-invasive three-classification machine learning (ML) model to predict the three molecular subtypes of adult-type diffuse gliomas according to the 2021 World Health Organization classification of tumors of the central nervous system 5th edition (WHO CNS 5).

Methods: This retrospective study included a total of 306 glioma patients, among whom 258 were from Center 1 (Huashan Hospital; 180 for the training and 78 for the internal validation set) and 48 were from Center 2 (The First Affiliated Hospital of Anhui Medical University; external validation set). Conventional magnetic resonance imaging (MRI) features of tumors were assessed, and the radiomics and Swin Transformer-based deep learning (RSTD) features were respectively extracted from tumor segmentation on axial three-dimensional contrast-enhanced T1-weighted (3D T1C) and T2-fluid-attenuated inversion recovery (T2-FLAIR) sequences. Three types of prediction models: conventional MRI (CM) model, RSTD model, and combined model were respectively trained using six ML classifiers [k-nearest neighbor (kNN), light gradient-boosting machine (LightGBM), random forest (RF), support vector machine (SVM), stochastic gradient descent (SGD), and extreme gradient boosting (XGBoost)] to identify the three major molecular subtypes of adult-type diffuse gliomas. The performance of the models was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, specificity, accuracy, precision, and F1-score.

Results: XGBoost classifier was chosen as our algorithm for model construction due to its superior performance in the training and internal validation cohorts. The combined model, which incorporates CM features, RSTD features, as well as demographic features, achieved best performance in the internal [micro-AUC (0.905) and macro-AUC (0.878)] and external validation sets [micro-AUC (0.911) and macro-AUC (0.891)]. The SHapley Additive explanations (SHAP) and gradient-weighted class activation mapping (Grad-CAM) were used to explain the model.

Conclusions: Our study constructed a three-classification ML model that combined CM features, RSTD features, and demographic characteristics, achieved promising performance in predicting molecular subtypes of diffuse glioma. The combined model provided a non-invasive, timely, and accurate diagnostic approach prior to patient treatment to assist clinical decision-making.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Quantitative Imaging in Medicine and Surgery
Quantitative Imaging in Medicine and Surgery Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
4.20
自引率
17.90%
发文量
252
期刊介绍: Information not localized
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信