基于VASARI特征的WHO分级、异柠檬酸脱氢酶突变和1p19q共缺失状态的预测性机器学习模型:一项多中心研究。

IF 3.6 3区 医学 Q2 ONCOLOGY
American journal of cancer research Pub Date : 2024-08-25 eCollection Date: 2024-01-01 DOI:10.62347/MZLF2460
Wei Zhao, Chao Xie, Kukun Hanjiaerbieke, Rui Xu, Tuxunjiang Pahati, Shaoyu Wang, Junjie Li, Yunling Wang
{"title":"基于VASARI特征的WHO分级、异柠檬酸脱氢酶突变和1p19q共缺失状态的预测性机器学习模型:一项多中心研究。","authors":"Wei Zhao, Chao Xie, Kukun Hanjiaerbieke, Rui Xu, Tuxunjiang Pahati, Shaoyu Wang, Junjie Li, Yunling Wang","doi":"10.62347/MZLF2460","DOIUrl":null,"url":null,"abstract":"<p><p>The objective of our study was to develop predictive models using Visually Accessible Rembrandt Images (VASARI) magnetic resonance imaging (MRI) features combined with machine learning techniques to predict the World Health Organization (WHO) grade, isocitrate dehydrogenase (IDH) mutation status, and 1p19q co-deletion status of high-grade gliomas. To achieve this, we retrospectively included 485 patients with high-grade glioma from the First Affiliated Hospital of Xinjiang Medical University, of which 312 patients were randomly divided into a training set (n=218) and a test set (n=94) in a 7:3 ratio. Twenty-five VASARI MRI features were selected from an initial set of 30, and three machine learning models - Multilayer Perceptron (MP), Bernoulli Naive Bayes (BNB), and Logistic Regression (LR) - were trained using the training set. The most informative features were identified using recursive feature elimination. Model performance was assessed using the test set and an independent validation set of 173 patients from Beijing Tiantan Hospital. The results indicated that the MP model exhibited the highest predictive accuracy on the training set, achieving an area under the curve (AUC) close to 1, indicating perfect discrimination. However, its performance decreased in the test and validation sets; particularly for predicting the 1p19q co-deletion status, the AUC was only 0.703, suggesting potential overfitting. On the other hand, the BNB model demonstrated robust generalization on the test and validation sets, with AUC values of 0.8292 and 0.8106, respectively, for predicting IDH mutation status and 1p19q co-deletion status, indicating high accuracy, sensitivity, and specificity. The LR model also showed good performance with AUCs of 0.7845 and 0.8674 on the test and validation sets, respectively, for predicting IDH mutation status, although it was slightly inferior to the BNB model for the 1p19q co-deletion status. In conclusion, integrating VASARI MRI features with machine learning techniques shows promise for the non-invasive prediction of glioma molecular markers, which could guide treatment strategies and improve prognosis in glioma patients. Nonetheless, further model optimization and validation are necessary to enhance its clinical utility.</p>","PeriodicalId":7437,"journal":{"name":"American journal of cancer research","volume":"14 8","pages":"3826-3841"},"PeriodicalIF":3.6000,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11387855/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predictive machine learning models based on VASARI features for WHO grading, isocitrate dehydrogenase mutation, and 1p19q co-deletion status: a multicenter study.\",\"authors\":\"Wei Zhao, Chao Xie, Kukun Hanjiaerbieke, Rui Xu, Tuxunjiang Pahati, Shaoyu Wang, Junjie Li, Yunling Wang\",\"doi\":\"10.62347/MZLF2460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The objective of our study was to develop predictive models using Visually Accessible Rembrandt Images (VASARI) magnetic resonance imaging (MRI) features combined with machine learning techniques to predict the World Health Organization (WHO) grade, isocitrate dehydrogenase (IDH) mutation status, and 1p19q co-deletion status of high-grade gliomas. To achieve this, we retrospectively included 485 patients with high-grade glioma from the First Affiliated Hospital of Xinjiang Medical University, of which 312 patients were randomly divided into a training set (n=218) and a test set (n=94) in a 7:3 ratio. Twenty-five VASARI MRI features were selected from an initial set of 30, and three machine learning models - Multilayer Perceptron (MP), Bernoulli Naive Bayes (BNB), and Logistic Regression (LR) - were trained using the training set. The most informative features were identified using recursive feature elimination. Model performance was assessed using the test set and an independent validation set of 173 patients from Beijing Tiantan Hospital. The results indicated that the MP model exhibited the highest predictive accuracy on the training set, achieving an area under the curve (AUC) close to 1, indicating perfect discrimination. However, its performance decreased in the test and validation sets; particularly for predicting the 1p19q co-deletion status, the AUC was only 0.703, suggesting potential overfitting. On the other hand, the BNB model demonstrated robust generalization on the test and validation sets, with AUC values of 0.8292 and 0.8106, respectively, for predicting IDH mutation status and 1p19q co-deletion status, indicating high accuracy, sensitivity, and specificity. The LR model also showed good performance with AUCs of 0.7845 and 0.8674 on the test and validation sets, respectively, for predicting IDH mutation status, although it was slightly inferior to the BNB model for the 1p19q co-deletion status. In conclusion, integrating VASARI MRI features with machine learning techniques shows promise for the non-invasive prediction of glioma molecular markers, which could guide treatment strategies and improve prognosis in glioma patients. Nonetheless, further model optimization and validation are necessary to enhance its clinical utility.</p>\",\"PeriodicalId\":7437,\"journal\":{\"name\":\"American journal of cancer research\",\"volume\":\"14 8\",\"pages\":\"3826-3841\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2024-08-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11387855/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American journal of cancer research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.62347/MZLF2460\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.62347/MZLF2460","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

我们的研究旨在利用可视化伦勃朗图像(VASARI)磁共振成像(MRI)特征结合机器学习技术开发预测模型,以预测高级别胶质瘤的世界卫生组织(WHO)分级、异柠檬酸脱氢酶(IDH)突变状态和1p19q共缺失状态。为此,我们回顾性地纳入了新疆医科大学第一附属医院的485名高级别胶质瘤患者,将其中的312名患者按7:3的比例随机分为训练集(n=218)和测试集(n=94)。从初始的 30 个 VASARI MRI 特征集中选出 25 个特征,并使用训练集训练了三种机器学习模型--多层感知器(MP)、伯努利-奈维贝叶斯(BNB)和逻辑回归(LR)。使用递归特征消除法确定了信息量最大的特征。使用测试集和来自北京天坛医院的 173 名患者组成的独立验证集评估了模型性能。结果表明,MP 模型在训练集上的预测准确率最高,曲线下面积(AUC)接近 1,表明其具有完美的分辨能力。然而,其在测试集和验证集上的表现却有所下降;特别是在预测 1p19q 共缺失状态时,AUC 仅为 0.703,表明可能存在过度拟合。另一方面,BNB 模型在测试集和验证集上表现出很强的泛化能力,预测 IDH 突变状态和 1p19q 共缺失状态的 AUC 值分别为 0.8292 和 0.8106,表明该模型具有很高的准确性、灵敏度和特异性。在预测 IDH 突变状态时,LR 模型也表现出良好的性能,在测试集和验证集上的 AUC 分别为 0.7845 和 0.8674,但在预测 1p19q 共缺失状态时,LR 模型的 AUC 稍逊于 BNB 模型。总之,将 VASARI MRI 特征与机器学习技术相结合,有望对胶质瘤分子标记进行无创预测,从而指导治疗策略并改善胶质瘤患者的预后。然而,要提高其临床实用性,还需要对模型进行进一步的优化和验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predictive machine learning models based on VASARI features for WHO grading, isocitrate dehydrogenase mutation, and 1p19q co-deletion status: a multicenter study.

The objective of our study was to develop predictive models using Visually Accessible Rembrandt Images (VASARI) magnetic resonance imaging (MRI) features combined with machine learning techniques to predict the World Health Organization (WHO) grade, isocitrate dehydrogenase (IDH) mutation status, and 1p19q co-deletion status of high-grade gliomas. To achieve this, we retrospectively included 485 patients with high-grade glioma from the First Affiliated Hospital of Xinjiang Medical University, of which 312 patients were randomly divided into a training set (n=218) and a test set (n=94) in a 7:3 ratio. Twenty-five VASARI MRI features were selected from an initial set of 30, and three machine learning models - Multilayer Perceptron (MP), Bernoulli Naive Bayes (BNB), and Logistic Regression (LR) - were trained using the training set. The most informative features were identified using recursive feature elimination. Model performance was assessed using the test set and an independent validation set of 173 patients from Beijing Tiantan Hospital. The results indicated that the MP model exhibited the highest predictive accuracy on the training set, achieving an area under the curve (AUC) close to 1, indicating perfect discrimination. However, its performance decreased in the test and validation sets; particularly for predicting the 1p19q co-deletion status, the AUC was only 0.703, suggesting potential overfitting. On the other hand, the BNB model demonstrated robust generalization on the test and validation sets, with AUC values of 0.8292 and 0.8106, respectively, for predicting IDH mutation status and 1p19q co-deletion status, indicating high accuracy, sensitivity, and specificity. The LR model also showed good performance with AUCs of 0.7845 and 0.8674 on the test and validation sets, respectively, for predicting IDH mutation status, although it was slightly inferior to the BNB model for the 1p19q co-deletion status. In conclusion, integrating VASARI MRI features with machine learning techniques shows promise for the non-invasive prediction of glioma molecular markers, which could guide treatment strategies and improve prognosis in glioma patients. Nonetheless, further model optimization and validation are necessary to enhance its clinical utility.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
3.80%
发文量
263
期刊介绍: The American Journal of Cancer Research (AJCR) (ISSN 2156-6976), is an independent open access, online only journal to facilitate rapid dissemination of novel discoveries in basic science and treatment of cancer. It was founded by a group of scientists for cancer research and clinical academic oncologists from around the world, who are devoted to the promotion and advancement of our understanding of the cancer and its treatment. The scope of AJCR is intended to encompass that of multi-disciplinary researchers from any scientific discipline where the primary focus of the research is to increase and integrate knowledge about etiology and molecular mechanisms of carcinogenesis with the ultimate aim of advancing the cure and prevention of this increasingly devastating disease. To achieve these aims AJCR will publish review articles, original articles and new techniques in cancer research and therapy. It will also publish hypothesis, case reports and letter to the editor. Unlike most other open access online journals, AJCR will keep most of the traditional features of paper print that we are all familiar with, such as continuous volume, issue numbers, as well as continuous page numbers to retain our comfortable familiarity towards an academic journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信