Decision tree model for predicting ovarian tumor malignancy based on clinical markers and preoperative circulating blood cells.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Yingjia Li, Xingping Zhao, Yanhua Zhou, Lina Gong, Enuo Peng
{"title":"Decision tree model for predicting ovarian tumor malignancy based on clinical markers and preoperative circulating blood cells.","authors":"Yingjia Li, Xingping Zhao, Yanhua Zhou, Lina Gong, Enuo Peng","doi":"10.1186/s12911-025-02934-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Ovarian cancer is a serious malignant tumor threatening women's health. The early diagnosis and effective treatments of ovarian cancer remain inadequate, and about 70% of ovarian cancers are in advanced stages when discovered. This study aimed to use the decision tree method of artificial intelligence machine learning to build a model for predicting the benign and malignant degree of ovarian cancer patients.</p><p><strong>Study design: </strong>A total of 758 patients were included in the study. These patients were diagnosed by B-ultrasound, CT or MR. The clinicopathological features and circulating blood cell indexes were recorded and analyzed. The prediction model of benign and malignant ovarian tumors was constructed by CART decision tree, and the receiver operating characteristic (ROC) curve was drawn to evaluate the predictive value of the decision tree model.</p><p><strong>Results: </strong>It was found that significant predictor variables included age, disease duration, patient general condition and menopausal status, ascites, tumor size, HE4, CA125, ROMA index, and blood routine related indicators (except for basophil count percentage and absolute value). In the constructed decision tree model, ROMA_after was the root node with the maximum information gain. ROMA_after, Mass size (MR/CT), HE4, CA125, platelet number, lymphocyte ratio, white blood cell count, post-menopause, hematocrit and mean platelet volume were important indicators in the decision tree model. The area under the receiver operating characteristic curve of this model for predicting benign and malignant ovarian cancer was 0.86.</p><p><strong>Conclusions: </strong>The decision tree model was successfully constructed based on clinical indicators and preoperative circulating blood cells, and showed better results in predicting benign and malignant ovarian cancer than alone imaging indicators or biomarkers among our data, which means that our model can more accurately predict benign and malignant ovarian cancer.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"94"},"PeriodicalIF":3.3000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844102/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02934-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: Ovarian cancer is a serious malignant tumor threatening women's health. The early diagnosis and effective treatments of ovarian cancer remain inadequate, and about 70% of ovarian cancers are in advanced stages when discovered. This study aimed to use the decision tree method of artificial intelligence machine learning to build a model for predicting the benign and malignant degree of ovarian cancer patients.

Study design: A total of 758 patients were included in the study. These patients were diagnosed by B-ultrasound, CT or MR. The clinicopathological features and circulating blood cell indexes were recorded and analyzed. The prediction model of benign and malignant ovarian tumors was constructed by CART decision tree, and the receiver operating characteristic (ROC) curve was drawn to evaluate the predictive value of the decision tree model.

Results: It was found that significant predictor variables included age, disease duration, patient general condition and menopausal status, ascites, tumor size, HE4, CA125, ROMA index, and blood routine related indicators (except for basophil count percentage and absolute value). In the constructed decision tree model, ROMA_after was the root node with the maximum information gain. ROMA_after, Mass size (MR/CT), HE4, CA125, platelet number, lymphocyte ratio, white blood cell count, post-menopause, hematocrit and mean platelet volume were important indicators in the decision tree model. The area under the receiver operating characteristic curve of this model for predicting benign and malignant ovarian cancer was 0.86.

Conclusions: The decision tree model was successfully constructed based on clinical indicators and preoperative circulating blood cells, and showed better results in predicting benign and malignant ovarian cancer than alone imaging indicators or biomarkers among our data, which means that our model can more accurately predict benign and malignant ovarian cancer.

基于临床指标和术前循环血细胞预测卵巢肿瘤恶性的决策树模型。
目的:卵巢癌是危害妇女健康的严重恶性肿瘤。卵巢癌的早期诊断和有效治疗仍然不足,约70%的卵巢癌在发现时已处于晚期。本研究旨在利用人工智能机器学习的决策树方法,构建卵巢癌患者良恶性程度预测模型。研究设计:共纳入758例患者。经b超、CT或mr诊断,记录临床病理特征及循环血细胞指标。采用CART决策树构建卵巢良恶性肿瘤预测模型,并绘制受试者工作特征(ROC)曲线,评价决策树模型的预测价值。结果:发现年龄、病程、患者一般情况及绝经状态、腹水、肿瘤大小、HE4、CA125、ROMA指数、血常规相关指标(除嗜碱性粒细胞计数百分比及绝对值外)均为显著预测变量。在构建的决策树模型中,ROMA_after为信息增益最大的根节点。ROMA_after、Mass size (MR/CT)、HE4、CA125、血小板数量、淋巴细胞比、白细胞计数、绝经后、红细胞压积、平均血小板体积是决策树模型的重要指标。该模型预测卵巢癌良恶性的受试者工作特征曲线下面积为0.86。结论:基于临床指标和术前循环血细胞构建的决策树模型构建成功,预测卵巢癌良恶性的效果优于我们数据中单独的影像学指标或生物标志物,说明我们的模型可以更准确地预测卵巢癌良恶性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信