基于改进型 XGBoost 的胃癌患者失衡生存预测,具有成本敏感性和病灶损失性

IF 3 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Expert Systems Pub Date : 2024-07-03 DOI:10.1111/exsy.13666
Liangchen Xu, Chonghui Guo
{"title":"基于改进型 XGBoost 的胃癌患者失衡生存预测,具有成本敏感性和病灶损失性","authors":"Liangchen Xu, Chonghui Guo","doi":"10.1111/exsy.13666","DOIUrl":null,"url":null,"abstract":"Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss\",\"authors\":\"Liangchen Xu, Chonghui Guo\",\"doi\":\"10.1111/exsy.13666\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.\",\"PeriodicalId\":51053,\"journal\":{\"name\":\"Expert Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1111/exsy.13666\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1111/exsy.13666","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

准确预测胃癌患者的生存状态是临床决策的重要任务之一。许多先进的机器学习分类技术已被应用于预测癌症患者三年或五年后的生存状况,然而,由于类的不平衡,许多分类技术的灵敏度较低。由于胃癌患者的预后较差,这是一个不可忽视的问题。此外,医疗领域的模型需要较强的可解释性,以提高其适用性。由于 XGBoost 模型具有更好的性能和可解释性,我们为 XGBoost 设计了一个损失函数,从算法层面考虑了成本敏感损失和焦点损失,以解决不平衡问题。我们将改进后的模型应用于胃癌患者生存状况的预测,并分析了相关的重要特征。我们使用两类指标对模型进行评估,并设计了两个模型预测结果的混淆矩阵来比较两个模型。结果表明,改进后的模型性能更好。此外,我们还计算了三个不同时间段内与生存相关的特征的重要性,并分析了其演变过程,这些结果与现有的临床研究一致,或进一步扩展了其研究结论。这些都为临床相关决策提供了支持,并有可能扩展到其他癌症患者的生存预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss
Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Expert Systems
Expert Systems 工程技术-计算机:理论方法
CiteScore
7.40
自引率
6.10%
发文量
266
审稿时长
24 months
期刊介绍: Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper. As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信