Establish a predictive model for high-risk de novo metastatic prostate cancer patients by machine learning.

Po-Jung Su, Yu-Ann Fang, Yung-Chun Chang, Yung-Chia Kuo, Yung-Chang Lin
{"title":"Establish a predictive model for high-risk de novo metastatic prostate cancer patients by machine learning.","authors":"Po-Jung Su, Yu-Ann Fang, Yung-Chun Chang, Yung-Chia Kuo, Yung-Chang Lin","doi":"10.1200/jgo.2019.5.suppl.13","DOIUrl":null,"url":null,"abstract":"13 Background: For de novo metastatic prostate cancer (mPC)) patients, their prognosis may be really different. Some of these patients response very well to hormone therapy with durable survival, but others may be not. For those poor prognosis patients, if we could predict them as high risk patients when diagnosed, and provide aggressive upfront chemotherapy or novel hormonal therapy, they might get better treatment outcomes. Methods: We used data of prostate cancer patients from 2000 to 2016 in Chang Gung Research Database. There are 799 de novo mPC patients with castration. We predicted the possibility for these patients progressed to metastatic castration-resistant prostate cancer (mCRPC) in 1 year and find the high risk group patients. Then we figured out the best features for prediction from the best classifier with Recursive Feature Elimination. Results: The de nove mPC patients who pregressed to mCRPC in 1 year, whose mOS is 21.9 months is worse than who progressed to mCRPC beyond 1 year significantly, whose mOS is 80.7 months. (adjusted hazard ratio[aHR]: 6.43, P<0.001). The overall performance of machine learning by XGBoost is the best in all predictive models for high risk patients. (AUC=0.7000, Accuracy=0.7143). We excluded the features with missing data over 50%, then put all other features in the model. (AUC=0.7042, Accuracy=0.7239). But we got the best performance with only 11 features, including age, time from diagnosis to castration, nadir PSA, hemoglobin, eosinophil/white blood cell ratio, alkaline phosphatase, alanine transaminase, blood urea nitrogen, creatinine, prothrombin time, and secondary primary cancer, by Recursive Feature Elimination. (AUC=0.7131, Accuracy=0.7267). Conclusions: We found the predictive model has better predictive accuracy and shorter manuscript time with less features selected by Recursive Feature Elimination.We can predict high risk group in de novo mPC patients and make better clinical decision for treatment with this XGBoost model.","PeriodicalId":15862,"journal":{"name":"Journal of global oncology","volume":"27 24 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of global oncology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/jgo.2019.5.suppl.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

13 Background: For de novo metastatic prostate cancer (mPC)) patients, their prognosis may be really different. Some of these patients response very well to hormone therapy with durable survival, but others may be not. For those poor prognosis patients, if we could predict them as high risk patients when diagnosed, and provide aggressive upfront chemotherapy or novel hormonal therapy, they might get better treatment outcomes. Methods: We used data of prostate cancer patients from 2000 to 2016 in Chang Gung Research Database. There are 799 de novo mPC patients with castration. We predicted the possibility for these patients progressed to metastatic castration-resistant prostate cancer (mCRPC) in 1 year and find the high risk group patients. Then we figured out the best features for prediction from the best classifier with Recursive Feature Elimination. Results: The de nove mPC patients who pregressed to mCRPC in 1 year, whose mOS is 21.9 months is worse than who progressed to mCRPC beyond 1 year significantly, whose mOS is 80.7 months. (adjusted hazard ratio[aHR]: 6.43, P<0.001). The overall performance of machine learning by XGBoost is the best in all predictive models for high risk patients. (AUC=0.7000, Accuracy=0.7143). We excluded the features with missing data over 50%, then put all other features in the model. (AUC=0.7042, Accuracy=0.7239). But we got the best performance with only 11 features, including age, time from diagnosis to castration, nadir PSA, hemoglobin, eosinophil/white blood cell ratio, alkaline phosphatase, alanine transaminase, blood urea nitrogen, creatinine, prothrombin time, and secondary primary cancer, by Recursive Feature Elimination. (AUC=0.7131, Accuracy=0.7267). Conclusions: We found the predictive model has better predictive accuracy and shorter manuscript time with less features selected by Recursive Feature Elimination.We can predict high risk group in de novo mPC patients and make better clinical decision for treatment with this XGBoost model.
建立基于机器学习的高风险新发转移性前列腺癌患者预测模型。
背景:对于新发转移性前列腺癌(mPC)患者,他们的预后可能真的不同。其中一些患者对激素治疗反应很好,生存时间很长,但其他患者可能没有。对于预后较差的患者,若能在诊断时将其预测为高危患者,并给予积极的前期化疗或新颖的激素治疗,可能会获得较好的治疗效果。方法:我们使用长庚研究数据库2000 - 2016年前列腺癌患者的数据。有799例新发mPC患者被阉割。我们预测这些患者在1年内发展为转移性去势抵抗性前列腺癌(mCRPC)的可能性,并找到高危组患者。然后用递归特征消去法从最佳分类器中找出用于预测的最佳特征。结果:1年内进展为mCRPC的原发性mPC患者的mOS为21.9个月,明显低于1年以上进展为mCRPC的患者的mOS为80.7个月。(校正风险比[aHR]: 6.43, P<0.001)。在所有高风险患者的预测模型中,XGBoost机器学习的整体性能是最好的。(AUC = 0.7000、准确性= 0.7143)。我们排除了丢失数据超过50%的特征,然后将所有其他特征放入模型中。(AUC = 0.7042、准确性= 0.7239)。但采用递归特征消去法筛选出年龄、诊断至去势时间、最低PSA、血红蛋白、嗜酸性粒细胞/白细胞比、碱性磷酸酶、丙氨酸转氨酶、血尿素氮、肌酐、凝血酶原时间、继发原发癌等11个特征,结果最佳。(AUC = 0.7131、准确性= 0.7267)。结论:采用递归特征消去法选取的特征较少,预测模型具有较好的预测精度和较短的预测时间。XGBoost模型可以预测新发mPC患者的高危人群,为临床治疗决策提供依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
20 weeks
期刊介绍: The Journal of Global Oncology (JGO) is an online only, open access journal focused on cancer care, research and care delivery issues unique to countries and settings with limited healthcare resources. JGO aims to provide a home for high-quality literature that fulfills a growing need for content describing the array of challenges health care professionals in resource-constrained settings face. Article types include original reports, review articles, commentaries, correspondence/replies, special articles and editorials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信