Risk prediction of QTc prolongation occurrence in cancer patients treated with commonly used oral tyrosine kinase inhibitors: machine learning modeling or conventional statistical analysis better?

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Hsiang-Wen Lin, Tien-Chao Lin, Chien-Ning Hsu, Tzu-Pei Yeh, Yu-Chieh Chen, Liang-Chih Liu, Chen-Yuan Lin
{"title":"Risk prediction of QTc prolongation occurrence in cancer patients treated with commonly used oral tyrosine kinase inhibitors: machine learning modeling or conventional statistical analysis better?","authors":"Hsiang-Wen Lin, Tien-Chao Lin, Chien-Ning Hsu, Tzu-Pei Yeh, Yu-Chieh Chen, Liang-Chih Liu, Chen-Yuan Lin","doi":"10.1186/s12911-025-03091-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Cancer patients receiving targeted therapies need to prevent QTc prolongation and life-threatening cardiovascular (CV) events to maintain a balanced benefit-risk ratio. This study aimed to develop an optimal prediction model for QTc prolongation risk and estimate its risk probability in cancer patients treated with oral tyrosine kinase inhibitors (TKIs).</p><p><strong>Methods: </strong>This retrospective cohort study analyzed electronic medical records (EMR) of cancer patients newly treated with commonly used oral TKIs at a medical center between January 2016 and December 2020. QTc prolongation was defined as ≥ 450 ms in males and ≥ 470 ms in females using Bazett's formula. The study followed four key steps: (1) Managing missing data, (2) Identifying important variables, (3) Training and testing the best prediction models, (4). Estimating risk probability and determining cut-off points. Both univariate logistic regression (LR) and supervised machine learning (ML) approaches were used for variable selection. The backward LR method and seven ML algorithms were applied to train and test the prediction models. The best model was identified based on model performance, fitting criteria, area under the receiver operating characteristic curve (AUROC), risk probability cut-off points, and clinical relevance.</p><p><strong>Results: </strong>The statistical 12-parameter model demonstrated excellent performance (AUROC = 0.89, sensitivity = 0.91, specificity = 0.75) and strong discrimination ability for risk probability prediction (AUROC = 0.78, cut-off = 0.46), outperforming other ML models. In the final best model: the baseline risk probability of QTc prolongation was 0.13, even in the absence of other contributing factors. Baseline QTc prolongation and a history of cardiovascular disease (excluding arrhythmia, cardiomyopathy, etc.) contributed the most to incremental risk probability (0.471 and 0.282, respectively), after controlling for other factors. The remaining 10 factors each contributed to an increased probability of QTc prolongation for more than 0.14 probability.</p><p><strong>Conclusions: </strong>A logistic regression model utilizing 12 easily accessible variables from EMRs outperformed ML models in predicting the risk probability of QTc prolongation in cancer patients newly treated with five oral TKIs. These findings serve as a valuable clinical reference for integrating digital monitoring into cardiovascular care for cancer survivors undergoing targeted therapy with TKIs. They also underscore the importance of screening baseline ECG before initiating TKIs to assess the risk of QTc prolongation, facilitating early intervention and prevention in the future.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"310"},"PeriodicalIF":3.8000,"publicationDate":"2025-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12355842/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03091-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Cancer patients receiving targeted therapies need to prevent QTc prolongation and life-threatening cardiovascular (CV) events to maintain a balanced benefit-risk ratio. This study aimed to develop an optimal prediction model for QTc prolongation risk and estimate its risk probability in cancer patients treated with oral tyrosine kinase inhibitors (TKIs).

Methods: This retrospective cohort study analyzed electronic medical records (EMR) of cancer patients newly treated with commonly used oral TKIs at a medical center between January 2016 and December 2020. QTc prolongation was defined as ≥ 450 ms in males and ≥ 470 ms in females using Bazett's formula. The study followed four key steps: (1) Managing missing data, (2) Identifying important variables, (3) Training and testing the best prediction models, (4). Estimating risk probability and determining cut-off points. Both univariate logistic regression (LR) and supervised machine learning (ML) approaches were used for variable selection. The backward LR method and seven ML algorithms were applied to train and test the prediction models. The best model was identified based on model performance, fitting criteria, area under the receiver operating characteristic curve (AUROC), risk probability cut-off points, and clinical relevance.

Results: The statistical 12-parameter model demonstrated excellent performance (AUROC = 0.89, sensitivity = 0.91, specificity = 0.75) and strong discrimination ability for risk probability prediction (AUROC = 0.78, cut-off = 0.46), outperforming other ML models. In the final best model: the baseline risk probability of QTc prolongation was 0.13, even in the absence of other contributing factors. Baseline QTc prolongation and a history of cardiovascular disease (excluding arrhythmia, cardiomyopathy, etc.) contributed the most to incremental risk probability (0.471 and 0.282, respectively), after controlling for other factors. The remaining 10 factors each contributed to an increased probability of QTc prolongation for more than 0.14 probability.

Conclusions: A logistic regression model utilizing 12 easily accessible variables from EMRs outperformed ML models in predicting the risk probability of QTc prolongation in cancer patients newly treated with five oral TKIs. These findings serve as a valuable clinical reference for integrating digital monitoring into cardiovascular care for cancer survivors undergoing targeted therapy with TKIs. They also underscore the importance of screening baseline ECG before initiating TKIs to assess the risk of QTc prolongation, facilitating early intervention and prevention in the future.

常用口服酪氨酸激酶抑制剂治疗的癌症患者QTc延长发生的风险预测:机器学习建模还是传统统计分析更好?
背景:接受靶向治疗的癌症患者需要防止QTc延长和危及生命的心血管(CV)事件,以保持平衡的获益-风险比。本研究旨在建立口服酪氨酸激酶抑制剂(TKIs)治疗癌症患者QTc延长风险的最佳预测模型,并估计其风险概率。方法:本回顾性队列研究分析了2016年1月至2020年12月在某医疗中心新接受常用口服TKIs治疗的癌症患者的电子病历(EMR)。QTc延长定义为男性≥450 ms,女性≥470 ms,采用Bazett公式。该研究遵循四个关键步骤:(1)管理缺失数据,(2)识别重要变量,(3)训练和测试最佳预测模型,(4)。评估风险概率并确定截止点。单变量逻辑回归(LR)和监督机器学习(ML)方法均用于变量选择。采用后向LR方法和7种ML算法对预测模型进行训练和测试。根据模型性能、拟合标准、受试者工作特征曲线下面积(AUROC)、风险概率截止点和临床相关性确定最佳模型。结果:统计12参数模型表现出优良的性能(AUROC = 0.89,灵敏度= 0.91,特异性= 0.75)和较强的风险概率预测判别能力(AUROC = 0.78, cut-off = 0.46),优于其他ML模型。在最终的最佳模型中:即使在没有其他因素的情况下,QTc延长的基线风险概率为0.13。在控制其他因素后,基线QTc延长和心血管病史(不包括心律失常、心肌病等)对增加风险概率的贡献最大(分别为0.471和0.282)。其余10个因素均增加了QTc延长的概率,其概率超过0.14。结论:利用emr中12个易于获取的变量的逻辑回归模型在预测新接受五种口服TKIs治疗的癌症患者QTc延长的风险概率方面优于ML模型。这些发现为将数字监测整合到接受TKIs靶向治疗的癌症幸存者的心血管护理中提供了有价值的临床参考。他们还强调了在开始tki之前筛查基线心电图的重要性,以评估QTc延长的风险,促进未来的早期干预和预防。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信