Classification of coronary artery disease using radial artery pulse wave analysis via machine learning

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Yi Lyu, Hai-Mei Wu, Hai-Xia Yan, Rui Guo, Yu-Jie Xiong, Rui Chen, Wen-Yue Huang, Jing Hong, Rong Lyu, Yi-Qin Wang, Jin Xu
{"title":"Classification of coronary artery disease using radial artery pulse wave analysis via machine learning","authors":"Yi Lyu, Hai-Mei Wu, Hai-Xia Yan, Rui Guo, Yu-Jie Xiong, Rui Chen, Wen-Yue Huang, Jing Hong, Rong Lyu, Yi-Qin Wang, Jin Xu","doi":"10.1186/s12911-024-02666-1","DOIUrl":null,"url":null,"abstract":"Coronary artery disease (CAD) is a major global cardiovascular health threat and the leading cause of death in many countries. The disease has a significant impact in China, where it has become the leading cause of death. There is an urgent need to develop non-invasive, rapid, cost-effective, and reliable techniques for the early detection of CAD using machine learning (ML). Six hundred eight participants were divided into three groups: healthy, hypertensive, and CAD. The raw data of pulse wave from those participants was collected. The data were de-noised, normalized, and analyzed using several applications. Seven ML classifiers were used to model the processed data, including Decision Tree (DT), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Extra Trees (ET), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting (LightGBM), and Unbiased Boosting with Categorical Features (CatBoost). The Extra Trees classifier demonstrated the best classification performance. After tunning, the results performance evaluation on test set are: 0.8579 accuracy, 0.9361 AUC, 0.8561 recall, 0.8581 precision, 0.8571 F1 score, 0.7859 kappa coefficient, and 0.7867 MCC. The top 10 feature importances of ET model are w/t1, t3/tmax, tmax, t3/t1, As, hf/3, tf/3/tmax, tf/5, w and tf/3/t1. Radial artery pulse wave can be used to identify healthy, hypertensive and CAD participants by using Extra Trees Classifier. This method provides a potential pathway to recognize CAD patients by using a simple, non-invasive, and cost-effective technique.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02666-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Coronary artery disease (CAD) is a major global cardiovascular health threat and the leading cause of death in many countries. The disease has a significant impact in China, where it has become the leading cause of death. There is an urgent need to develop non-invasive, rapid, cost-effective, and reliable techniques for the early detection of CAD using machine learning (ML). Six hundred eight participants were divided into three groups: healthy, hypertensive, and CAD. The raw data of pulse wave from those participants was collected. The data were de-noised, normalized, and analyzed using several applications. Seven ML classifiers were used to model the processed data, including Decision Tree (DT), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Extra Trees (ET), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting (LightGBM), and Unbiased Boosting with Categorical Features (CatBoost). The Extra Trees classifier demonstrated the best classification performance. After tunning, the results performance evaluation on test set are: 0.8579 accuracy, 0.9361 AUC, 0.8561 recall, 0.8581 precision, 0.8571 F1 score, 0.7859 kappa coefficient, and 0.7867 MCC. The top 10 feature importances of ET model are w/t1, t3/tmax, tmax, t3/t1, As, hf/3, tf/3/tmax, tf/5, w and tf/3/t1. Radial artery pulse wave can be used to identify healthy, hypertensive and CAD participants by using Extra Trees Classifier. This method provides a potential pathway to recognize CAD patients by using a simple, non-invasive, and cost-effective technique.
通过机器学习利用桡动脉脉搏波分析进行冠状动脉疾病分类
冠状动脉疾病(CAD)是全球心血管健康的主要威胁,也是许多国家的主要死因。在中国,冠心病的影响非常大,已成为导致死亡的主要原因。利用机器学习(ML)开发无创、快速、经济、可靠的早期检测 CAD 的技术迫在眉睫。68 名参与者被分为三组:健康组、高血压组和 CAD 组。收集了这些参与者的脉搏波原始数据。数据经过去噪、归一化处理,并使用多个应用程序进行分析。在对处理后的数据建模时使用了七种 ML 分类器,包括决策树 (DT)、随机森林 (RF)、梯度提升决策树 (GBDT)、额外树 (ET)、极端梯度提升 (XGBoost)、轻梯度提升 (LightGBM) 和带分类特征的无偏提升 (CatBoost)。Extra Trees 分类器的分类效果最好。经过调整后,测试集的性能评估结果如下准确率为 0.8579,AUC 为 0.9361,召回率为 0.8561,精度为 0.8581,F1 分数为 0.8571,卡帕系数为 0.7859,MCC 为 0.7867。ET 模型的前 10 个重要特征是 w/t1、t3/tmax、tmax、t3/t1、As、hf/3、tf/3/tmax、tf/5、w 和 tf/3/t1。利用 Extra Trees 分类器,桡动脉脉搏波可用于识别健康、高血压和 CAD 患者。这种方法提供了一种利用简单、无创和经济有效的技术识别 CAD 患者的潜在途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信