Data Modeling Using Vital Sign Dynamics for In-hospital Mortality Classification in Patients with Acute Coronary Syndrome.

IF 2.3 Q3 MEDICAL INFORMATICS
Sarawuth Limprasert, Ajchara Phu-Ang
{"title":"Data Modeling Using Vital Sign Dynamics for In-hospital Mortality Classification in Patients with Acute Coronary Syndrome.","authors":"Sarawuth Limprasert,&nbsp;Ajchara Phu-Ang","doi":"10.4258/hir.2023.29.2.120","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study compared feature selection by machine learning or expert recommendation in the performance of classification models for in-hospital mortality among patients with acute coronary syndrome (ACS) who underwent percutaneous coronary intervention (PCI).</p><p><strong>Methods: </strong>A dataset of 1,123 patients with ACS who underwent PCI was analyzed. After assigning 80% of instances to the training set through random splitting, we performed feature scaling and resampling with the synthetic minority over-sampling technique and Tomek link method. We compared two feature selection.</p><p><strong>Methods: </strong>recursive feature elimination with cross-validation (RFECV) and selection by interventional cardiologists. We used five simple models: support vector machine (SVM), random forest, decision tree, logistic regression, and artificial neural network. The performance metrics were accuracy, recall, and the false-negative rate, measured with 10-fold cross-validation in the training set and validated in the test set.</p><p><strong>Results: </strong>Patients' mean age was 66.22 ± 12.88 years, and 33.63% had ST-elevation ACS. Fifteen of 34 features were selected as important with the RFECV method, while the experts chose 11 features. All models with feature selection by RFECV had higher accuracy than the models with expert-chosen features. In the training set, the random forest model had the highest accuracy (0.96 ± 0.01) and recall (0.97 ± 0.02). After validation in the test set, the SVM model displayed the highest accuracy (0.81) and a recall of 0.61.</p><p><strong>Conclusions: </strong>Models with feature selection by RFECV had higher accuracy than those with feature selection by experts in identifying patients with ACS at high risk for in-hospital mortality.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"29 2","pages":"120-131"},"PeriodicalIF":2.3000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/1d/67/hir-2023-29-2-120.PMC10209722.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2023.29.2.120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: This study compared feature selection by machine learning or expert recommendation in the performance of classification models for in-hospital mortality among patients with acute coronary syndrome (ACS) who underwent percutaneous coronary intervention (PCI).

Methods: A dataset of 1,123 patients with ACS who underwent PCI was analyzed. After assigning 80% of instances to the training set through random splitting, we performed feature scaling and resampling with the synthetic minority over-sampling technique and Tomek link method. We compared two feature selection.

Methods: recursive feature elimination with cross-validation (RFECV) and selection by interventional cardiologists. We used five simple models: support vector machine (SVM), random forest, decision tree, logistic regression, and artificial neural network. The performance metrics were accuracy, recall, and the false-negative rate, measured with 10-fold cross-validation in the training set and validated in the test set.

Results: Patients' mean age was 66.22 ± 12.88 years, and 33.63% had ST-elevation ACS. Fifteen of 34 features were selected as important with the RFECV method, while the experts chose 11 features. All models with feature selection by RFECV had higher accuracy than the models with expert-chosen features. In the training set, the random forest model had the highest accuracy (0.96 ± 0.01) and recall (0.97 ± 0.02). After validation in the test set, the SVM model displayed the highest accuracy (0.81) and a recall of 0.61.

Conclusions: Models with feature selection by RFECV had higher accuracy than those with feature selection by experts in identifying patients with ACS at high risk for in-hospital mortality.

Abstract Image

Abstract Image

Abstract Image

急性冠脉综合征患者住院死亡率分类的生命体征动力学数据建模。
目的:本研究比较了机器学习特征选择和专家推荐对急性冠脉综合征(ACS)患者经皮冠状动脉介入治疗(PCI)住院死亡率分类模型的性能。方法:对1123例行PCI治疗的ACS患者数据集进行分析。通过随机分割将80%的实例分配到训练集后,我们使用合成少数派过采样技术和Tomek链接方法进行特征缩放和重采样。我们比较了两种特征选择。方法:交叉验证递归特征消除(RFECV)和介入心脏病专家选择。我们使用了五个简单的模型:支持向量机(SVM)、随机森林、决策树、逻辑回归和人工神经网络。性能指标是准确性、召回率和假阴性率,在训练集中进行10倍交叉验证,并在测试集中进行验证。结果:患者平均年龄66.22±12.88岁,其中33.63%为st段抬高型ACS。用RFECV方法从34个特征中选择了15个作为重要特征,而专家选择了11个特征。RFECV特征选择模型的准确率均高于专家特征选择模型。在训练集中,随机森林模型具有最高的准确率(0.96±0.01)和召回率(0.97±0.02)。经过测试集的验证,SVM模型的准确率最高(0.81),召回率为0.61。结论:RFECV特征选择模型识别院内死亡高危ACS患者的准确率高于专家特征选择模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Healthcare Informatics Research
Healthcare Informatics Research MEDICAL INFORMATICS-
CiteScore
4.90
自引率
6.90%
发文量
44
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信