Predicting atrial fibrillation in comorbid patients with arterial hypertension and chronic obstructive pulmonary disease using laboratory research methods: a machine learning approach

Digital Diagnostics Pub Date : 2024-07-03 DOI:10.17816/dd626797

E. V. Kazantseva, A. Ivannikov, A. Tarzimanova, V. Podzolkov

{"title":"Predicting atrial fibrillation in comorbid patients with arterial hypertension and chronic obstructive pulmonary disease using laboratory research methods: a machine learning approach","authors":"E. V. Kazantseva, A. Ivannikov, A. Tarzimanova, V. Podzolkov","doi":"10.17816/dd626797","DOIUrl":null,"url":null,"abstract":"BACKGROUND: Arterial hypertension and chronic obstructive pulmonary disease have a deleterious effect on the structure of the heart, leading to the development of atrial fibrillation, which remains the leading cause of cerebral stroke and premature death [1]. Consequently, the early identification of atrial fibrillation risk factors in patients with arterial hypertension and chronic obstructive pulmonary disease is of paramount importance for the prevention of such conditions. This is why predictive cardiology employs machine learning methods, which are demonstrably superior to classical statistical methods of prediction [2–4]. \nAIM: The study aimed to develop a prognostic model of atrial fibrillation in comorbid patients with arterial hypertension and chronic obstructive pulmonary disease based on multilayer perceptron. \nMATERIALS AND METHODS: The study included 419 patients treated at the University Clinical Hospital No. 4 of the I.M. Sechenov First Moscow State Medical University. Group 1 consisted of 91 (21.7%) patients with a verified diagnosis of atrial fibrillation, while Group 2 comprised 328 (78.3%) patients without atrial fibrillation. The random forest machine learning algorithm was used to identify predictors, which were then utilized to develop a neural network of the multilayer perceptron type. This consisted of two layers: an input layer of 12 neurons with the ReLU activation function and an output layer that receives input data from the previous layer and transmits them to one output with the sigmoid activation function. The threshold value, sensitivity, specificity, and diagnostic efficiency of the obtained model were determined using receiver operating characteristic analysis with the calculation of the area under the curve (AUC). \nRESULTS: By the first stage of prognostic model development, the most significant predictors of atrial fibrillation development were selected by the random forest machine learning algorithm. The model was developed using three variables: C-reactive protein concentration (odds ratio, OR 1.04; 95% confidence interval, CI 1.015–1.067; p=0.002), erythrocyte sedimentation rate (OR 1.04; 95% CI 1.019–1.069; p=0.002), and creatinine concentration (OR 1.03; 95% CI 1.011–1.042; p 0.001). These variables were used to train a multilayer perceptron model on a test sample for 500 epochs. \nFollowing training, the developed model exhibited a sensitivity of 85%, a specificity of 80%, and a diagnostic efficiency of 79.6%. AUC amounted to 0.900. \nCONCLUSIONS: The study resulted in the development of a prognostic model based on the application of machine learning methods, which exhibited favorable metrics. This model may be considered a valuable tool for clinical practice.","PeriodicalId":34831,"journal":{"name":"Digital Diagnostics","volume":"30 S95","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Diagnostics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17816/dd626797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

BACKGROUND: Arterial hypertension and chronic obstructive pulmonary disease have a deleterious effect on the structure of the heart, leading to the development of atrial fibrillation, which remains the leading cause of cerebral stroke and premature death [1]. Consequently, the early identification of atrial fibrillation risk factors in patients with arterial hypertension and chronic obstructive pulmonary disease is of paramount importance for the prevention of such conditions. This is why predictive cardiology employs machine learning methods, which are demonstrably superior to classical statistical methods of prediction [2–4]. AIM: The study aimed to develop a prognostic model of atrial fibrillation in comorbid patients with arterial hypertension and chronic obstructive pulmonary disease based on multilayer perceptron. MATERIALS AND METHODS: The study included 419 patients treated at the University Clinical Hospital No. 4 of the I.M. Sechenov First Moscow State Medical University. Group 1 consisted of 91 (21.7%) patients with a verified diagnosis of atrial fibrillation, while Group 2 comprised 328 (78.3%) patients without atrial fibrillation. The random forest machine learning algorithm was used to identify predictors, which were then utilized to develop a neural network of the multilayer perceptron type. This consisted of two layers: an input layer of 12 neurons with the ReLU activation function and an output layer that receives input data from the previous layer and transmits them to one output with the sigmoid activation function. The threshold value, sensitivity, specificity, and diagnostic efficiency of the obtained model were determined using receiver operating characteristic analysis with the calculation of the area under the curve (AUC). RESULTS: By the first stage of prognostic model development, the most significant predictors of atrial fibrillation development were selected by the random forest machine learning algorithm. The model was developed using three variables: C-reactive protein concentration (odds ratio, OR 1.04; 95% confidence interval, CI 1.015–1.067; p=0.002), erythrocyte sedimentation rate (OR 1.04; 95% CI 1.019–1.069; p=0.002), and creatinine concentration (OR 1.03; 95% CI 1.011–1.042; p 0.001). These variables were used to train a multilayer perceptron model on a test sample for 500 epochs. Following training, the developed model exhibited a sensitivity of 85%, a specificity of 80%, and a diagnostic efficiency of 79.6%. AUC amounted to 0.900. CONCLUSIONS: The study resulted in the development of a prognostic model based on the application of machine learning methods, which exhibited favorable metrics. This model may be considered a valuable tool for clinical practice.

查看原文本刊更多论文

利用实验室研究方法预测动脉高血压和慢性阻塞性肺病合并症患者的心房颤动：一种机器学习方法

背景：动脉高血压和慢性阻塞性肺病会对心脏结构产生有害影响，导致心房颤动的发生，而心房颤动仍是脑卒中和过早死亡的主要原因[1]。因此，及早发现动脉高血压和慢性阻塞性肺病患者心房颤动的危险因素对预防此类疾病至关重要。因此，预测性心脏病学采用了机器学习方法，这种方法明显优于传统的统计预测方法[2-4]。目的：本研究旨在开发一种基于多层感知器的合并动脉高血压和慢性阻塞性肺病患者心房颤动预后模型。材料与方法：研究对象包括在莫斯科第一国立医科大学第四临床医院接受治疗的 419 名患者。第 1 组包括 91 名（21.7%）确诊为心房颤动的患者，第 2 组包括 328 名（78.3%）无心房颤动的患者。随机森林机器学习算法用于识别预测因子，然后利用这些预测因子开发多层感知器类型的神经网络。该网络由两层组成：一层是由 12 个神经元组成的输入层，具有 ReLU 激活函数；另一层是输出层，接收前一层的输入数据，并将其传输到一个具有 sigmoid 激活函数的输出端。利用接收器操作特征分析法确定了所获模型的阈值、灵敏度、特异性和诊断效率，并计算了曲线下面积（AUC）。结果：在建立预后模型的第一阶段，随机森林机器学习算法选出了对房颤发展最有意义的预测因子。该模型由三个变量组成C反应蛋白浓度（几率比，OR 1.04；95% 置信区间，CI 1.015-1.067；P=0.002）、红细胞沉降率（OR 1.04；95% CI 1.019-1.069；P=0.002）和肌酐浓度（OR 1.03；95% CI 1.011-1.042；P 0.001）。这些变量被用于对测试样本的多层感知器模型进行 500 次历时训练。训练后，所开发模型的灵敏度为 85%，特异度为 80%，诊断效率为 79.6%。AUC 为 0.900。结论：这项研究在应用机器学习方法的基础上开发了一个预后模型，该模型显示出良好的指标。该模型可被视为临床实践的重要工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊