{"title":"使用卡方自动交互检测模型有效预测约旦妇女心脏病死亡率:回顾性验证研究","authors":"Salam Bani Hani, Muayyad Ahmad","doi":"10.2196/48795","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Many current studies have claimed that the actual risk of heart disease among women is equal to that in men. Using a large machine learning algorithm (MLA) data set to predict mortality in women, data mining techniques have been used to identify significant aspects of variables that help in identifying the primary causes of mortality within this target category of the population.</p><p><strong>Objective: </strong>This study aims to predict mortality caused by heart disease among women, using an artificial intelligence technique-based MLA.</p><p><strong>Methods: </strong>A retrospective design was used to retrieve big data from the electronic health records of 2028 women with heart disease. Data were collected for Jordanian women who were admitted to public health hospitals from 2015 to the end of 2021. We checked the extracted data for noise, consistency issues, and missing values. After categorizing, organizing, and cleaning the extracted data, the redundant data were eliminated.</p><p><strong>Results: </strong>Out of 9 artificial intelligence models, the Chi-squared Automatic Interaction Detection model had the highest accuracy (93.25%) and area under the curve (0.825) among the build models. The participants were 62.6 (SD 15.4) years old on average. Angina pectoris was the most frequent diagnosis in the women's extracted files (n=1,264,000, 62.3%), followed by congestive heart failure (n=764,000, 37.7%). Age, systolic blood pressure readings with a cutoff value of >187 mm Hg, medical diagnosis (women diagnosed with congestive heart failure were at a higher risk of death [n=31, 16.58%]), pulse pressure with a cutoff value of 98 mm Hg, and oxygen saturation (measured using pulse oximetry) with a cutoff value of 93% were the main predictors for death among women.</p><p><strong>Conclusions: </strong>To predict the outcomes in this study, we used big data that were extracted from the clinical variables from the electronic health records. The Chi-squared Automatic Interaction Detection model-an MLA-confirmed the precise identification of the key predictors of cardiovascular mortality among women and can be used as a practical tool for clinical prediction.</p>","PeriodicalId":14706,"journal":{"name":"JMIR Cardio","volume":"7 ","pages":"e48795"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10401188/pdf/","citationCount":"2","resultStr":"{\"title\":\"Effective Prediction of Mortality by Heart Disease Among Women in Jordan Using the Chi-Squared Automatic Interaction Detection Model: Retrospective Validation Study.\",\"authors\":\"Salam Bani Hani, Muayyad Ahmad\",\"doi\":\"10.2196/48795\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Many current studies have claimed that the actual risk of heart disease among women is equal to that in men. Using a large machine learning algorithm (MLA) data set to predict mortality in women, data mining techniques have been used to identify significant aspects of variables that help in identifying the primary causes of mortality within this target category of the population.</p><p><strong>Objective: </strong>This study aims to predict mortality caused by heart disease among women, using an artificial intelligence technique-based MLA.</p><p><strong>Methods: </strong>A retrospective design was used to retrieve big data from the electronic health records of 2028 women with heart disease. Data were collected for Jordanian women who were admitted to public health hospitals from 2015 to the end of 2021. We checked the extracted data for noise, consistency issues, and missing values. After categorizing, organizing, and cleaning the extracted data, the redundant data were eliminated.</p><p><strong>Results: </strong>Out of 9 artificial intelligence models, the Chi-squared Automatic Interaction Detection model had the highest accuracy (93.25%) and area under the curve (0.825) among the build models. The participants were 62.6 (SD 15.4) years old on average. Angina pectoris was the most frequent diagnosis in the women's extracted files (n=1,264,000, 62.3%), followed by congestive heart failure (n=764,000, 37.7%). Age, systolic blood pressure readings with a cutoff value of >187 mm Hg, medical diagnosis (women diagnosed with congestive heart failure were at a higher risk of death [n=31, 16.58%]), pulse pressure with a cutoff value of 98 mm Hg, and oxygen saturation (measured using pulse oximetry) with a cutoff value of 93% were the main predictors for death among women.</p><p><strong>Conclusions: </strong>To predict the outcomes in this study, we used big data that were extracted from the clinical variables from the electronic health records. The Chi-squared Automatic Interaction Detection model-an MLA-confirmed the precise identification of the key predictors of cardiovascular mortality among women and can be used as a practical tool for clinical prediction.</p>\",\"PeriodicalId\":14706,\"journal\":{\"name\":\"JMIR Cardio\",\"volume\":\"7 \",\"pages\":\"e48795\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10401188/pdf/\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Cardio\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/48795\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cardio","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/48795","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 2
摘要
背景:目前许多研究都声称,女性患心脏病的实际风险与男性相同。使用大型机器学习算法(MLA)数据集预测妇女死亡率,数据挖掘技术已被用于确定变量的重要方面,这些变量有助于确定这一目标人群中死亡的主要原因。目的:本研究旨在利用基于人工智能技术的MLA预测女性心脏病死亡率。方法:采用回顾性设计,从2028例心脏病女性的电子健康记录中检索大数据。收集了2015年至2021年底在公共卫生医院住院的约旦妇女的数据。我们检查了提取的数据是否存在噪声、一致性问题和缺失值。对提取的数据进行分类、组织和清理后,消除了冗余数据。结果:在9个人工智能模型中,卡方自动交互检测模型在构建模型中准确率最高(93.25%),曲线下面积最高(0.825)。参与者的平均年龄为62.6岁(SD 15.4)。心绞痛是最常见的诊断(n= 126.4万,62.3%),其次是充血性心力衰竭(n= 76.4万,37.7%)。年龄、收缩压(临界值> 187mmhg)、医学诊断(诊断为充血性心力衰竭的女性死亡风险更高[n=31, 16.58%])、脉压(临界值为98 mm Hg)和血氧饱和度(使用脉搏血氧仪测量)(临界值为93%)是女性死亡的主要预测因素。结论:为了预测本研究的结果,我们使用了从电子健康记录中提取的临床变量的大数据。卡方自动交互检测模型-一种mla -确认了女性心血管死亡率关键预测因子的精确识别,并可作为临床预测的实用工具。
Effective Prediction of Mortality by Heart Disease Among Women in Jordan Using the Chi-Squared Automatic Interaction Detection Model: Retrospective Validation Study.
Background: Many current studies have claimed that the actual risk of heart disease among women is equal to that in men. Using a large machine learning algorithm (MLA) data set to predict mortality in women, data mining techniques have been used to identify significant aspects of variables that help in identifying the primary causes of mortality within this target category of the population.
Objective: This study aims to predict mortality caused by heart disease among women, using an artificial intelligence technique-based MLA.
Methods: A retrospective design was used to retrieve big data from the electronic health records of 2028 women with heart disease. Data were collected for Jordanian women who were admitted to public health hospitals from 2015 to the end of 2021. We checked the extracted data for noise, consistency issues, and missing values. After categorizing, organizing, and cleaning the extracted data, the redundant data were eliminated.
Results: Out of 9 artificial intelligence models, the Chi-squared Automatic Interaction Detection model had the highest accuracy (93.25%) and area under the curve (0.825) among the build models. The participants were 62.6 (SD 15.4) years old on average. Angina pectoris was the most frequent diagnosis in the women's extracted files (n=1,264,000, 62.3%), followed by congestive heart failure (n=764,000, 37.7%). Age, systolic blood pressure readings with a cutoff value of >187 mm Hg, medical diagnosis (women diagnosed with congestive heart failure were at a higher risk of death [n=31, 16.58%]), pulse pressure with a cutoff value of 98 mm Hg, and oxygen saturation (measured using pulse oximetry) with a cutoff value of 93% were the main predictors for death among women.
Conclusions: To predict the outcomes in this study, we used big data that were extracted from the clinical variables from the electronic health records. The Chi-squared Automatic Interaction Detection model-an MLA-confirmed the precise identification of the key predictors of cardiovascular mortality among women and can be used as a practical tool for clinical prediction.