利用行政医疗数据库设计预测缺血性中风的人工智能模型

Wai-Fai Tung, Fu-Hsing Wu, Po-Chou Chan, Hsuan-Hung Lin, Yung-fu Chen, Chih-Sheng Lin
{"title":"利用行政医疗数据库设计预测缺血性中风的人工智能模型","authors":"Wai-Fai Tung, Fu-Hsing Wu, Po-Chou Chan, Hsuan-Hung Lin, Yung-fu Chen, Chih-Sheng Lin","doi":"10.1109/IS3C50286.2020.00020","DOIUrl":null,"url":null,"abstract":"Ischemic stroke (IS) is the most common type of stroke, accounting to about 80% of the stroke, and is a major cause of morbidity and mortality worldwide. The risk factors of IS included older age, male gender, high BMI, smoking habit, hypertension, diabetes, hyperlipidemia, etc. Other factors, such as environmental metal exposure and atrial cardiopathy are also found to be risk factors. In this study, a balanced dataset, consisting of healthcare data of 36,880 IS patients and 36,880 non-IS patients matched with IS patients using indexed date, age, and sex, retrieved from a subset of the National Health Insurance Research Database (NHIRD) of Taiwan, an administrative healthcare database, were adopted for developing the AI models to predict events of ISs. Integrated Genetic Algorithm and Support Vector Machine (IGS) algorithm accompanied with 3 different fitness functions was applied for designing the predictive models. To select the best predictive performance from different AI models, tenfold cross validation were conducted for model training. The predictive performance of the designed models exhibits that predictive accuracy, sensitivity, specificity, and area under ROC curve (AUC) achieve 73.38-73.96%, 73.31-73.91%, 73.03-74.02%, and 0.808-0.813, respectively. The selected features including age, comorbidities, and other comorbidity-related variables are shown to be effective in designing strong predictive models (AUC>0.8) for predicting patients who are more likely to develop IS in the near future. Future works will focus on designing the predictive models using more effective AI technique, such as deep neural network (DNN), and useful variables, such as administrated drugs, to enhance the predictive performance.","PeriodicalId":143430,"journal":{"name":"2020 International Symposium on Computer, Consumer and Control (IS3C)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing AI Models for Predicting Ischemic Stroke Using Administrative Healthcare Database\",\"authors\":\"Wai-Fai Tung, Fu-Hsing Wu, Po-Chou Chan, Hsuan-Hung Lin, Yung-fu Chen, Chih-Sheng Lin\",\"doi\":\"10.1109/IS3C50286.2020.00020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ischemic stroke (IS) is the most common type of stroke, accounting to about 80% of the stroke, and is a major cause of morbidity and mortality worldwide. The risk factors of IS included older age, male gender, high BMI, smoking habit, hypertension, diabetes, hyperlipidemia, etc. Other factors, such as environmental metal exposure and atrial cardiopathy are also found to be risk factors. In this study, a balanced dataset, consisting of healthcare data of 36,880 IS patients and 36,880 non-IS patients matched with IS patients using indexed date, age, and sex, retrieved from a subset of the National Health Insurance Research Database (NHIRD) of Taiwan, an administrative healthcare database, were adopted for developing the AI models to predict events of ISs. Integrated Genetic Algorithm and Support Vector Machine (IGS) algorithm accompanied with 3 different fitness functions was applied for designing the predictive models. To select the best predictive performance from different AI models, tenfold cross validation were conducted for model training. The predictive performance of the designed models exhibits that predictive accuracy, sensitivity, specificity, and area under ROC curve (AUC) achieve 73.38-73.96%, 73.31-73.91%, 73.03-74.02%, and 0.808-0.813, respectively. The selected features including age, comorbidities, and other comorbidity-related variables are shown to be effective in designing strong predictive models (AUC>0.8) for predicting patients who are more likely to develop IS in the near future. Future works will focus on designing the predictive models using more effective AI technique, such as deep neural network (DNN), and useful variables, such as administrated drugs, to enhance the predictive performance.\",\"PeriodicalId\":143430,\"journal\":{\"name\":\"2020 International Symposium on Computer, Consumer and Control (IS3C)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Symposium on Computer, Consumer and Control (IS3C)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IS3C50286.2020.00020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium on Computer, Consumer and Control (IS3C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IS3C50286.2020.00020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

缺血性中风(IS)是最常见的中风类型,约占中风的80%,是世界范围内发病率和死亡率的主要原因。IS的危险因素包括高龄、男性、高BMI、吸烟习惯、高血压、糖尿病、高脂血症等。其他因素,如环境金属暴露和心房心脏病也被发现是危险因素。本研究采用平衡的数据集,包括36,880名IS患者和36,880名与IS患者匹配的非IS患者的医疗数据,使用索引日期,年龄和性别,检索自台湾国家健康保险研究数据库(NHIRD)的一个子集,用于开发AI模型来预测ISs事件。采用综合遗传算法和支持向量机(IGS)算法结合3种不同适应度函数设计预测模型。为了从不同的人工智能模型中选择最佳的预测性能,对模型进行了十次交叉验证。所设计模型的预测精度、灵敏度、特异性和ROC曲线下面积(AUC)分别达到73.38 ~ 73.96%、73.31 ~ 73.91%、73.03 ~ 74.02%和0.808 ~ 0.813。所选择的特征包括年龄、合并症和其他合并症相关变量,在设计强预测模型(AUC >.8)时被证明是有效的,用于预测近期更有可能发展为IS的患者。未来的工作将集中在使用更有效的人工智能技术(如深度神经网络(DNN))和有用的变量(如给药)来设计预测模型,以提高预测性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Designing AI Models for Predicting Ischemic Stroke Using Administrative Healthcare Database
Ischemic stroke (IS) is the most common type of stroke, accounting to about 80% of the stroke, and is a major cause of morbidity and mortality worldwide. The risk factors of IS included older age, male gender, high BMI, smoking habit, hypertension, diabetes, hyperlipidemia, etc. Other factors, such as environmental metal exposure and atrial cardiopathy are also found to be risk factors. In this study, a balanced dataset, consisting of healthcare data of 36,880 IS patients and 36,880 non-IS patients matched with IS patients using indexed date, age, and sex, retrieved from a subset of the National Health Insurance Research Database (NHIRD) of Taiwan, an administrative healthcare database, were adopted for developing the AI models to predict events of ISs. Integrated Genetic Algorithm and Support Vector Machine (IGS) algorithm accompanied with 3 different fitness functions was applied for designing the predictive models. To select the best predictive performance from different AI models, tenfold cross validation were conducted for model training. The predictive performance of the designed models exhibits that predictive accuracy, sensitivity, specificity, and area under ROC curve (AUC) achieve 73.38-73.96%, 73.31-73.91%, 73.03-74.02%, and 0.808-0.813, respectively. The selected features including age, comorbidities, and other comorbidity-related variables are shown to be effective in designing strong predictive models (AUC>0.8) for predicting patients who are more likely to develop IS in the near future. Future works will focus on designing the predictive models using more effective AI technique, such as deep neural network (DNN), and useful variables, such as administrated drugs, to enhance the predictive performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信