Based on Machine Learning Algorithm: Construction of an Early Prediction Model of Integrated Traditional Chinese and Western Medicine for Cognitive Impairment after Ischemic Stroke

Xinhao Chen, Chengxia Wei, Wuhui Wu, Lizhen Guo, Chengyuan Liu, Gendi Lu
{"title":"Based on Machine Learning Algorithm: Construction of an Early Prediction Model of Integrated Traditional Chinese and Western Medicine for Cognitive Impairment after Ischemic Stroke","authors":"Xinhao Chen, Chengxia Wei, Wuhui Wu, Lizhen Guo, Chengyuan Liu, Gendi Lu","doi":"10.1109/UV50937.2020.9426200","DOIUrl":null,"url":null,"abstract":"Purpose: Based on the risk factors of post stroke cognitive impairment (PSCI), combining the Constitution and Syndrome of Traditional Chinese Medicine, using a variety of Machine learning (ML) algorithms, to construct a prediction model with high accuracy and good fitting degree, so as to provide theoretical and data support for early screening and early prevention of ischemic stroke (IS) patients. Patients and methods: A retrospective analysis was conducted on 85 patients with acute ischemic stroke admitted to the Department of Neurology of a third grade a hospital of integrated Traditional Chinese and Western Medicine (TCM-WM) from June 2019 to January 2020. The patients were divided into three groups: Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), ML algorithms were used to construct the risk prediction model of post-stroke cognitive impairment, and the prediction accuracy and area under curve (AUC) of receiver operating characteristic curve (ROC) were used to evaluate the prediction effect of the three models. Results: The average prediction accuracy of GBDT was 80.77 percent, the highest and the most stable. The average AUC area of GBDT was 0.85, which was larger than that of the other three ML algorithms, and the prediction effect was better. After analyzing the importance of the features obtained from the training of GBDT model, it is concluded that the features with the highest degree of discrimination for PSCI in this data set are as follows: Barthel index, Age, fasting blood glucose (FPG), blood homocysteine (Hcy). Based on GBDT algorithm, four GBDT models were obtained by training 75 percent, 80 percent, 85 percent and 90 percent training sets respectively. It was found that the prediction accuracy of the models with 85 percent and 90 percent training sets could reach 84.62 percent and 88.89 percent, indicating the potential of applying machine learning algorithm to the prediction of cognitive impairment after ischemic stroke. Conclusion: The ML algorithm is used to construct the early prediction model of TCM-WM integration for cognitive impairment after ischemic stroke, and analyze the influencing factors with strong correlation with PSCI, so as to carry out early detection, early diagnosis and early treatment of PSCI, so as to provide basis and reference for researchers who construct a large sample prediction model of cognitive impairment after ischemic stroke.","PeriodicalId":279871,"journal":{"name":"2020 5th International Conference on Universal Village (UV)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Universal Village (UV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UV50937.2020.9426200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Purpose: Based on the risk factors of post stroke cognitive impairment (PSCI), combining the Constitution and Syndrome of Traditional Chinese Medicine, using a variety of Machine learning (ML) algorithms, to construct a prediction model with high accuracy and good fitting degree, so as to provide theoretical and data support for early screening and early prevention of ischemic stroke (IS) patients. Patients and methods: A retrospective analysis was conducted on 85 patients with acute ischemic stroke admitted to the Department of Neurology of a third grade a hospital of integrated Traditional Chinese and Western Medicine (TCM-WM) from June 2019 to January 2020. The patients were divided into three groups: Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), ML algorithms were used to construct the risk prediction model of post-stroke cognitive impairment, and the prediction accuracy and area under curve (AUC) of receiver operating characteristic curve (ROC) were used to evaluate the prediction effect of the three models. Results: The average prediction accuracy of GBDT was 80.77 percent, the highest and the most stable. The average AUC area of GBDT was 0.85, which was larger than that of the other three ML algorithms, and the prediction effect was better. After analyzing the importance of the features obtained from the training of GBDT model, it is concluded that the features with the highest degree of discrimination for PSCI in this data set are as follows: Barthel index, Age, fasting blood glucose (FPG), blood homocysteine (Hcy). Based on GBDT algorithm, four GBDT models were obtained by training 75 percent, 80 percent, 85 percent and 90 percent training sets respectively. It was found that the prediction accuracy of the models with 85 percent and 90 percent training sets could reach 84.62 percent and 88.89 percent, indicating the potential of applying machine learning algorithm to the prediction of cognitive impairment after ischemic stroke. Conclusion: The ML algorithm is used to construct the early prediction model of TCM-WM integration for cognitive impairment after ischemic stroke, and analyze the influencing factors with strong correlation with PSCI, so as to carry out early detection, early diagnosis and early treatment of PSCI, so as to provide basis and reference for researchers who construct a large sample prediction model of cognitive impairment after ischemic stroke.
基于机器学习算法的中西医结合缺血性脑卒中认知功能损害早期预测模型构建
目的:基于脑卒中后认知功能障碍(PSCI)的危险因素,结合中医体质与证候,运用多种机器学习(ML)算法,构建准确率高、拟合程度好的预测模型,为缺血性脑卒中(IS)患者的早期筛查和早期预防提供理论和数据支持。患者与方法:回顾性分析2019年6月至2020年1月某三甲中西医结合医院神经内科收治的急性缺血性脑卒中患者85例。将患者分为三组,分别采用支持向量机(SVM)、随机森林(RF)、梯度增强决策树(GBDT)、ML算法构建脑卒中后认知功能障碍风险预测模型,并以受试者工作特征曲线(ROC)的预测准确率和曲线下面积(AUC)评价三种模型的预测效果。结果:GBDT的平均预测准确率为80.77%,最高且最稳定。GBDT的平均AUC面积为0.85,大于其他三种ML算法,预测效果较好。通过对GBDT模型训练得到的特征的重要性进行分析,得出该数据集中对PSCI辨别程度最高的特征为:Barthel指数、年龄、空腹血糖(FPG)、血同型半胱氨酸(Hcy)。基于GBDT算法,分别通过训练75%、80%、85%和90%的训练集得到4个GBDT模型。研究发现,85%和90%训练集的模型预测准确率分别达到84.62%和88.89%,表明机器学习算法应用于缺血性脑卒中后认知功能障碍预测的潜力。结论:利用ML算法构建缺血性脑卒中后认知功能障碍的中西医结合早期预测模型,分析与PSCI相关性强的影响因素,实现对PSCI的早发现、早诊断、早治疗,为研究者构建缺血性脑卒中后认知功能障碍大样本预测模型提供依据和参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信