缺血性中风的快速分诊:预测、预防和个性化医学背景下的机器学习驱动方法。

IF 6.5 2区 医学 Q1 Medicine
Yulu Zheng, Zheng Guo, Yanbo Zhang, Jianjing Shang, Leilei Yu, Ping Fu, Yizhi Liu, Xingang Li, Hao Wang, Ling Ren, Wei Zhang, Haifeng Hou, Xuerui Tan, Wei Wang
{"title":"缺血性中风的快速分诊:预测、预防和个性化医学背景下的机器学习驱动方法。","authors":"Yulu Zheng,&nbsp;Zheng Guo,&nbsp;Yanbo Zhang,&nbsp;Jianjing Shang,&nbsp;Leilei Yu,&nbsp;Ping Fu,&nbsp;Yizhi Liu,&nbsp;Xingang Li,&nbsp;Hao Wang,&nbsp;Ling Ren,&nbsp;Wei Zhang,&nbsp;Haifeng Hou,&nbsp;Xuerui Tan,&nbsp;Wei Wang","doi":"10.1007/s13167-022-00283-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Recognising the early signs of ischemic stroke (IS) in emergency settings has been challenging. Machine learning (ML), a robust tool for predictive, preventive and personalised medicine (PPPM/3PM), presents a possible solution for this issue and produces accurate predictions for real-time data processing.</p><p><strong>Methods: </strong>This investigation evaluated 4999 IS patients among a total of 10,476 adults included in the initial dataset, and 1076 IS subjects among 3935 participants in the external validation dataset. Six ML-based models for the prediction of IS were trained on the initial dataset of 10,476 participants (split participants into a training set [80%] and an internal validation set [20%]). Selected clinical laboratory features routinely assessed at admission were used to inform the models. Model performance was mainly evaluated by the area under the receiver operating characteristic (AUC) curve. Additional techniques-permutation feature importance (PFI), local interpretable model-agnostic explanations (LIME), and SHapley Additive exPlanations (SHAP)-were applied for explaining the black-box ML models.</p><p><strong>Results: </strong>Fifteen routine haematological and biochemical features were selected to establish ML-based models for the prediction of IS. The XGBoost-based model achieved the highest predictive performance, reaching AUCs of 0.91 (0.90-0.92) and 0.92 (0.91-0.93) in the internal and external datasets respectively. PFI globally revealed that demographic feature age, routine haematological parameters, haemoglobin and neutrophil count, and biochemical analytes total protein and high-density lipoprotein cholesterol were more influential on the model's prediction. LIME and SHAP showed similar local feature attribution explanations.</p><p><strong>Conclusion: </strong>In the context of PPPM/3PM, we used the selected predictors obtained from the results of common blood tests to develop and validate ML-based models for the diagnosis of IS. The XGBoost-based model offers the most accurate prediction. By incorporating the individualised patient profile, this prediction tool is simple and quick to administer. This is promising to support subjective decision making in resource-limited settings or primary care, thereby shortening the time window for the treatment, and improving outcomes after IS.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13167-022-00283-4.</p>","PeriodicalId":54292,"journal":{"name":"Epma Journal","volume":null,"pages":null},"PeriodicalIF":6.5000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203613/pdf/","citationCount":"12","resultStr":"{\"title\":\"Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine.\",\"authors\":\"Yulu Zheng,&nbsp;Zheng Guo,&nbsp;Yanbo Zhang,&nbsp;Jianjing Shang,&nbsp;Leilei Yu,&nbsp;Ping Fu,&nbsp;Yizhi Liu,&nbsp;Xingang Li,&nbsp;Hao Wang,&nbsp;Ling Ren,&nbsp;Wei Zhang,&nbsp;Haifeng Hou,&nbsp;Xuerui Tan,&nbsp;Wei Wang\",\"doi\":\"10.1007/s13167-022-00283-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Recognising the early signs of ischemic stroke (IS) in emergency settings has been challenging. Machine learning (ML), a robust tool for predictive, preventive and personalised medicine (PPPM/3PM), presents a possible solution for this issue and produces accurate predictions for real-time data processing.</p><p><strong>Methods: </strong>This investigation evaluated 4999 IS patients among a total of 10,476 adults included in the initial dataset, and 1076 IS subjects among 3935 participants in the external validation dataset. Six ML-based models for the prediction of IS were trained on the initial dataset of 10,476 participants (split participants into a training set [80%] and an internal validation set [20%]). Selected clinical laboratory features routinely assessed at admission were used to inform the models. Model performance was mainly evaluated by the area under the receiver operating characteristic (AUC) curve. Additional techniques-permutation feature importance (PFI), local interpretable model-agnostic explanations (LIME), and SHapley Additive exPlanations (SHAP)-were applied for explaining the black-box ML models.</p><p><strong>Results: </strong>Fifteen routine haematological and biochemical features were selected to establish ML-based models for the prediction of IS. The XGBoost-based model achieved the highest predictive performance, reaching AUCs of 0.91 (0.90-0.92) and 0.92 (0.91-0.93) in the internal and external datasets respectively. PFI globally revealed that demographic feature age, routine haematological parameters, haemoglobin and neutrophil count, and biochemical analytes total protein and high-density lipoprotein cholesterol were more influential on the model's prediction. LIME and SHAP showed similar local feature attribution explanations.</p><p><strong>Conclusion: </strong>In the context of PPPM/3PM, we used the selected predictors obtained from the results of common blood tests to develop and validate ML-based models for the diagnosis of IS. The XGBoost-based model offers the most accurate prediction. By incorporating the individualised patient profile, this prediction tool is simple and quick to administer. This is promising to support subjective decision making in resource-limited settings or primary care, thereby shortening the time window for the treatment, and improving outcomes after IS.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13167-022-00283-4.</p>\",\"PeriodicalId\":54292,\"journal\":{\"name\":\"Epma Journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2022-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203613/pdf/\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Epma Journal\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13167-022-00283-4\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epma Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13167-022-00283-4","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 12

摘要

背景:在紧急情况下识别缺血性卒中(IS)的早期症状一直具有挑战性。机器学习(ML)是预测、预防和个性化医疗(PPPM/3PM)的强大工具,为这一问题提供了可能的解决方案,并为实时数据处理提供了准确的预测。方法:本研究评估了初始数据集中10476名成人中的4999名IS患者,以及外部验证数据集中3935名参与者中的1076名IS受试者。在10,476名参与者的初始数据集上训练了6个基于ml的IS预测模型(将参与者分为训练集[80%]和内部验证集[20%])。入院时常规评估的选定的临床实验室特征用于模型。模型的性能主要通过接收工作特性曲线下面积(AUC)来评价。其他技术-排列特征重要性(PFI),局部可解释模型不可知论解释(LIME)和SHapley加性解释(SHAP)-被用于解释黑箱ML模型。结果:选择15个常规血液学和生化特征,建立基于ml的IS预测模型。基于xgboost的模型获得了最高的预测性能,在内部和外部数据集中分别达到0.91(0.90-0.92)和0.92(0.91-0.93)的auc。全球PFI显示,人口统计学特征年龄、常规血液学参数、血红蛋白和中性粒细胞计数、生化分析总蛋白和高密度脂蛋白胆固醇对模型预测的影响更大。LIME和SHAP表现出相似的局部特征归因解释。结论:在PPPM/3PM的背景下,我们使用从普通血液检查结果中获得的选定预测因子来开发和验证基于ml的IS诊断模型。基于xgboost的模型提供了最准确的预测。通过结合个体化患者资料,这种预测工具简单快捷。这有望在资源有限的环境或初级保健中支持主观决策,从而缩短治疗的时间窗口,并改善is后的结果。补充信息:在线版本包含补充资料,下载地址:10.1007/s13167-022-00283-4。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine.

Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine.

Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine.

Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine.

Background: Recognising the early signs of ischemic stroke (IS) in emergency settings has been challenging. Machine learning (ML), a robust tool for predictive, preventive and personalised medicine (PPPM/3PM), presents a possible solution for this issue and produces accurate predictions for real-time data processing.

Methods: This investigation evaluated 4999 IS patients among a total of 10,476 adults included in the initial dataset, and 1076 IS subjects among 3935 participants in the external validation dataset. Six ML-based models for the prediction of IS were trained on the initial dataset of 10,476 participants (split participants into a training set [80%] and an internal validation set [20%]). Selected clinical laboratory features routinely assessed at admission were used to inform the models. Model performance was mainly evaluated by the area under the receiver operating characteristic (AUC) curve. Additional techniques-permutation feature importance (PFI), local interpretable model-agnostic explanations (LIME), and SHapley Additive exPlanations (SHAP)-were applied for explaining the black-box ML models.

Results: Fifteen routine haematological and biochemical features were selected to establish ML-based models for the prediction of IS. The XGBoost-based model achieved the highest predictive performance, reaching AUCs of 0.91 (0.90-0.92) and 0.92 (0.91-0.93) in the internal and external datasets respectively. PFI globally revealed that demographic feature age, routine haematological parameters, haemoglobin and neutrophil count, and biochemical analytes total protein and high-density lipoprotein cholesterol were more influential on the model's prediction. LIME and SHAP showed similar local feature attribution explanations.

Conclusion: In the context of PPPM/3PM, we used the selected predictors obtained from the results of common blood tests to develop and validate ML-based models for the diagnosis of IS. The XGBoost-based model offers the most accurate prediction. By incorporating the individualised patient profile, this prediction tool is simple and quick to administer. This is promising to support subjective decision making in resource-limited settings or primary care, thereby shortening the time window for the treatment, and improving outcomes after IS.

Supplementary information: The online version contains supplementary material available at 10.1007/s13167-022-00283-4.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Epma Journal
Epma Journal Medicine-Biochemistry (medical)
CiteScore
11.30
自引率
23.10%
发文量
0
期刊介绍: PMA Journal is a journal of predictive, preventive and personalized medicine (PPPM). The journal provides expert viewpoints and research on medical innovations and advanced healthcare using predictive diagnostics, targeted preventive measures and personalized patient treatments. The journal is indexed by PubMed, Embase and Scopus.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信