急性缺血性脑卒中患者一年内继发性癫痫的预测模型。

IF 6.4 1区 生物学 Q1 BIOLOGY
eLife Pub Date : 2024-11-14 DOI:10.7554/eLife.98759
Jinxin Liu, Haoyue He, Yanglingxi Wang, Jun Du, Kaixin Liang, Jun Xue, Yidan Liang, Peng Chen, Shanshan Tian, Yongbing Deng
{"title":"急性缺血性脑卒中患者一年内继发性癫痫的预测模型。","authors":"Jinxin Liu, Haoyue He, Yanglingxi Wang, Jun Du, Kaixin Liang, Jun Xue, Yidan Liang, Peng Chen, Shanshan Tian, Yongbing Deng","doi":"10.7554/eLife.98759","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Post-stroke epilepsy (PSE) is a critical complication that worsens both prognosis and quality of life in patients with ischemic stroke. An interpretable machine learning model was developed to predict PSE using medical records from four hospitals in Chongqing.</p><p><strong>Methods: </strong>Medical records, imaging reports, and laboratory test results from 21,459 ischemic stroke patients were collected and analyzed. Univariable and multivariable statistical analyses identified key predictive factors. The dataset was split into a 70% training set and a 30% testing set. To address the class imbalance, the Synthetic Minority Oversampling Technique combined with Edited Nearest Neighbors was employed. Nine widely used machine learning algorithms were evaluated using relevant prediction metrics, with SHAP (SHapley Additive exPlanations) used to interpret the model and assess the contributions of different features.</p><p><strong>Results: </strong>Regression analyses revealed that complications such as hydrocephalus, cerebral hernia, and deep vein thrombosis, as well as specific brain regions (frontal, parietal, and temporal lobes), significantly contributed to PSE. Factors such as age, gender, NIH Stroke Scale (NIHSS) scores, and laboratory results like WBC count and D-dimer levels were associated with increased PSE risk. Tree-based methods like Random Forest, XGBoost, and LightGBM showed strong predictive performance, achieving an AUC of 0.99.</p><p><strong>Conclusions: </strong>The model accurately predicts PSE risk, with tree-based models demonstrating superior performance. NIHSS score, WBC count, and D-dimer were identified as the most crucial predictors.</p><p><strong>Funding: </strong>The research is funded by Central University basic research young teachers and students research ability promotion sub-projec t(2023CDJYGRH-ZD06), and by Emergency Medicine Chongqing Key Laboratory Talent Innovation and development joint fund project (2024RCCX10).</p>","PeriodicalId":11640,"journal":{"name":"eLife","volume":"13 ","pages":""},"PeriodicalIF":6.4000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11563573/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predictive models for secondary epilepsy in patients with acute ischemic stroke within one year.\",\"authors\":\"Jinxin Liu, Haoyue He, Yanglingxi Wang, Jun Du, Kaixin Liang, Jun Xue, Yidan Liang, Peng Chen, Shanshan Tian, Yongbing Deng\",\"doi\":\"10.7554/eLife.98759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Post-stroke epilepsy (PSE) is a critical complication that worsens both prognosis and quality of life in patients with ischemic stroke. An interpretable machine learning model was developed to predict PSE using medical records from four hospitals in Chongqing.</p><p><strong>Methods: </strong>Medical records, imaging reports, and laboratory test results from 21,459 ischemic stroke patients were collected and analyzed. Univariable and multivariable statistical analyses identified key predictive factors. The dataset was split into a 70% training set and a 30% testing set. To address the class imbalance, the Synthetic Minority Oversampling Technique combined with Edited Nearest Neighbors was employed. Nine widely used machine learning algorithms were evaluated using relevant prediction metrics, with SHAP (SHapley Additive exPlanations) used to interpret the model and assess the contributions of different features.</p><p><strong>Results: </strong>Regression analyses revealed that complications such as hydrocephalus, cerebral hernia, and deep vein thrombosis, as well as specific brain regions (frontal, parietal, and temporal lobes), significantly contributed to PSE. Factors such as age, gender, NIH Stroke Scale (NIHSS) scores, and laboratory results like WBC count and D-dimer levels were associated with increased PSE risk. Tree-based methods like Random Forest, XGBoost, and LightGBM showed strong predictive performance, achieving an AUC of 0.99.</p><p><strong>Conclusions: </strong>The model accurately predicts PSE risk, with tree-based models demonstrating superior performance. NIHSS score, WBC count, and D-dimer were identified as the most crucial predictors.</p><p><strong>Funding: </strong>The research is funded by Central University basic research young teachers and students research ability promotion sub-projec t(2023CDJYGRH-ZD06), and by Emergency Medicine Chongqing Key Laboratory Talent Innovation and development joint fund project (2024RCCX10).</p>\",\"PeriodicalId\":11640,\"journal\":{\"name\":\"eLife\",\"volume\":\"13 \",\"pages\":\"\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11563573/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"eLife\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.7554/eLife.98759\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"eLife","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7554/eLife.98759","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:脑卒中后癫痫(PSE)是缺血性脑卒中患者的一个重要并发症,会恶化预后和生活质量。本研究利用重庆市四家医院的医疗记录,开发了一个可解释的机器学习模型来预测 PSE:收集并分析了 21459 名缺血性脑卒中患者的病历、影像学报告和实验室检查结果。单变量和多变量统计分析确定了关键的预测因素。数据集分为 70% 的训练集和 30% 的测试集。为了解决类别不平衡的问题,采用了合成少数群体过度采样技术(Synthetic Minority Oversampling Technique)与编辑近邻技术(Edited Nearest Neighbors)相结合的方法。使用相关预测指标对九种广泛使用的机器学习算法进行了评估,并使用 SHAP(SHapley Additive exPlanations)来解释模型和评估不同特征的贡献:回归分析表明,脑积水、脑疝和深静脉血栓等并发症以及特定脑区(额叶、顶叶和颞叶)对 PSE 有显著影响。年龄、性别、美国国立卫生研究院卒中量表(NIHSS)评分以及白细胞计数和D-二聚体水平等实验室结果都与PSE风险的增加有关。随机森林、XGBoost 和 LightGBM 等基于树的方法显示出很强的预测能力,AUC 达到 0.99:该模型能准确预测 PSE 风险,其中基于树的模型表现更优。NIHSS评分、白细胞计数和D-二聚体被认为是最关键的预测因子:本研究受中央高校基础研究青年教师和学生科研能力提升子项目(2023CDJYGRH-ZD06)和急诊医学重庆市重点实验室人才创新发展联合基金项目(2024RCCX10)资助。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predictive models for secondary epilepsy in patients with acute ischemic stroke within one year.

Background: Post-stroke epilepsy (PSE) is a critical complication that worsens both prognosis and quality of life in patients with ischemic stroke. An interpretable machine learning model was developed to predict PSE using medical records from four hospitals in Chongqing.

Methods: Medical records, imaging reports, and laboratory test results from 21,459 ischemic stroke patients were collected and analyzed. Univariable and multivariable statistical analyses identified key predictive factors. The dataset was split into a 70% training set and a 30% testing set. To address the class imbalance, the Synthetic Minority Oversampling Technique combined with Edited Nearest Neighbors was employed. Nine widely used machine learning algorithms were evaluated using relevant prediction metrics, with SHAP (SHapley Additive exPlanations) used to interpret the model and assess the contributions of different features.

Results: Regression analyses revealed that complications such as hydrocephalus, cerebral hernia, and deep vein thrombosis, as well as specific brain regions (frontal, parietal, and temporal lobes), significantly contributed to PSE. Factors such as age, gender, NIH Stroke Scale (NIHSS) scores, and laboratory results like WBC count and D-dimer levels were associated with increased PSE risk. Tree-based methods like Random Forest, XGBoost, and LightGBM showed strong predictive performance, achieving an AUC of 0.99.

Conclusions: The model accurately predicts PSE risk, with tree-based models demonstrating superior performance. NIHSS score, WBC count, and D-dimer were identified as the most crucial predictors.

Funding: The research is funded by Central University basic research young teachers and students research ability promotion sub-projec t(2023CDJYGRH-ZD06), and by Emergency Medicine Chongqing Key Laboratory Talent Innovation and development joint fund project (2024RCCX10).

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
eLife
eLife BIOLOGY-
CiteScore
12.90
自引率
3.90%
发文量
3122
审稿时长
17 weeks
期刊介绍: eLife is a distinguished, not-for-profit, peer-reviewed open access scientific journal that specializes in the fields of biomedical and life sciences. eLife is known for its selective publication process, which includes a variety of article types such as: Research Articles: Detailed reports of original research findings. Short Reports: Concise presentations of significant findings that do not warrant a full-length research article. Tools and Resources: Descriptions of new tools, technologies, or resources that facilitate scientific research. Research Advances: Brief reports on significant scientific advancements that have immediate implications for the field. Scientific Correspondence: Short communications that comment on or provide additional information related to published articles. Review Articles: Comprehensive overviews of a specific topic or field within the life sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信