推进冲击预测:利用先验知识和自我控制的数据,提高模型的准确性和普遍性。

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Cheng-Yu Tsai, Xiu-Rong Huang, Po-Tsun Kuo, Tzu-Tao Chen, Yun-Kai Yeh, Kuan-Yuan Chen, Arnab Majumdar, Chien-Hua Tseng
{"title":"推进冲击预测:利用先验知识和自我控制的数据,提高模型的准确性和普遍性。","authors":"Cheng-Yu Tsai, Xiu-Rong Huang, Po-Tsun Kuo, Tzu-Tao Chen, Yun-Kai Yeh, Kuan-Yuan Chen, Arnab Majumdar, Chien-Hua Tseng","doi":"10.1186/s12911-025-03108-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Timely intervention in shock is vital, as delays over one hour greatly increase mortality. This study aims to develop an enhanced machine learning model that improves predictive performance by utilizing self-controlled data and applying feature engineering informed by medical knowledge to physiological waveforms, enabling the prediction of shock one hour in advance without relying on blood tests.</p><p><strong>Methods: </strong>Patient data and physiological waveforms were obtained from the Medical Information Mart for Intensive Care III (MIMIC-3) database. Shock was defined as a mean arterial pressure ≤ 65 mmHg for more than one minute, combined with serum lactate levels ≥ 2 mmol/L within 12 h before or after the hypotension event. Waveforms used for prediction were extracted from 30 min time-segment before a 1-hour period prior to the event. Self-controlled waveforms were obtained from the same patient either one day before or up to seven days after the shock event.</p><p><strong>Results: </strong>The study included 389 ICU patients who met the shock criteria and had complete physiological waveform data available for analysis. A total of 299 features were derived: 90 from arterial blood pressure (ABP), 89 from electrocardiogram (ECG), 112 from respiratory waveforms (RESP), and 8 from blood oxygen saturation (SpO<sub>2</sub>). The weighted ensemble model showed the best performance with an AUC of 0.93 and accuracy of 84.15%, and sensitivity of 79.64% in the testing set. The most predictive features included ECG_HRV_pNN50 (proportion of successive heartbeat intervals differing by more than 50 ms), RESP_Width_Mean (mean width of respiratory waveform), RESP_Cycle_Rate_Mean (mean respiratory cycle rate), ABP_TimeSBP2DBP_SampEn (sample entropy of systolic-diastolic intervals), and ABP_AmplitudeDBP_Median (median amplitude of diastolic peaks).</p><p><strong>Conclusions: </strong>This study demonstrated the feasibility of predicting shock one hour before its onset using only four physiological waveforms, combined with feature engineering based on physiological concepts and self-sampling data. The model achieved a strong AUC and a high sensitivity.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"262"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261771/pdf/","citationCount":"0","resultStr":"{\"title\":\"Advancing shock prediction: leveraging prior knowledge and self-controlled data for enhanced model accuracy and generalizability.\",\"authors\":\"Cheng-Yu Tsai, Xiu-Rong Huang, Po-Tsun Kuo, Tzu-Tao Chen, Yun-Kai Yeh, Kuan-Yuan Chen, Arnab Majumdar, Chien-Hua Tseng\",\"doi\":\"10.1186/s12911-025-03108-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Timely intervention in shock is vital, as delays over one hour greatly increase mortality. This study aims to develop an enhanced machine learning model that improves predictive performance by utilizing self-controlled data and applying feature engineering informed by medical knowledge to physiological waveforms, enabling the prediction of shock one hour in advance without relying on blood tests.</p><p><strong>Methods: </strong>Patient data and physiological waveforms were obtained from the Medical Information Mart for Intensive Care III (MIMIC-3) database. Shock was defined as a mean arterial pressure ≤ 65 mmHg for more than one minute, combined with serum lactate levels ≥ 2 mmol/L within 12 h before or after the hypotension event. Waveforms used for prediction were extracted from 30 min time-segment before a 1-hour period prior to the event. Self-controlled waveforms were obtained from the same patient either one day before or up to seven days after the shock event.</p><p><strong>Results: </strong>The study included 389 ICU patients who met the shock criteria and had complete physiological waveform data available for analysis. A total of 299 features were derived: 90 from arterial blood pressure (ABP), 89 from electrocardiogram (ECG), 112 from respiratory waveforms (RESP), and 8 from blood oxygen saturation (SpO<sub>2</sub>). The weighted ensemble model showed the best performance with an AUC of 0.93 and accuracy of 84.15%, and sensitivity of 79.64% in the testing set. The most predictive features included ECG_HRV_pNN50 (proportion of successive heartbeat intervals differing by more than 50 ms), RESP_Width_Mean (mean width of respiratory waveform), RESP_Cycle_Rate_Mean (mean respiratory cycle rate), ABP_TimeSBP2DBP_SampEn (sample entropy of systolic-diastolic intervals), and ABP_AmplitudeDBP_Median (median amplitude of diastolic peaks).</p><p><strong>Conclusions: </strong>This study demonstrated the feasibility of predicting shock one hour before its onset using only four physiological waveforms, combined with feature engineering based on physiological concepts and self-sampling data. The model achieved a strong AUC and a high sensitivity.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"262\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261771/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03108-2\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03108-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

目的:及时干预休克是至关重要的,因为延误超过一小时会大大增加死亡率。本研究旨在开发一种增强的机器学习模型,通过利用自我控制数据和将医学知识告知的特征工程应用于生理波形来提高预测性能,从而在不依赖血液测试的情况下提前一小时预测休克。方法:从重症监护医学信息市场III (MIMIC-3)数据库中获取患者资料和生理波形。休克定义为平均动脉压≤65 mmHg持续1分钟以上,并在低血压事件发生前后12小时内血清乳酸水平≥2 mmol/L。用于预测的波形是在事件发生前1小时前的30分钟时间段内提取的。从同一患者身上获得自我控制的波形,要么在休克事件发生前一天,要么在休克事件发生后七天。结果:本研究纳入389例符合休克标准且有完整生理波形资料可供分析的ICU患者。共获得299个特征:90个来自动脉血压(ABP), 89个来自心电图(ECG), 112个来自呼吸波形(RESP), 8个来自血氧饱和度(SpO2)。在测试集中,加权集合模型的AUC为0.93,准确率为84.15%,灵敏度为79.64%,表现出最佳性能。最具预测性的特征包括ECG_HRV_pNN50(连续心跳间隔相差大于50 ms的比例)、RESP_Width_Mean(呼吸波形平均宽度)、RESP_Cycle_Rate_Mean(平均呼吸周期率)、ABP_TimeSBP2DBP_SampEn(收缩期-舒张期样本熵)和ABP_AmplitudeDBP_Median(舒张期峰值中值幅度)。结论:本研究证明了仅使用四种生理波形,结合基于生理概念和自采样数据的特征工程,在发作前一小时预测休克的可行性。该模型具有较强的AUC和较高的灵敏度。临床试验号:不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Advancing shock prediction: leveraging prior knowledge and self-controlled data for enhanced model accuracy and generalizability.

Objectives: Timely intervention in shock is vital, as delays over one hour greatly increase mortality. This study aims to develop an enhanced machine learning model that improves predictive performance by utilizing self-controlled data and applying feature engineering informed by medical knowledge to physiological waveforms, enabling the prediction of shock one hour in advance without relying on blood tests.

Methods: Patient data and physiological waveforms were obtained from the Medical Information Mart for Intensive Care III (MIMIC-3) database. Shock was defined as a mean arterial pressure ≤ 65 mmHg for more than one minute, combined with serum lactate levels ≥ 2 mmol/L within 12 h before or after the hypotension event. Waveforms used for prediction were extracted from 30 min time-segment before a 1-hour period prior to the event. Self-controlled waveforms were obtained from the same patient either one day before or up to seven days after the shock event.

Results: The study included 389 ICU patients who met the shock criteria and had complete physiological waveform data available for analysis. A total of 299 features were derived: 90 from arterial blood pressure (ABP), 89 from electrocardiogram (ECG), 112 from respiratory waveforms (RESP), and 8 from blood oxygen saturation (SpO2). The weighted ensemble model showed the best performance with an AUC of 0.93 and accuracy of 84.15%, and sensitivity of 79.64% in the testing set. The most predictive features included ECG_HRV_pNN50 (proportion of successive heartbeat intervals differing by more than 50 ms), RESP_Width_Mean (mean width of respiratory waveform), RESP_Cycle_Rate_Mean (mean respiratory cycle rate), ABP_TimeSBP2DBP_SampEn (sample entropy of systolic-diastolic intervals), and ABP_AmplitudeDBP_Median (median amplitude of diastolic peaks).

Conclusions: This study demonstrated the feasibility of predicting shock one hour before its onset using only four physiological waveforms, combined with feature engineering based on physiological concepts and self-sampling data. The model achieved a strong AUC and a high sensitivity.

Clinical trial number: Not applicable.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信