Zhengling He, Xianxiang Chen, Zhen Fang, Weidong Yi, Chenshuo Wang, Li Jiang, Zhongkai Tong, Zhongrui Bai, Yueqi Li, Yichen Pan
{"title":"基于LSTM递归神经网络特征提取的集成学习早期脓毒症预测","authors":"Zhengling He, Xianxiang Chen, Zhen Fang, Weidong Yi, Chenshuo Wang, Li Jiang, Zhongkai Tong, Zhongrui Bai, Yueqi Li, Yichen Pan","doi":"10.23919/CinC49843.2019.9005929","DOIUrl":null,"url":null,"abstract":"Early prediction of sepsis can help to identify potential risks in time and help take necessary measures to prevent more dangerous situations from occurring. In PhysioNet/Computing in Cardiology Challenge 2019, we integrate Long Short Term Memory (LSTM) recurrent neural network and ensemble learning to achieve early sepsis prediction. Specifically, we tackle the problem of class imbalance and data missing firstly, and then we manually extract features according to the prior knowledge from the medical field. In addition, we regard the prediction of sepsis as a time series prediction problem and pre-train LSTM-based models as feature extractors to obtain the \"deep\" features on time series that might be related to the onset of sepsis. Manual features and \"deep\" features are then used to train prediction models under the framework of ensemble learning, including Extreme Gradient Boosting (XGBoost) and Gradient Boosting Decision Tree (GBDT) regressor. The final normalized utility score our team (UCAS_DataMiner) have obtained was 0.313 on full hidden test set.","PeriodicalId":6697,"journal":{"name":"2019 Computing in Cardiology (CinC)","volume":"1049 1","pages":"Page 1-Page 4"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Early Sepsis Prediction Using Ensemble Learning with Features Extracted from LSTM Recurrent Neural Network\",\"authors\":\"Zhengling He, Xianxiang Chen, Zhen Fang, Weidong Yi, Chenshuo Wang, Li Jiang, Zhongkai Tong, Zhongrui Bai, Yueqi Li, Yichen Pan\",\"doi\":\"10.23919/CinC49843.2019.9005929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Early prediction of sepsis can help to identify potential risks in time and help take necessary measures to prevent more dangerous situations from occurring. In PhysioNet/Computing in Cardiology Challenge 2019, we integrate Long Short Term Memory (LSTM) recurrent neural network and ensemble learning to achieve early sepsis prediction. Specifically, we tackle the problem of class imbalance and data missing firstly, and then we manually extract features according to the prior knowledge from the medical field. In addition, we regard the prediction of sepsis as a time series prediction problem and pre-train LSTM-based models as feature extractors to obtain the \\\"deep\\\" features on time series that might be related to the onset of sepsis. Manual features and \\\"deep\\\" features are then used to train prediction models under the framework of ensemble learning, including Extreme Gradient Boosting (XGBoost) and Gradient Boosting Decision Tree (GBDT) regressor. The final normalized utility score our team (UCAS_DataMiner) have obtained was 0.313 on full hidden test set.\",\"PeriodicalId\":6697,\"journal\":{\"name\":\"2019 Computing in Cardiology (CinC)\",\"volume\":\"1049 1\",\"pages\":\"Page 1-Page 4\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Computing in Cardiology (CinC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/CinC49843.2019.9005929\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Computing in Cardiology (CinC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/CinC49843.2019.9005929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
摘要
对脓毒症的早期预测有助于及时发现潜在的风险,并采取必要的措施,防止更危险的情况发生。在PhysioNet/Computing In Cardiology Challenge 2019中,我们整合了长短期记忆(LSTM)递归神经网络和集成学习来实现早期脓毒症预测。具体来说,我们首先解决类别失衡和数据缺失的问题,然后根据医学领域的先验知识手动提取特征。此外,我们将脓毒症的预测视为一个时间序列预测问题,并将基于lstm的预训练模型作为特征提取器,以获得时间序列上可能与脓毒症发病相关的“深度”特征。然后使用人工特征和“深度”特征在集成学习框架下训练预测模型,包括极端梯度增强(XGBoost)和梯度增强决策树(GBDT)回归器。我们的团队(UCAS_DataMiner)在完全隐藏测试集上获得的最终标准化效用得分为0.313。
Early Sepsis Prediction Using Ensemble Learning with Features Extracted from LSTM Recurrent Neural Network
Early prediction of sepsis can help to identify potential risks in time and help take necessary measures to prevent more dangerous situations from occurring. In PhysioNet/Computing in Cardiology Challenge 2019, we integrate Long Short Term Memory (LSTM) recurrent neural network and ensemble learning to achieve early sepsis prediction. Specifically, we tackle the problem of class imbalance and data missing firstly, and then we manually extract features according to the prior knowledge from the medical field. In addition, we regard the prediction of sepsis as a time series prediction problem and pre-train LSTM-based models as feature extractors to obtain the "deep" features on time series that might be related to the onset of sepsis. Manual features and "deep" features are then used to train prediction models under the framework of ensemble learning, including Extreme Gradient Boosting (XGBoost) and Gradient Boosting Decision Tree (GBDT) regressor. The final normalized utility score our team (UCAS_DataMiner) have obtained was 0.313 on full hidden test set.