Data Augmentation to Improve the Performance of Ensemble Learning for System Failure Prediction with Limited Observations

Guo Shi, B. Liu, Lesley Walls
{"title":"Data Augmentation to Improve the Performance of Ensemble Learning for System Failure Prediction with Limited Observations","authors":"Guo Shi, B. Liu, Lesley Walls","doi":"10.1109/ICRMS55680.2022.9944577","DOIUrl":null,"url":null,"abstract":"Ensemble learning has been widely used to improve the performance and robustness of machine learning algorithms on time series data. However, in real operational processes where the observed data is limited, it hinders the capability of ensemble learning algorithms. To address the challenge of limited observed data, this paper proposes a novel three-layer ensemble learning framework by use of data augmentation. Firstly, multiple classical time series augmentation methods are applied to increase the size of the data set. Subsequently, after pre-processing, these augmented data is trained by multiple basic learners with K-fold cross-validation as the first layer of the developed ensemble learning framework. The outputs of the first layer are integrated via LASSO to further improve the prediction performance, which serves as the second layer of the developed framework. Finally, the third-layer output is generated by averaging the prediction of the second layer and the output from an improved Long-Short Term Memory model that provides prediction based on the augmented data. A case study on a real wastewater treatment plant is used to illustrate the effectiveness of the proposed method.","PeriodicalId":421500,"journal":{"name":"2022 13th International Conference on Reliability, Maintainability, and Safety (ICRMS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 13th International Conference on Reliability, Maintainability, and Safety (ICRMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRMS55680.2022.9944577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Ensemble learning has been widely used to improve the performance and robustness of machine learning algorithms on time series data. However, in real operational processes where the observed data is limited, it hinders the capability of ensemble learning algorithms. To address the challenge of limited observed data, this paper proposes a novel three-layer ensemble learning framework by use of data augmentation. Firstly, multiple classical time series augmentation methods are applied to increase the size of the data set. Subsequently, after pre-processing, these augmented data is trained by multiple basic learners with K-fold cross-validation as the first layer of the developed ensemble learning framework. The outputs of the first layer are integrated via LASSO to further improve the prediction performance, which serves as the second layer of the developed framework. Finally, the third-layer output is generated by averaging the prediction of the second layer and the output from an improved Long-Short Term Memory model that provides prediction based on the augmented data. A case study on a real wastewater treatment plant is used to illustrate the effectiveness of the proposed method.
有限观测值下改进集成学习系统故障预测性能的数据增强
集成学习已被广泛用于提高机器学习算法在时间序列数据上的性能和鲁棒性。然而,在实际操作过程中,观察到的数据是有限的,这阻碍了集成学习算法的能力。为了解决观测数据有限的问题,本文提出了一种基于数据增强的三层集成学习框架。首先,采用多种经典时间序列增广方法增大数据集的大小;随后,经过预处理后,这些增强数据由多个基本学习者进行训练,并使用K-fold交叉验证作为开发的集成学习框架的第一层。第一层的输出通过LASSO进行集成,进一步提高预测性能,作为开发框架的第二层。最后,第三层输出是通过对第二层的预测和基于增强数据提供预测的改进长短期记忆模型的输出进行平均而生成的。最后以实际污水处理厂为例,说明了该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信