基于确定性和概率策略的机车轴温混合集成深度强化学习模型

IF 2.7 4区工程技术 Q2 TRANSPORTATION SCIENCE & TECHNOLOGY

Transportation Safety and Environment Pub Date : 2022-12-21 DOI:10.1093/tse/tdac055

Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan

{"title":"基于确定性和概率策略的机车轴温混合集成深度强化学习模型","authors":"Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan","doi":"10.1093/tse/tdac055","DOIUrl":null,"url":null,"abstract":"\n This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.","PeriodicalId":52804,"journal":{"name":"Transportation Safety and Environment","volume":" ","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy\",\"authors\":\"Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan\",\"doi\":\"10.1093/tse/tdac055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.\",\"PeriodicalId\":52804,\"journal\":{\"name\":\"Transportation Safety and Environment\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Safety and Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1093/tse/tdac055\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Safety and Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1093/tse/tdac055","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

结合小波包分解(WPD)、长短期记忆(LSTM)、门控循环单元(GRU)强化学习和广义自回归条件异方差(GARCH)算法，提出了机车轴温混合深度强化学习框架。利用WPD将原始非线性序列分解为子序列。然后建立深度学习预测器LSTM和GRU来预测每个子系列的未来轴温。Q-learning可以生成最优的集合权值来整合预测因子完成确定性预测，并利用GARCH基于确定性预测残差进行确定性预测。这些部分的混合集成结构有助于优化建模精度，为交通运输实时监测和故障诊断提供有效支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy

This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Transportation Safety and Environment TRANSPORTATION SCIENCE & TECHNOLOGY-

CiteScore

3.90

自引率

13.60%

发文量

审稿时长

10 weeks