基于确定性和概率策略的机车轴温混合集成深度强化学习模型

IF 2.7 4区 工程技术 Q2 TRANSPORTATION SCIENCE & TECHNOLOGY
Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan
{"title":"基于确定性和概率策略的机车轴温混合集成深度强化学习模型","authors":"Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan","doi":"10.1093/tse/tdac055","DOIUrl":null,"url":null,"abstract":"\n This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.","PeriodicalId":52804,"journal":{"name":"Transportation Safety and Environment","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy\",\"authors\":\"Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan\",\"doi\":\"10.1093/tse/tdac055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.\",\"PeriodicalId\":52804,\"journal\":{\"name\":\"Transportation Safety and Environment\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Safety and Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1093/tse/tdac055\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Safety and Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1093/tse/tdac055","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

结合小波包分解(WPD)、长短期记忆(LSTM)、门控循环单元(GRU)强化学习和广义自回归条件异方差(GARCH)算法,提出了机车轴温混合深度强化学习框架。利用WPD将原始非线性序列分解为子序列。然后建立深度学习预测器LSTM和GRU来预测每个子系列的未来轴温。Q-learning可以生成最优的集合权值来整合预测因子完成确定性预测,并利用GARCH基于确定性预测残差进行确定性预测。这些部分的混合集成结构有助于优化建模精度,为交通运输实时监测和故障诊断提供有效支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy
This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Transportation Safety and Environment
Transportation Safety and Environment TRANSPORTATION SCIENCE & TECHNOLOGY-
CiteScore
3.90
自引率
13.60%
发文量
32
审稿时长
10 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信