{"title":"基于确定性和概率策略的机车轴温混合集成深度强化学习模型","authors":"Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan","doi":"10.1093/tse/tdac055","DOIUrl":null,"url":null,"abstract":"\n This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.","PeriodicalId":52804,"journal":{"name":"Transportation Safety and Environment","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy\",\"authors\":\"Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan\",\"doi\":\"10.1093/tse/tdac055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.\",\"PeriodicalId\":52804,\"journal\":{\"name\":\"Transportation Safety and Environment\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Safety and Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1093/tse/tdac055\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Safety and Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1093/tse/tdac055","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy
This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.