{"title":"Energy-Saving Predictive Video Streaming with Deep Reinforcement Learning","authors":"Dong Liu, Jianyu Zhao, Chenyang Yang","doi":"10.1109/GLOBECOM38437.2019.9013784","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a policy to optimize predictive power allocation for video streaming over mobile networks with deep reinforcement learning. The objective is to minimize the average energy consumption for video transmission under the quality of service constraint that avoids video stalling. To handle the continuous state and action spaces, we resort to deep deterministic policy gradient to solve the formulated problem. In contrast to previous predictive resource policies for video streaming, the proposed policy operates in an on- line and end-to-end manner. By judiciously designing action and state, the policy can exploit future information without explicit prediction. Simulation results show that the proposed policy can converge closely to the optimal policy with perfect prediction of future large-scale channel gains and outperforms the prediction-based optimal policy when prediction errors exist.","PeriodicalId":6868,"journal":{"name":"2019 IEEE Global Communications Conference (GLOBECOM)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Global Communications Conference (GLOBECOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GLOBECOM38437.2019.9013784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
In this paper, we propose a policy to optimize predictive power allocation for video streaming over mobile networks with deep reinforcement learning. The objective is to minimize the average energy consumption for video transmission under the quality of service constraint that avoids video stalling. To handle the continuous state and action spaces, we resort to deep deterministic policy gradient to solve the formulated problem. In contrast to previous predictive resource policies for video streaming, the proposed policy operates in an on- line and end-to-end manner. By judiciously designing action and state, the policy can exploit future information without explicit prediction. Simulation results show that the proposed policy can converge closely to the optimal policy with perfect prediction of future large-scale channel gains and outperforms the prediction-based optimal policy when prediction errors exist.