{"title":"Underwater Acoustic Adaptive Modulation with Reinforcement Learning and Channel Prediction","authors":"Yuzhi Zhang, J. Zhu, Yang Liu, Bin Wang","doi":"10.1145/3491315.3491352","DOIUrl":null,"url":null,"abstract":"We present our ongoing work on reinforcement learning based adaptive modulation for underwater acoustic (UWA) communication. As the long propagation delay, the feedback channel state information (CSI) will be outdated for UWA adaptive modulation. Reinforcement learning can learn the optimal transmission mode even with the outdated CSI. Furthermore, this paper employs long short-term memory neural network to predict CSI for updating learning table, and then selects modulation mode by ϵ– greedy algorithm. The simulation results revealed that the network throughput of proposed method is improved under error bit rate constraint, which is compared to Q-learning with state transition probability predicted CSI and direct feedback CSI in time varying UWA channel.","PeriodicalId":191580,"journal":{"name":"Proceedings of the 15th International Conference on Underwater Networks & Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 15th International Conference on Underwater Networks & Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3491315.3491352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
We present our ongoing work on reinforcement learning based adaptive modulation for underwater acoustic (UWA) communication. As the long propagation delay, the feedback channel state information (CSI) will be outdated for UWA adaptive modulation. Reinforcement learning can learn the optimal transmission mode even with the outdated CSI. Furthermore, this paper employs long short-term memory neural network to predict CSI for updating learning table, and then selects modulation mode by ϵ– greedy algorithm. The simulation results revealed that the network throughput of proposed method is improved under error bit rate constraint, which is compared to Q-learning with state transition probability predicted CSI and direct feedback CSI in time varying UWA channel.