{"title":"基于深度强化学习的车辆网络公平性最大化研究","authors":"Shenghui Zhao, Bao Gui, Guilin Chen, Bin Yang","doi":"10.1109/NaNA56854.2022.00027","DOIUrl":null,"url":null,"abstract":"This paper investigates the rate fairness maximization (FM) in a vehicular network consisting of multiple vehicle-to-vehicle(V2V) pairs and vehicle-to-infrastructure (V2I) pairs. To this end, we formulate the FM as an optimal problem subject to the constraints of the quality of service (QoS) requirements, spectrum and power resources. It is usually challenging to solve this nonlinear and nonconvex optimization problem. To tackle with this challenge, we further model the spectrum sharing between V2V and V2I links, and the transmit powers of V2V and V2I users as a Markov decision process. Then, a deep reinforcement learning-based algorithm is proposed to maximize the rate fairness while meeting the constraints of the QoS requirements by jointly optimizing the allocations of spectrum and power resources. Finally, simulation results are presented to illustrate our findings.","PeriodicalId":113743,"journal":{"name":"2022 International Conference on Networking and Network Applications (NaNA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On Rate Fairness Maximization of Vehicular Networks: A Deep Reinforcement Learning Approach\",\"authors\":\"Shenghui Zhao, Bao Gui, Guilin Chen, Bin Yang\",\"doi\":\"10.1109/NaNA56854.2022.00027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper investigates the rate fairness maximization (FM) in a vehicular network consisting of multiple vehicle-to-vehicle(V2V) pairs and vehicle-to-infrastructure (V2I) pairs. To this end, we formulate the FM as an optimal problem subject to the constraints of the quality of service (QoS) requirements, spectrum and power resources. It is usually challenging to solve this nonlinear and nonconvex optimization problem. To tackle with this challenge, we further model the spectrum sharing between V2V and V2I links, and the transmit powers of V2V and V2I users as a Markov decision process. Then, a deep reinforcement learning-based algorithm is proposed to maximize the rate fairness while meeting the constraints of the QoS requirements by jointly optimizing the allocations of spectrum and power resources. Finally, simulation results are presented to illustrate our findings.\",\"PeriodicalId\":113743,\"journal\":{\"name\":\"2022 International Conference on Networking and Network Applications (NaNA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Networking and Network Applications (NaNA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NaNA56854.2022.00027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Networking and Network Applications (NaNA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NaNA56854.2022.00027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On Rate Fairness Maximization of Vehicular Networks: A Deep Reinforcement Learning Approach
This paper investigates the rate fairness maximization (FM) in a vehicular network consisting of multiple vehicle-to-vehicle(V2V) pairs and vehicle-to-infrastructure (V2I) pairs. To this end, we formulate the FM as an optimal problem subject to the constraints of the quality of service (QoS) requirements, spectrum and power resources. It is usually challenging to solve this nonlinear and nonconvex optimization problem. To tackle with this challenge, we further model the spectrum sharing between V2V and V2I links, and the transmit powers of V2V and V2I users as a Markov decision process. Then, a deep reinforcement learning-based algorithm is proposed to maximize the rate fairness while meeting the constraints of the QoS requirements by jointly optimizing the allocations of spectrum and power resources. Finally, simulation results are presented to illustrate our findings.