{"title":"Continuous Incentive Mechanism for D2D Content sharing: A Deep Reinforcement Learning Approach","authors":"Min Chen, Haibo Wang, Xiaoli Chu","doi":"10.1109/ICCWorkshops49005.2020.9145226","DOIUrl":null,"url":null,"abstract":"Device-to-device (D2D) communication-based content sharing is regarded as a promising way to offload traffic from cellular networks, where incentive mechanisms are required to motivate mobile user equipment (UE) to participate in content sharing. In this paper, we firstly propose an improved scoring mechanism to provide continuous incentive and then study the impact of historical behavior on continuous motivation. Furthermore, to maintain continuous motivation while keeping the service quality of content sharing, we investigate the weights setting of historical behavior and current status in scores calculating, which is formulated as a stochastic dynamic programming (SDP) problem due to the long-term performance and the randomness of the network. To tackle the curse of dimensionality, a deep reinforcement learning (DRL) algorithm is employed for optimization. Simulation results show that with DRL, the mechanism is effective in motivating content-sharing continuously, improving the quality of service (QoS), and cutting down the sharing cost as well.","PeriodicalId":254869,"journal":{"name":"2020 IEEE International Conference on Communications Workshops (ICC Workshops)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Communications Workshops (ICC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWorkshops49005.2020.9145226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Device-to-device (D2D) communication-based content sharing is regarded as a promising way to offload traffic from cellular networks, where incentive mechanisms are required to motivate mobile user equipment (UE) to participate in content sharing. In this paper, we firstly propose an improved scoring mechanism to provide continuous incentive and then study the impact of historical behavior on continuous motivation. Furthermore, to maintain continuous motivation while keeping the service quality of content sharing, we investigate the weights setting of historical behavior and current status in scores calculating, which is formulated as a stochastic dynamic programming (SDP) problem due to the long-term performance and the randomness of the network. To tackle the curse of dimensionality, a deep reinforcement learning (DRL) algorithm is employed for optimization. Simulation results show that with DRL, the mechanism is effective in motivating content-sharing continuously, improving the quality of service (QoS), and cutting down the sharing cost as well.