{"title":"D2D内容共享的持续激励机制:一种深度强化学习方法","authors":"Min Chen, Haibo Wang, Xiaoli Chu","doi":"10.1109/ICCWorkshops49005.2020.9145226","DOIUrl":null,"url":null,"abstract":"Device-to-device (D2D) communication-based content sharing is regarded as a promising way to offload traffic from cellular networks, where incentive mechanisms are required to motivate mobile user equipment (UE) to participate in content sharing. In this paper, we firstly propose an improved scoring mechanism to provide continuous incentive and then study the impact of historical behavior on continuous motivation. Furthermore, to maintain continuous motivation while keeping the service quality of content sharing, we investigate the weights setting of historical behavior and current status in scores calculating, which is formulated as a stochastic dynamic programming (SDP) problem due to the long-term performance and the randomness of the network. To tackle the curse of dimensionality, a deep reinforcement learning (DRL) algorithm is employed for optimization. Simulation results show that with DRL, the mechanism is effective in motivating content-sharing continuously, improving the quality of service (QoS), and cutting down the sharing cost as well.","PeriodicalId":254869,"journal":{"name":"2020 IEEE International Conference on Communications Workshops (ICC Workshops)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Continuous Incentive Mechanism for D2D Content sharing: A Deep Reinforcement Learning Approach\",\"authors\":\"Min Chen, Haibo Wang, Xiaoli Chu\",\"doi\":\"10.1109/ICCWorkshops49005.2020.9145226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Device-to-device (D2D) communication-based content sharing is regarded as a promising way to offload traffic from cellular networks, where incentive mechanisms are required to motivate mobile user equipment (UE) to participate in content sharing. In this paper, we firstly propose an improved scoring mechanism to provide continuous incentive and then study the impact of historical behavior on continuous motivation. Furthermore, to maintain continuous motivation while keeping the service quality of content sharing, we investigate the weights setting of historical behavior and current status in scores calculating, which is formulated as a stochastic dynamic programming (SDP) problem due to the long-term performance and the randomness of the network. To tackle the curse of dimensionality, a deep reinforcement learning (DRL) algorithm is employed for optimization. Simulation results show that with DRL, the mechanism is effective in motivating content-sharing continuously, improving the quality of service (QoS), and cutting down the sharing cost as well.\",\"PeriodicalId\":254869,\"journal\":{\"name\":\"2020 IEEE International Conference on Communications Workshops (ICC Workshops)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Communications Workshops (ICC Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCWorkshops49005.2020.9145226\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Communications Workshops (ICC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWorkshops49005.2020.9145226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Continuous Incentive Mechanism for D2D Content sharing: A Deep Reinforcement Learning Approach
Device-to-device (D2D) communication-based content sharing is regarded as a promising way to offload traffic from cellular networks, where incentive mechanisms are required to motivate mobile user equipment (UE) to participate in content sharing. In this paper, we firstly propose an improved scoring mechanism to provide continuous incentive and then study the impact of historical behavior on continuous motivation. Furthermore, to maintain continuous motivation while keeping the service quality of content sharing, we investigate the weights setting of historical behavior and current status in scores calculating, which is formulated as a stochastic dynamic programming (SDP) problem due to the long-term performance and the randomness of the network. To tackle the curse of dimensionality, a deep reinforcement learning (DRL) algorithm is employed for optimization. Simulation results show that with DRL, the mechanism is effective in motivating content-sharing continuously, improving the quality of service (QoS), and cutting down the sharing cost as well.