基于深度强化学习的移动众感任务分配

Xi Tao, Wei Song
{"title":"基于深度强化学习的移动众感任务分配","authors":"Xi Tao, Wei Song","doi":"10.1109/WCNC45663.2020.9120489","DOIUrl":null,"url":null,"abstract":"Mobile crowdsensing (MCS) is a new and promising paradigm of data collection in large-scale sensing and computing. A large group of users with mobile devices are recruited in a specific area to accomplish sensing tasks. An essential aspect of an MCS application is task allocation, which aims to efficiently assign sensing tasks to the recruited workers. Due to various resource and quality constraints, the MCS task allocation problem is often an NP-hard optimization problem. Traditional greedy or heuristic approaches are usually subject to performance loss in a certain degree so as to maintain tractability or accommodate special requirements such as incentive constraints. In this paper, we attempt to employ a deep reinforcement learning method to search for a more efficient task allocation solution. Specifically, we use a double deep Q-network (DDQN) to solve the task allocation problem as a path-planning problem with time windows. Our formulated problem takes into account location-dependency and time-sensitivity of sensing tasks, as well as the resource limits of workers in terms of maximum travelling distances. Simulations are conducted to compare the DDQN-based solution with two standard baseline solutions. The results show that our proposed solution outperforms the baseline solutions in terms of the platform’s profit and the coverage of tasks.","PeriodicalId":415064,"journal":{"name":"2020 IEEE Wireless Communications and Networking Conference (WCNC)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Task Allocation for Mobile Crowdsensing with Deep Reinforcement Learning\",\"authors\":\"Xi Tao, Wei Song\",\"doi\":\"10.1109/WCNC45663.2020.9120489\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mobile crowdsensing (MCS) is a new and promising paradigm of data collection in large-scale sensing and computing. A large group of users with mobile devices are recruited in a specific area to accomplish sensing tasks. An essential aspect of an MCS application is task allocation, which aims to efficiently assign sensing tasks to the recruited workers. Due to various resource and quality constraints, the MCS task allocation problem is often an NP-hard optimization problem. Traditional greedy or heuristic approaches are usually subject to performance loss in a certain degree so as to maintain tractability or accommodate special requirements such as incentive constraints. In this paper, we attempt to employ a deep reinforcement learning method to search for a more efficient task allocation solution. Specifically, we use a double deep Q-network (DDQN) to solve the task allocation problem as a path-planning problem with time windows. Our formulated problem takes into account location-dependency and time-sensitivity of sensing tasks, as well as the resource limits of workers in terms of maximum travelling distances. Simulations are conducted to compare the DDQN-based solution with two standard baseline solutions. The results show that our proposed solution outperforms the baseline solutions in terms of the platform’s profit and the coverage of tasks.\",\"PeriodicalId\":415064,\"journal\":{\"name\":\"2020 IEEE Wireless Communications and Networking Conference (WCNC)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Wireless Communications and Networking Conference (WCNC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WCNC45663.2020.9120489\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Wireless Communications and Networking Conference (WCNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCNC45663.2020.9120489","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

移动群体传感(MCS)是一种新的、有前途的大规模传感和计算数据采集范式。在特定区域招募一大群拥有移动设备的用户来完成传感任务。MCS应用程序的一个重要方面是任务分配,其目的是有效地分配感知任务给招募的工人。由于各种资源和质量约束,MCS任务分配问题往往是一个NP-hard优化问题。传统的贪心或启发式方法为了保持可追溯性或适应激励约束等特殊要求,通常会在一定程度上造成绩效损失。在本文中,我们尝试采用深度强化学习方法来寻找更有效的任务分配解决方案。具体来说,我们使用双深度q网络(DDQN)将任务分配问题作为一个带时间窗的路径规划问题来解决。我们制定的问题考虑了传感任务的位置依赖性和时间敏感性,以及工人在最大旅行距离方面的资源限制。仿真比较了基于ddqn的解决方案和两个标准基线解决方案。结果表明,我们提出的解决方案在平台利润和任务覆盖方面优于基线解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Task Allocation for Mobile Crowdsensing with Deep Reinforcement Learning
Mobile crowdsensing (MCS) is a new and promising paradigm of data collection in large-scale sensing and computing. A large group of users with mobile devices are recruited in a specific area to accomplish sensing tasks. An essential aspect of an MCS application is task allocation, which aims to efficiently assign sensing tasks to the recruited workers. Due to various resource and quality constraints, the MCS task allocation problem is often an NP-hard optimization problem. Traditional greedy or heuristic approaches are usually subject to performance loss in a certain degree so as to maintain tractability or accommodate special requirements such as incentive constraints. In this paper, we attempt to employ a deep reinforcement learning method to search for a more efficient task allocation solution. Specifically, we use a double deep Q-network (DDQN) to solve the task allocation problem as a path-planning problem with time windows. Our formulated problem takes into account location-dependency and time-sensitivity of sensing tasks, as well as the resource limits of workers in terms of maximum travelling distances. Simulations are conducted to compare the DDQN-based solution with two standard baseline solutions. The results show that our proposed solution outperforms the baseline solutions in terms of the platform’s profit and the coverage of tasks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信