A Foraging Strategy with Risk Response for Individual Robots in Adversarial Environments

ACM Transactions on Intelligent Systems and Technology (TIST) Pub Date : 2022-02-04 DOI:10.1145/3514499

K. Di, Yifeng Zhou, Fuhan Yan, Jiuchuan Jiang, Shaofu Yang, Yichuan Jiang

{"title":"A Foraging Strategy with Risk Response for Individual Robots in Adversarial Environments","authors":"K. Di, Yifeng Zhou, Fuhan Yan, Jiuchuan Jiang, Shaofu Yang, Yichuan Jiang","doi":"10.1145/3514499","DOIUrl":null,"url":null,"abstract":"As an essential problem in robotics, foraging means that robots collect objects from a given environment and return them to a specified location. On many occasions, robots are required to perform foraging tasks in adversarial environments, such as battlefield rescue, where potential adversaries may damage robots with a certain probability. The longer an individual robot moves through adversarial environments, the higher the probability of being damaged by adversaries. The robot system can gain utility only when the robot brings carried objects back to a predetermined home station. Such a risk of being damaged makes returning home at different locations potentially relevant to the expected utility produced by the robot. Thus, the individual robot faces a dilemma when it responds to the potential risks in adversarial environments: whether to return the carried resources home or continue foraging tasks. In this article, two fundamental environment settings are discussed, homogeneous cases and heterogeneous cases. The former is analyzed as having both the optimal substructure property and the non-aftereffect property. Then, we present a dynamic programming (DP) algorithm that can find an optimal solution with polynomial time complexity. For the latter, it is proven that finding an optimal solution is \\( \\mathcal {NP} \\) -hard. We then propose a heuristic algorithm: A division hierarchical path planning (DHPP) algorithm that is based on the idea of dividing the foraging routes generated initially into a certain number of subroutes to dilute risks. Finally, these algorithms are extensively evaluated in simulations, concluding that in adversarial environments, they can significantly improve the productivity of an individual robot before it is damaged.","PeriodicalId":123526,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology (TIST)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Intelligent Systems and Technology (TIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3514499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

As an essential problem in robotics, foraging means that robots collect objects from a given environment and return them to a specified location. On many occasions, robots are required to perform foraging tasks in adversarial environments, such as battlefield rescue, where potential adversaries may damage robots with a certain probability. The longer an individual robot moves through adversarial environments, the higher the probability of being damaged by adversaries. The robot system can gain utility only when the robot brings carried objects back to a predetermined home station. Such a risk of being damaged makes returning home at different locations potentially relevant to the expected utility produced by the robot. Thus, the individual robot faces a dilemma when it responds to the potential risks in adversarial environments: whether to return the carried resources home or continue foraging tasks. In this article, two fundamental environment settings are discussed, homogeneous cases and heterogeneous cases. The former is analyzed as having both the optimal substructure property and the non-aftereffect property. Then, we present a dynamic programming (DP) algorithm that can find an optimal solution with polynomial time complexity. For the latter, it is proven that finding an optimal solution is \( \mathcal {NP} \) -hard. We then propose a heuristic algorithm: A division hierarchical path planning (DHPP) algorithm that is based on the idea of dividing the foraging routes generated initially into a certain number of subroutes to dilute risks. Finally, these algorithms are extensively evaluated in simulations, concluding that in adversarial environments, they can significantly improve the productivity of an individual robot before it is damaged.

查看原文本刊更多论文

个体机器人在对抗环境下的风险响应觅食策略

觅食是机器人技术中的一个重要问题，它是指机器人从给定的环境中收集物体并将其返回到指定的位置。在很多情况下，机器人需要在对抗性环境中执行觅食任务，例如战场救援，潜在的对手可能会以一定的概率伤害机器人。单个机器人在敌对环境中移动的时间越长，被对手破坏的可能性就越高。只有当机器人将携带的物体带回预定的主站时，机器人系统才能获得效用。这种被损坏的风险使得在不同地点回家可能与机器人产生的预期效用相关。因此，个体机器人在应对敌对环境中的潜在风险时面临两难:是将携带的资源返回家园，还是继续觅食任务。在本文中，讨论了两种基本的环境设置，同质情况和异质情况。分析了前者既具有最优子结构特性，又具有无后效特性。然后，我们提出了一种动态规划(DP)算法，该算法可以找到多项式时间复杂度的最优解。对于后者，证明了找到最优解\( \mathcal {NP} \) -困难。然后，我们提出了一种启发式算法:划分分层路径规划(DHPP)算法，该算法基于将初始生成的觅食路径划分为一定数量的子路径以降低风险的思想。最后，这些算法在模拟中进行了广泛的评估，得出的结论是，在对抗环境中，它们可以在单个机器人被损坏之前显着提高其生产率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Intelligent Systems and Technology (TIST)

自引率

0.00%

发文量