Hyper-Learning Algorithms for Online Evolution of Robot Controllers
Fernando Silva, L. Correia, A. Christensen
ACM Transactions on Autonomous and Adaptive Systems (TAAS), published 2017-09-20
DOI: 10.1145/3092815
Citations: 3
Abstract
A long-standing goal in artificial intelligence and robotics is synthesising agents that can effectively learn and adapt throughout their lifetime. One open-ended approach to behaviour learning in autonomous robots is online evolution, which is part of the evolutionary robotics field of research. In online evolution approaches, an evolutionary algorithm is executed on the robots during task execution, which enables continuous optimisation and adaptation of behaviour. Despite the potential for automatic behaviour learning, online evolution has not been widely adopted because it often requires several hours or days to synthesise solutions to a given task. In this respect, research in the field has failed to develop a prevalent algorithm able to effectively synthesise solutions to a large number of different tasks in a timely manner. Rather than focusing on a single algorithm, we argue for more general mechanisms that can combine the benefits of different algorithms to increase the performance of online evolution of robot controllers. We conduct a comprehensive assessment of a novel approach called online hyper-evolution (OHE). Robots executing OHE use the different sources of feedback information traditionally associated with controller evaluation to find effective evolutionary algorithms during task execution. First, we study two approaches: OHE-fitness, which uses the fitness score of controllers as the criterion to select promising algorithms over time, and OHE-diversity, which relies on the behavioural diversity of controllers for algorithm selection. We then propose a novel class of techniques called OHE-hybrid, which combine diversity and fitness to search for suitable algorithms. 
In addition to their effectiveness at selecting suitable algorithms, the different OHE approaches are evaluated for their ability to construct algorithms by controlling which algorithmic components should be employed for controller generation (e.g., mutation, crossover, among others), an unprecedented approach in evolutionary robotics. Results show that OHE (i) facilitates the evolution of controllers with high performance, (ii) can increase effectiveness at different stages of evolution by combining the benefits of multiple algorithms over time, and (iii) can be effectively applied to construct new algorithms during task execution. Overall, our study shows that OHE is a powerful new paradigm that allows robots to improve their learning process as they operate in the task environment.
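To make the idea of OHE-fitness concrete, here is a minimal, self-contained sketch of the general pattern the abstract describes: an outer loop that uses fitness feedback to choose among candidate evolutionary algorithms (here, different combinations of mutation and crossover components) during execution. All names, the toy task, and the selection rule are illustrative assumptions, not the authors' implementation.

```python
import random

# Toy stand-in for a robot task: maximise the negative squared
# distance of a parameter vector to a hidden target.
TARGET = [0.6, -0.2, 0.9]

def fitness(genome):
    return -sum((g - t) ** 2 for g, t in zip(genome, TARGET))

def mutate(genome, sigma=0.1):
    return [g + random.gauss(0, sigma) for g in genome]

def crossover(a, b):
    return [random.choice(pair) for pair in zip(a, b)]

# Two candidate "algorithms" built from different component
# combinations, echoing OHE's construction of algorithms by
# toggling components such as mutation and crossover.
def algo_mutation_only(pop):
    parent = max(pop, key=fitness)
    return [mutate(parent) for _ in pop]

def algo_mut_xover(pop):
    ranked = sorted(pop, key=fitness, reverse=True)
    a, b = ranked[0], ranked[1]
    return [mutate(crossover(a, b)) for _ in pop]

def ohe_fitness(algorithms, pop, epochs=10, gens_per_epoch=5):
    """OHE-fitness-like outer loop (a sketch, not the paper's method):
    each epoch, run every candidate algorithm briefly and keep the
    population produced by the algorithm whose best offspring fitness
    is highest."""
    history = []
    for _ in range(epochs):
        scored = []
        for algo in algorithms:
            trial = pop
            for _ in range(gens_per_epoch):
                trial = algo(trial)
            scored.append((max(fitness(g) for g in trial), algo, trial))
        best_score, best_algo, pop = max(scored, key=lambda s: s[0])
        history.append((best_algo.__name__, best_score))
    return pop, history

random.seed(1)
population = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(8)]
final_pop, history = ohe_fitness([algo_mutation_only, algo_mut_xover],
                                 population)
print(history)
```

An OHE-diversity or OHE-hybrid variant would replace the `max(fitness(...))` selection criterion with a behavioural-diversity score, or a combination of the two, while keeping the same outer loop.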