一种基于任务序列增量学习的响应式导航方法

Proceedings of the First Workshop on Robot Motion and Control. RoMoCo'99 (Cat. No.99EX353) Pub Date : 1999-06-28 DOI:10.1109/ROMOCO.1999.791048

F. Davesne, C. Barret

{"title":"一种基于任务序列增量学习的响应式导航方法","authors":"F. Davesne, C. Barret","doi":"10.1109/ROMOCO.1999.791048","DOIUrl":null,"url":null,"abstract":"Within the contest of learning sequences of basic tasks to build a complex behavior, a method is proposed to coordinate a hierarchical set of tasks. Each one possesses a set of sub-tasks lower in the hierarchy, which must be coordinated to respect a binary perceptive constraint. For each task, the coordination is achieved by a reinforcement learning inspired algorithm based on the heuristic which does not need internal parameters. A validation of the method is given, using a simulated Khepera robot. A goal-seeking behavior is divided into three tasks: go to the goal, follow a wall on the left and on the right. The last two tasks utilize basic behaviors and two other sub-tasks: avoid obstacles on the left and on the right. All the tasks may use a set of 5 basic behaviors. The global goal-seeking behavior and the wall-following and the obstacle avoidance tasks are learned during a step by step learning process.","PeriodicalId":131049,"journal":{"name":"Proceedings of the First Workshop on Robot Motion and Control. RoMoCo'99 (Cat. No.99EX353)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A reactive navigation method based on an incremental learning of tasks sequences\",\"authors\":\"F. Davesne, C. Barret\",\"doi\":\"10.1109/ROMOCO.1999.791048\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Within the contest of learning sequences of basic tasks to build a complex behavior, a method is proposed to coordinate a hierarchical set of tasks. Each one possesses a set of sub-tasks lower in the hierarchy, which must be coordinated to respect a binary perceptive constraint. For each task, the coordination is achieved by a reinforcement learning inspired algorithm based on the heuristic which does not need internal parameters. A validation of the method is given, using a simulated Khepera robot. A goal-seeking behavior is divided into three tasks: go to the goal, follow a wall on the left and on the right. The last two tasks utilize basic behaviors and two other sub-tasks: avoid obstacles on the left and on the right. All the tasks may use a set of 5 basic behaviors. The global goal-seeking behavior and the wall-following and the obstacle avoidance tasks are learned during a step by step learning process.\",\"PeriodicalId\":131049,\"journal\":{\"name\":\"Proceedings of the First Workshop on Robot Motion and Control. RoMoCo'99 (Cat. No.99EX353)\",\"volume\":\"108 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the First Workshop on Robot Motion and Control. RoMoCo'99 (Cat. No.99EX353)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROMOCO.1999.791048\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the First Workshop on Robot Motion and Control. RoMoCo'99 (Cat. No.99EX353)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROMOCO.1999.791048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在基本任务序列的学习竞赛中构建复杂行为，提出了一种协调分层任务集的方法。每个任务都有一组层次较低的子任务，这些子任务必须协调以尊重二元感知约束。对于每个任务，通过基于启发式的强化学习启发算法实现协调，不需要内部参数。最后利用仿真Khepera机器人对该方法进行了验证。追求目标的行为分为三个任务:到达目标，沿着左边的墙走，沿着右边的墙走。最后两个任务使用基本行为和另外两个子任务:避开左边和右边的障碍物。所有的任务都可以使用5个基本行为。全局目标寻求行为和沿墙、避障任务是一个循序渐进的学习过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A reactive navigation method based on an incremental learning of tasks sequences

Within the contest of learning sequences of basic tasks to build a complex behavior, a method is proposed to coordinate a hierarchical set of tasks. Each one possesses a set of sub-tasks lower in the hierarchy, which must be coordinated to respect a binary perceptive constraint. For each task, the coordination is achieved by a reinforcement learning inspired algorithm based on the heuristic which does not need internal parameters. A validation of the method is given, using a simulated Khepera robot. A goal-seeking behavior is divided into three tasks: go to the goal, follow a wall on the left and on the right. The last two tasks utilize basic behaviors and two other sub-tasks: avoid obstacles on the left and on the right. All the tasks may use a set of 5 basic behaviors. The global goal-seeking behavior and the wall-following and the obstacle avoidance tasks are learned during a step by step learning process.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the First Workshop on Robot Motion and Control. RoMoCo'99 (Cat. No.99EX353)

自引率

0.00%

发文量