基于强化学习策略搜索的自主水下航行器控制方法

Europe Oceans 2005 Pub Date : 2005-06-20 DOI:10.1109/OCEANSE.2005.1513157

A. El-Fakdi, M. Carreras, N. Palomeras, P. Ridao

{"title":"基于强化学习策略搜索的自主水下航行器控制方法","authors":"A. El-Fakdi, M. Carreras, N. Palomeras, P. Ridao","doi":"10.1109/OCEANSE.2005.1513157","DOIUrl":null,"url":null,"abstract":"Autonomous underwater vehicles (AUV) represent a challenging control problem with complex, noisy, dynamics. Nowadays, not only the continuous scientific advances in underwater robotics but the increasing number of subsea missions and its complexity ask for an automatization of submarine processes. This paper proposes a high-level control system for solving the action selection problem of an autonomous robot. The system is characterized by the use of reinforcement learning direct policy search methods (RLDPS) for learning the internal state/action mapping of some behaviors. We demonstrate its feasibility with simulated experiments using the model of our underwater robot URIS in a target following task.","PeriodicalId":120840,"journal":{"name":"Europe Oceans 2005","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Autonomous underwater vehicle control using reinforcement learning policy search methods\",\"authors\":\"A. El-Fakdi, M. Carreras, N. Palomeras, P. Ridao\",\"doi\":\"10.1109/OCEANSE.2005.1513157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Autonomous underwater vehicles (AUV) represent a challenging control problem with complex, noisy, dynamics. Nowadays, not only the continuous scientific advances in underwater robotics but the increasing number of subsea missions and its complexity ask for an automatization of submarine processes. This paper proposes a high-level control system for solving the action selection problem of an autonomous robot. The system is characterized by the use of reinforcement learning direct policy search methods (RLDPS) for learning the internal state/action mapping of some behaviors. We demonstrate its feasibility with simulated experiments using the model of our underwater robot URIS in a target following task.\",\"PeriodicalId\":120840,\"journal\":{\"name\":\"Europe Oceans 2005\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Europe Oceans 2005\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/OCEANSE.2005.1513157\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Europe Oceans 2005","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/OCEANSE.2005.1513157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

摘要

自主水下航行器(AUV)具有复杂的、噪声的、动态的控制问题。随着水下机器人技术的不断发展，水下任务的数量和复杂性不断增加，对水下作业过程的自动化提出了更高的要求。针对自主机器人的动作选择问题，提出了一种高级控制系统。该系统的特点是使用强化学习直接策略搜索方法(RLDPS)来学习某些行为的内部状态/动作映射。利用水下机器人URIS模型在目标跟踪任务中进行了仿真实验，验证了该方法的可行性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Autonomous underwater vehicle control using reinforcement learning policy search methods

Autonomous underwater vehicles (AUV) represent a challenging control problem with complex, noisy, dynamics. Nowadays, not only the continuous scientific advances in underwater robotics but the increasing number of subsea missions and its complexity ask for an automatization of submarine processes. This paper proposes a high-level control system for solving the action selection problem of an autonomous robot. The system is characterized by the use of reinforcement learning direct policy search methods (RLDPS) for learning the internal state/action mapping of some behaviors. We demonstrate its feasibility with simulated experiments using the model of our underwater robot URIS in a target following task.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Europe Oceans 2005

自引率

0.00%

发文量