机器人世界杯三维足球仿真中的强化学习方法

Mohammad Amin Fahami, M. Roshanzamir, N. H. Izadi
{"title":"机器人世界杯三维足球仿真中的强化学习方法","authors":"Mohammad Amin Fahami, M. Roshanzamir, N. H. Izadi","doi":"10.1109/ICCKE.2017.8167920","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is one of the best methods to train autonomous robots. Using this method, a robot can learn to make optimal decisions without detailed programming and hard coded instructions. So, this method is useful for learning complex robotic behaviors. For example, in RoboCup competitions this method will be very useful in learning different behaviors. We propose a method for training a robot to score a goal from anywhere on the field by one or more kicks. Using reinforcement learning, Nao robot will learn the optimal policy to kick towards desired points correctly. Learning process is done in two phases. In the first phase, Nao learns to kick such that the ball goes more distance with minimum divergence from the desired path. In the second phase, the robot learns an optimal policy to score a goal by one or more kicks. Using this method, our robot performance increased significantly compared with kicking towards predetermined points in the goal.","PeriodicalId":151934,"journal":{"name":"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A reinforcement learning approach to score goals in RoboCup 3D soccer simulation for nao humanoid robot\",\"authors\":\"Mohammad Amin Fahami, M. Roshanzamir, N. H. Izadi\",\"doi\":\"10.1109/ICCKE.2017.8167920\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning is one of the best methods to train autonomous robots. Using this method, a robot can learn to make optimal decisions without detailed programming and hard coded instructions. So, this method is useful for learning complex robotic behaviors. For example, in RoboCup competitions this method will be very useful in learning different behaviors. We propose a method for training a robot to score a goal from anywhere on the field by one or more kicks. Using reinforcement learning, Nao robot will learn the optimal policy to kick towards desired points correctly. Learning process is done in two phases. In the first phase, Nao learns to kick such that the ball goes more distance with minimum divergence from the desired path. In the second phase, the robot learns an optimal policy to score a goal by one or more kicks. Using this method, our robot performance increased significantly compared with kicking towards predetermined points in the goal.\",\"PeriodicalId\":151934,\"journal\":{\"name\":\"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2017.8167920\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2017.8167920","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

强化学习是训练自主机器人的最佳方法之一。使用这种方法,机器人可以在没有详细编程和硬编码指令的情况下学习做出最佳决策。因此,这种方法对于学习复杂的机器人行为是有用的。例如,在机器人世界杯比赛中,这种方法对于学习不同的行为是非常有用的。我们提出了一种训练机器人在场上任何位置通过一次或多次射门得分的方法。通过强化学习,Nao机器人将学习最优策略,以正确地踢向所需的点。学习过程分为两个阶段。在第一阶段,Nao学习如何踢球,使球在最小偏离预期路径的情况下走得更远。在第二阶段,机器人学习一个最佳策略,通过一次或多次射门得分。使用这种方法,我们的机器人性能显著提高与踢向预定点的目标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A reinforcement learning approach to score goals in RoboCup 3D soccer simulation for nao humanoid robot
Reinforcement learning is one of the best methods to train autonomous robots. Using this method, a robot can learn to make optimal decisions without detailed programming and hard coded instructions. So, this method is useful for learning complex robotic behaviors. For example, in RoboCup competitions this method will be very useful in learning different behaviors. We propose a method for training a robot to score a goal from anywhere on the field by one or more kicks. Using reinforcement learning, Nao robot will learn the optimal policy to kick towards desired points correctly. Learning process is done in two phases. In the first phase, Nao learns to kick such that the ball goes more distance with minimum divergence from the desired path. In the second phase, the robot learns an optimal policy to score a goal by one or more kicks. Using this method, our robot performance increased significantly compared with kicking towards predetermined points in the goal.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信