基于强化学习的配给算法的少用动作探索问题研究

2011 15th IEEE International Conference on Intelligent Engineering Systems Pub Date : 2011-06-23 DOI:10.1109/INES.2011.5954770

M. Stoica, G. Calangiu, F. Sisak

{"title":"基于强化学习的配给算法的少用动作探索问题研究","authors":"M. Stoica, G. Calangiu, F. Sisak","doi":"10.1109/INES.2011.5954770","DOIUrl":null,"url":null,"abstract":"Programming by demonstration is an interesting subject in the field of robotics and it is developing more and more in the direction of robots for services and humanoid robots. Programming by demonstration is much less researched when it comes to industrial robots. One of the reasons is that an industrial robot has to act in a precise and certain manner. However, extending research regarding programming by demonstration to the field of industrial robots could lead to the creation of intelligent systems where the industrial robot could be programmed in an easier way. The goal of our research is to develop an intelligent system useful for industrial robot programming by demonstration. The reasoning algorithms are the mechanisms which offer flexibility to the proposed system. We have focused our research on the creation of a reasoning algorithm based on artificial neural networks [1, 2]. Because the results of this algorithm were not satisfying we have switched our focus to the development of a reasoning algorithm based on reinforcement learning [3]. The algorithm is based on the idea that marks can be assigned to each possible action whenever the robot is in an unknown state. The exploration of less-used actions plays also an important role in the case the robot must to take a decision. Based on the marks and on the exploration feature of the algorithm the robot updates its behaviour. This paper presents a description and some studies on less-used actions exploration problem of the algorithm. Some chapters of the paper will deal with the problems implementing the algorithm, the conducted experiments in terms of exploration feature of the algorithm and the results obtained. The analysis of the results and the characteristics of the algorithm in terms of less-used actions exploration are also discussed in this paper.","PeriodicalId":414812,"journal":{"name":"2011 15th IEEE International Conference on Intelligent Engineering Systems","volume":"319 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning\",\"authors\":\"M. Stoica, G. Calangiu, F. Sisak\",\"doi\":\"10.1109/INES.2011.5954770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Programming by demonstration is an interesting subject in the field of robotics and it is developing more and more in the direction of robots for services and humanoid robots. Programming by demonstration is much less researched when it comes to industrial robots. One of the reasons is that an industrial robot has to act in a precise and certain manner. However, extending research regarding programming by demonstration to the field of industrial robots could lead to the creation of intelligent systems where the industrial robot could be programmed in an easier way. The goal of our research is to develop an intelligent system useful for industrial robot programming by demonstration. The reasoning algorithms are the mechanisms which offer flexibility to the proposed system. We have focused our research on the creation of a reasoning algorithm based on artificial neural networks [1, 2]. Because the results of this algorithm were not satisfying we have switched our focus to the development of a reasoning algorithm based on reinforcement learning [3]. The algorithm is based on the idea that marks can be assigned to each possible action whenever the robot is in an unknown state. The exploration of less-used actions plays also an important role in the case the robot must to take a decision. Based on the marks and on the exploration feature of the algorithm the robot updates its behaviour. This paper presents a description and some studies on less-used actions exploration problem of the algorithm. Some chapters of the paper will deal with the problems implementing the algorithm, the conducted experiments in terms of exploration feature of the algorithm and the results obtained. The analysis of the results and the characteristics of the algorithm in terms of less-used actions exploration are also discussed in this paper.\",\"PeriodicalId\":414812,\"journal\":{\"name\":\"2011 15th IEEE International Conference on Intelligent Engineering Systems\",\"volume\":\"319 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 15th IEEE International Conference on Intelligent Engineering Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INES.2011.5954770\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 15th IEEE International Conference on Intelligent Engineering Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INES.2011.5954770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

演示编程是机器人领域一个有趣的研究课题，它正朝着服务型机器人和类人机器人的方向发展。当涉及到工业机器人时，通过演示编程的研究要少得多。其中一个原因是工业机器人必须以精确和确定的方式行动。然而，通过示范将关于编程的研究扩展到工业机器人领域可能会导致智能系统的创建，其中工业机器人可以以更容易的方式编程。我们的研究目标是通过演示开发一个可用于工业机器人编程的智能系统。推理算法是为所提出的系统提供灵活性的机制。我们的研究重点是基于人工神经网络的推理算法的创建[1,2]。由于该算法的结果并不令人满意，因此我们将重点转向了基于强化学习的推理算法的开发[3]。该算法基于这样一种思想，即当机器人处于未知状态时，可以为每个可能的动作分配标记。在机器人必须做出决定的情况下，探索较少使用的动作也起着重要的作用。基于标记和算法的探索特性，机器人更新其行为。本文对该算法的少用动作探索问题进行了描述和研究。论文的一些章节将讨论算法的实现问题，并就算法的探索特点和得到的结果进行了实验。本文还从少用动作探索的角度对结果进行了分析，并对算法的特点进行了讨论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning

Programming by demonstration is an interesting subject in the field of robotics and it is developing more and more in the direction of robots for services and humanoid robots. Programming by demonstration is much less researched when it comes to industrial robots. One of the reasons is that an industrial robot has to act in a precise and certain manner. However, extending research regarding programming by demonstration to the field of industrial robots could lead to the creation of intelligent systems where the industrial robot could be programmed in an easier way. The goal of our research is to develop an intelligent system useful for industrial robot programming by demonstration. The reasoning algorithms are the mechanisms which offer flexibility to the proposed system. We have focused our research on the creation of a reasoning algorithm based on artificial neural networks [1, 2]. Because the results of this algorithm were not satisfying we have switched our focus to the development of a reasoning algorithm based on reinforcement learning [3]. The algorithm is based on the idea that marks can be assigned to each possible action whenever the robot is in an unknown state. The exploration of less-used actions plays also an important role in the case the robot must to take a decision. Based on the marks and on the exploration feature of the algorithm the robot updates its behaviour. This paper presents a description and some studies on less-used actions exploration problem of the algorithm. Some chapters of the paper will deal with the problems implementing the algorithm, the conducted experiments in terms of exploration feature of the algorithm and the results obtained. The analysis of the results and the characteristics of the algorithm in terms of less-used actions exploration are also discussed in this paper.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 15th IEEE International Conference on Intelligent Engineering Systems

自引率

0.00%

发文量