Learning coordinated maneuvers in complex environments: a sumo experiment
Jiming Liu, Chow Kwong Pok, HuiKa Keung
Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), 1999-07-06
DOI: 10.1109/CEC.1999.781945 (https://doi.org/10.1109/CEC.1999.781945)
This paper describes a dual-agent system capable of learning eye-body-coordinated maneuvers for a sumo contest. The two agents rely on each other, either offering feedback on the physical performance of a selected maneuver or giving advice on candidate maneuvers that may improve on the previous performance. At the core of this learning system lies a multi-phase genetic-programming approach that aims to enable the player to gradually acquire sophisticated sumo maneuvers. As illustrated in sumo learning experiments involving opponents of complex shapes and sizes, the proposed multi-phase learning allows specialized strategic maneuvers to be developed from general ones, and hence demonstrates the efficiency of maneuver acquisition. We provide details of the problem addressed and of the implemented solutions: a mobile robot that performs sumo maneuvers and a computational assistant that coaches the robot. In addition, we show the actual performance of the sumo agent, as a result of coaching, in dealing with a number of difficult sumo situations.
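
The feedback/advice loop and the multi-phase idea can be illustrated with a toy sketch. It is not the authors' implementation: a simple evolutionary search over fixed-length action lists stands in for genetic programming, and every name here (`ACTIONS`, `player_feedback`, `coach_advice`, `run_phase`, `multi_phase`, the target sequences) is a hypothetical chosen for illustration. The "player" scores a maneuver, the "coach" proposes new candidates from the scored ones, and the second phase starts from the population evolved in the first.

```python
import random

# Hypothetical maneuver primitives; the paper does not list these.
ACTIONS = ["forward", "left", "right", "push"]


def player_feedback(maneuver, target):
    """Player agent: 'perform' a maneuver and report a score.

    Stand-in for physical performance: the number of positions that
    match a target sequence. This is an assumption for the sketch only.
    """
    return sum(a == b for a, b in zip(maneuver, target))


def coach_advice(population, scores, rng, mutation_rate=0.2):
    """Coach agent: propose new candidate maneuvers from the scored ones."""
    ranked = [m for _, m in sorted(zip(scores, population),
                                   key=lambda pair: pair[0], reverse=True)]
    elite = ranked[: len(ranked) // 2]  # keep the better half unchanged
    children = []
    while len(elite) + len(children) < len(population):
        mom, dad = rng.sample(elite, 2)
        cut = rng.randrange(1, len(mom))  # one-point crossover
        child = mom[:cut] + dad[cut:]
        child = [rng.choice(ACTIONS) if rng.random() < mutation_rate else a
                 for a in child]
        children.append(child)
    return elite + children


def run_phase(population, target, generations, rng):
    """Alternate feedback and advice; return final population and best scores."""
    best_history = []
    for _ in range(generations):
        scores = [player_feedback(m, target) for m in population]
        best_history.append(max(scores))
        population = coach_advice(population, scores, rng)
    return population, best_history


def multi_phase(seed=0, size=20, length=8, generations=30):
    """Phase 1 learns a general maneuver; phase 2 specializes from it."""
    rng = random.Random(seed)
    general = ["forward"] * length
    special = ["forward"] * (length - 2) + ["left", "push"]
    population = [[rng.choice(ACTIONS) for _ in range(length)]
                  for _ in range(size)]
    population, general_history = run_phase(population, general, generations, rng)
    # Phase 2 is seeded with the phase-1 population rather than random candidates.
    population, special_history = run_phase(population, special, generations, rng)
    return general_history, special_history
```

Calling `multi_phase()` returns the best score per generation for each phase. Because the better half of each generation is carried over unchanged, the best score within a phase never decreases in this sketch.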