{"title":"Reinforcement learning with fuzzy evaluative feedback for a biped robot","authors":"Changjiu Zhou, Qingchun Meng","doi":"10.1109/ROBOT.2000.845328","DOIUrl":null,"url":null,"abstract":"Proposes a fuzzy reinforcement learning algorithm for biped gait synthesis. It is based on a modified GARIC (generalized approximate reasoning for intelligent control) architecture that can accept fuzzy evaluative feedback rather than a numerical one. The proposed gait synthesizer forms the initial gait from intuitive balancing knowledge, and it is then trained by the fuzzy reinforcement learning algorithm that uses a fuzzy critical signal to evaluate the degree of success for the biped dynamic walking by means of the zero moment point. The performance and applicability of the proposed method are illustrated through biped simulation.","PeriodicalId":286422,"journal":{"name":"Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"36","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBOT.2000.845328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 36
Abstract
Proposes a fuzzy reinforcement learning algorithm for biped gait synthesis. It is based on a modified GARIC (generalized approximate reasoning for intelligent control) architecture that can accept fuzzy evaluative feedback rather than a numerical one. The proposed gait synthesizer forms the initial gait from intuitive balancing knowledge, and it is then trained by the fuzzy reinforcement learning algorithm that uses a fuzzy critical signal to evaluate the degree of success for the biped dynamic walking by means of the zero moment point. The performance and applicability of the proposed method are illustrated through biped simulation.