Performance improvement of intelligent machines through feedback
P. Lima, G. Saridis
Proceedings of Tenth International Symposium on Intelligent Control, 1995-08-27
DOI: 10.1109/ISIC.1995.525040 (https://doi.org/10.1109/ISIC.1995.525040)
Citations: 5
Abstract
This paper introduces an algorithm for improving the performance of intelligent machines, based on a cost function that is recursively estimated from feedback. The interfaces between the three levels of the hierarchical intelligent controller (HIC) for the intelligent machine are modeled by a two-stage hierarchical learning stochastic automaton (HLSA). The cost function used by the HLSA combines jointly defined measures of reliability and computational cost. Novel contributions of the paper include an original hierarchical reinforcement learning scheme and a new cost function for intelligent machines. Simulation results show the application of the methodology to an intelligent robotic system.
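
The abstract does not give the update rules, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a recursive sample-mean estimate of the cost per action and swaps in a standard pursuit-style probability update for the learning rule. The names `CostEstimatingAutomaton`, `TwoStageHLSA`, and `combined_cost`, the additive form of the cost, and the `weight` and `step` parameters are all hypothetical.

```python
import random


def combined_cost(success, compute_cost, weight=0.5):
    """Hypothetical cost: unreliability penalty plus weighted computational cost."""
    return (0.0 if success else 1.0) + weight * compute_cost


class CostEstimatingAutomaton:
    """Stochastic automaton that favors the action with the lowest estimated cost."""

    def __init__(self, n_actions, step=0.1, rng=None):
        self.n = n_actions
        self.step = step                      # assumed learning rate
        self.rng = rng or random.Random(0)
        self.probs = [1.0 / n_actions] * n_actions
        self.cost = [0.0] * n_actions         # recursive cost estimates
        self.count = [0] * n_actions

    def choose(self):
        # Sample an action index according to the current probabilities.
        return self.rng.choices(range(self.n), weights=self.probs)[0]

    def update(self, action, observed_cost):
        # Recursive sample mean of the cost fed back for this action.
        self.count[action] += 1
        self.cost[action] += (observed_cost - self.cost[action]) / self.count[action]
        # Pursuit-style step: shift probability toward the best action tried so far.
        tried = [i for i in range(self.n) if self.count[i] > 0]
        best = min(tried, key=lambda i: self.cost[i])
        self.probs = [
            (1.0 - self.step) * p + (self.step if i == best else 0.0)
            for i, p in enumerate(self.probs)
        ]


class TwoStageHLSA:
    """Two stages: a top automaton picks a branch, that branch's automaton picks an action."""

    def __init__(self, n_top, n_low, step=0.1, seed=0):
        rng = random.Random(seed)
        self.top = CostEstimatingAutomaton(n_top, step, rng)
        self.low = [CostEstimatingAutomaton(n_low, step, rng) for _ in range(n_top)]

    def choose(self):
        i = self.top.choose()
        return i, self.low[i].choose()

    def update(self, i, j, cost):
        # The same feedback updates both stages along the chosen path.
        self.low[i].update(j, cost)
        self.top.update(i, cost)


if __name__ == "__main__":
    # Made-up environment: success probability and compute cost per (branch, action).
    env = {(0, 0): (0.6, 0.8), (0, 1): (0.7, 0.5), (1, 0): (0.9, 0.2), (1, 1): (0.5, 0.1)}
    hlsa = TwoStageHLSA(n_top=2, n_low=2)
    sim = random.Random(1)
    for _ in range(2000):
        i, j = hlsa.choose()
        p_success, c = env[(i, j)]
        hlsa.update(i, j, combined_cost(sim.random() < p_success, c))
    print(hlsa.top.probs, [a.probs for a in hlsa.low])
```

In this sketch both stages learn from the same scalar feedback, so the top stage's estimate for a branch is the average cost observed under whatever the lower stage chose there. The paper's actual scheme may propagate feedback between levels differently.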