{"title":"Adaptive control of a class of nonidentifiable Markov chains","authors":"A. Jalali, M. Ferguson","doi":"10.1109/CDC.1990.203606","DOIUrl":null,"url":null,"abstract":"Adaptive control of finite unknown Markov chains is considered. A new performance criterion from the theory of bandit processes has recently been introduced for adaptive control of Markov chains. The new performance criterion is stronger than the expected average cost criterion and is more appropriate when the identifiability condition does not hold. An adaptive controller is derived to achieve optimality for a modified version of the new performance criterion.","PeriodicalId":287089,"journal":{"name":"29th IEEE Conference on Decision and Control","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1990-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"29th IEEE Conference on Decision and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.1990.203606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive control of a class of nonidentifiable Markov chains
Adaptive control of finite unknown Markov chains is considered. A new performance criterion from the theory of bandit processes has recently been introduced for adaptive control of Markov chains. The new performance criterion is stronger than the expected average cost criterion and is more appropriate when the identifiability condition does not hold. An adaptive controller is derived to achieve optimality for a modified version of the new performance criterion.