{"title":"随机自动机在多模态性能准则参数自优化中的应用","authors":"I. Shapiro, K. Narendra","doi":"10.1109/TSSC.1969.300228","DOIUrl":null,"url":null,"abstract":"The application of stochastic automata to adaptive parameter optimization problems is considered. The fundamental problem is that of relating the concepts of automata theory and mathematical psychology learning theory to the usual notion of a performance index in a control system. Consideration is given to a number of possible automata structures, linear and nonlinear. One particular linear model is derived with optimal rather than expedient properties of convergence. A basic feature of this model is that it is based on a system response set of rewards and inactions, the latter being substituted for the more common penalty responses. This choice of response set is directly related to the achievement of the desired behavior. Simulations are described for the maximization of multimodal performance functions intentionally constructed to demonstrate the use of the method in situations where relative extrema occur. An example is also given of the automaton as a direct adaptive controller for a third order control system.","PeriodicalId":120916,"journal":{"name":"IEEE Trans. Syst. Sci. Cybern.","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1969-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"106","resultStr":"{\"title\":\"Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria\",\"authors\":\"I. Shapiro, K. Narendra\",\"doi\":\"10.1109/TSSC.1969.300228\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The application of stochastic automata to adaptive parameter optimization problems is considered. The fundamental problem is that of relating the concepts of automata theory and mathematical psychology learning theory to the usual notion of a performance index in a control system. Consideration is given to a number of possible automata structures, linear and nonlinear. One particular linear model is derived with optimal rather than expedient properties of convergence. A basic feature of this model is that it is based on a system response set of rewards and inactions, the latter being substituted for the more common penalty responses. This choice of response set is directly related to the achievement of the desired behavior. Simulations are described for the maximization of multimodal performance functions intentionally constructed to demonstrate the use of the method in situations where relative extrema occur. An example is also given of the automaton as a direct adaptive controller for a third order control system.\",\"PeriodicalId\":120916,\"journal\":{\"name\":\"IEEE Trans. Syst. Sci. Cybern.\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1969-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"106\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Trans. Syst. Sci. Cybern.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSSC.1969.300228\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Trans. Syst. Sci. Cybern.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSSC.1969.300228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria
The application of stochastic automata to adaptive parameter optimization problems is considered. The fundamental problem is that of relating the concepts of automata theory and mathematical psychology learning theory to the usual notion of a performance index in a control system. Consideration is given to a number of possible automata structures, linear and nonlinear. One particular linear model is derived with optimal rather than expedient properties of convergence. A basic feature of this model is that it is based on a system response set of rewards and inactions, the latter being substituted for the more common penalty responses. This choice of response set is directly related to the achievement of the desired behavior. Simulations are described for the maximization of multimodal performance functions intentionally constructed to demonstrate the use of the method in situations where relative extrema occur. An example is also given of the automaton as a direct adaptive controller for a third order control system.