{"title":"A Working Theory of a Learned Model in a Partially Observable Environment for Cognitive Decision-Making","authors":"Emma Graham","doi":"10.1109/sieds55548.2022.9799386","DOIUrl":null,"url":null,"abstract":"To survive in our unpredictable, evolving world, cognitive beings learn to make decisions with the limited knowledge of the world they process. Reflective of an individual's view of the world, a cognitive decision-making model is explored in a partially observable, stochastic environment. The cognitive model uses the Partially Observable Markov Decision Process problem formulation, which is a framework for neurological models and considered implementable in neural circuitry [26] [16]. To structure a planning model comparable to that of DeepMind's MuZero in a partially observable environment, a belief function will translate the observations to a vector of belief states that will be discretized so as to be used as the observations of a MuZero-based machine learning algorithm [29]. The belief states are computed recursively from the previous belief state using Bayesian inference. Bayes rule is thought to capture the neurological and cognitive levels of reasoning [26]. Components of the planning, training, and action methods of the cognitive model will follow those of MuZero. The model could then be trained and act, in way parallel to that of MuZero, in a partially observable environment. Cognitive insights from a model structured in this form and additional considerations are discussed.","PeriodicalId":286724,"journal":{"name":"2022 Systems and Information Engineering Design Symposium (SIEDS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/sieds55548.2022.9799386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To survive in our unpredictable, evolving world, cognitive beings learn to make decisions with the limited knowledge of the world they process. Reflective of an individual's view of the world, a cognitive decision-making model is explored in a partially observable, stochastic environment. The cognitive model uses the Partially Observable Markov Decision Process problem formulation, which is a framework for neurological models and considered implementable in neural circuitry [26] [16]. To structure a planning model comparable to that of DeepMind's MuZero in a partially observable environment, a belief function will translate the observations to a vector of belief states that will be discretized so as to be used as the observations of a MuZero-based machine learning algorithm [29]. The belief states are computed recursively from the previous belief state using Bayesian inference. Bayes rule is thought to capture the neurological and cognitive levels of reasoning [26]. Components of the planning, training, and action methods of the cognitive model will follow those of MuZero. The model could then be trained and act, in way parallel to that of MuZero, in a partially observable environment. Cognitive insights from a model structured in this form and additional considerations are discussed.