{"title":"利用无监督无权重神经网络作为自主状态分类器的强化学习算法","authors":"Yusman Yusof, Asri H. Mansor, Adizul Ahmad","doi":"10.1109/CSPA.2017.8064963","DOIUrl":null,"url":null,"abstract":"An implementation of reinforcement learning algorithm in an autonomous system requires knowledge expert to specify anticipated states, actions and rewards; and the algorithm will autonomously discover a near optimal behaviour for the system through trial-and-error interactions with its environment. The information on anticipated states are usually extracted from data streams and pre-programmed based on the knowledge expert interpretation of the data thus making the reinforcement learning algorithm rigid to only handles anticipated circumstances and the system will not be able to optimize. As an alternative, in this paper we explore the use of AUTOWiSARD, an unsupervised weightless neural network which will autonomously classify the states based on sensor information and then used by Q-learning, a reinforcement learning algorithm in order find near optimal behavior. The implementation will be demonstrated in an autonomous mobile robot simulation and the outcome will be presented and discussed.","PeriodicalId":445522,"journal":{"name":"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Utilizing unsupervised weightless neural network as autonomous states classifier in reinforcement learning algorithm\",\"authors\":\"Yusman Yusof, Asri H. Mansor, Adizul Ahmad\",\"doi\":\"10.1109/CSPA.2017.8064963\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An implementation of reinforcement learning algorithm in an autonomous system requires knowledge expert to specify anticipated states, actions and rewards; and the algorithm will autonomously discover a near optimal behaviour for the system through trial-and-error interactions with its environment. The information on anticipated states are usually extracted from data streams and pre-programmed based on the knowledge expert interpretation of the data thus making the reinforcement learning algorithm rigid to only handles anticipated circumstances and the system will not be able to optimize. As an alternative, in this paper we explore the use of AUTOWiSARD, an unsupervised weightless neural network which will autonomously classify the states based on sensor information and then used by Q-learning, a reinforcement learning algorithm in order find near optimal behavior. The implementation will be demonstrated in an autonomous mobile robot simulation and the outcome will be presented and discussed.\",\"PeriodicalId\":445522,\"journal\":{\"name\":\"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPA.2017.8064963\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA.2017.8064963","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Utilizing unsupervised weightless neural network as autonomous states classifier in reinforcement learning algorithm
An implementation of reinforcement learning algorithm in an autonomous system requires knowledge expert to specify anticipated states, actions and rewards; and the algorithm will autonomously discover a near optimal behaviour for the system through trial-and-error interactions with its environment. The information on anticipated states are usually extracted from data streams and pre-programmed based on the knowledge expert interpretation of the data thus making the reinforcement learning algorithm rigid to only handles anticipated circumstances and the system will not be able to optimize. As an alternative, in this paper we explore the use of AUTOWiSARD, an unsupervised weightless neural network which will autonomously classify the states based on sensor information and then used by Q-learning, a reinforcement learning algorithm in order find near optimal behavior. The implementation will be demonstrated in an autonomous mobile robot simulation and the outcome will be presented and discussed.