利用无监督无权重神经网络作为自主状态分类器的强化学习算法

Yusman Yusof, Asri H. Mansor, Adizul Ahmad
{"title":"利用无监督无权重神经网络作为自主状态分类器的强化学习算法","authors":"Yusman Yusof, Asri H. Mansor, Adizul Ahmad","doi":"10.1109/CSPA.2017.8064963","DOIUrl":null,"url":null,"abstract":"An implementation of reinforcement learning algorithm in an autonomous system requires knowledge expert to specify anticipated states, actions and rewards; and the algorithm will autonomously discover a near optimal behaviour for the system through trial-and-error interactions with its environment. The information on anticipated states are usually extracted from data streams and pre-programmed based on the knowledge expert interpretation of the data thus making the reinforcement learning algorithm rigid to only handles anticipated circumstances and the system will not be able to optimize. As an alternative, in this paper we explore the use of AUTOWiSARD, an unsupervised weightless neural network which will autonomously classify the states based on sensor information and then used by Q-learning, a reinforcement learning algorithm in order find near optimal behavior. The implementation will be demonstrated in an autonomous mobile robot simulation and the outcome will be presented and discussed.","PeriodicalId":445522,"journal":{"name":"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Utilizing unsupervised weightless neural network as autonomous states classifier in reinforcement learning algorithm\",\"authors\":\"Yusman Yusof, Asri H. Mansor, Adizul Ahmad\",\"doi\":\"10.1109/CSPA.2017.8064963\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An implementation of reinforcement learning algorithm in an autonomous system requires knowledge expert to specify anticipated states, actions and rewards; and the algorithm will autonomously discover a near optimal behaviour for the system through trial-and-error interactions with its environment. The information on anticipated states are usually extracted from data streams and pre-programmed based on the knowledge expert interpretation of the data thus making the reinforcement learning algorithm rigid to only handles anticipated circumstances and the system will not be able to optimize. As an alternative, in this paper we explore the use of AUTOWiSARD, an unsupervised weightless neural network which will autonomously classify the states based on sensor information and then used by Q-learning, a reinforcement learning algorithm in order find near optimal behavior. The implementation will be demonstrated in an autonomous mobile robot simulation and the outcome will be presented and discussed.\",\"PeriodicalId\":445522,\"journal\":{\"name\":\"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPA.2017.8064963\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA.2017.8064963","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在自治系统中实现强化学习算法需要知识专家指定预期状态、动作和奖励;该算法将通过与环境的反复试验,自动发现系统的接近最优行为。关于预期状态的信息通常是从数据流中提取的,并基于知识专家对数据的解释进行预编程,这使得强化学习算法只局限于处理预期情况,系统将无法优化。作为替代方案,在本文中,我们探索了AUTOWiSARD的使用,AUTOWiSARD是一种无监督的无权重神经网络,它将根据传感器信息自主分类状态,然后由Q-learning(一种强化学习算法)使用,以找到接近最优的行为。该实现将在自主移动机器人仿真中进行演示,并将展示和讨论结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Utilizing unsupervised weightless neural network as autonomous states classifier in reinforcement learning algorithm
An implementation of reinforcement learning algorithm in an autonomous system requires knowledge expert to specify anticipated states, actions and rewards; and the algorithm will autonomously discover a near optimal behaviour for the system through trial-and-error interactions with its environment. The information on anticipated states are usually extracted from data streams and pre-programmed based on the knowledge expert interpretation of the data thus making the reinforcement learning algorithm rigid to only handles anticipated circumstances and the system will not be able to optimize. As an alternative, in this paper we explore the use of AUTOWiSARD, an unsupervised weightless neural network which will autonomously classify the states based on sensor information and then used by Q-learning, a reinforcement learning algorithm in order find near optimal behavior. The implementation will be demonstrated in an autonomous mobile robot simulation and the outcome will be presented and discussed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信