An LSPI Based Reinforcement Learning Approach to Enable Network Cooperation in Cognitive Wireless Sensor Network

M. Rovcanin, E. D. Poorter, I. Moerman, P. Demeester
{"title":"An LSPI Based Reinforcement Learning Approach to Enable Network Cooperation in Cognitive Wireless Sensor Network","authors":"M. Rovcanin, E. D. Poorter, I. Moerman, P. Demeester","doi":"10.1109/WAINA.2013.8","DOIUrl":null,"url":null,"abstract":"The number of wirelessly communicating devices increases every day, along with the number of communication standards and technologies that they use to exchange data. Arelatively new form of research is trying to find a way to make all these co-located devices not only capable of detecting each other's presence, but to go one step further - to make them cooperate. One recently proposed way to tackle this problem is to engage into cooperation by activating 'network services'(such as internet sharing, interference avoidance, etc.) that offer benefits for other co-located networks. This approach reduces the problem to the following research topic: how to determine which network services would be beneficial for all the cooperating networks. In this paper we analyze and propose a conceptual solution for this problem using the reinforcement learning technique known as the Least Square Policy Iteration (LSPI). The proposes solution uses a self-learning entity that negotiates between different independent and co-located networks. First, the reasoning entity uses self-learning techniques to determine which service configuration should be used to optimize the network performance of each single network. Afterwards, this performance is used as a reference point and LSPI is used to deduce if cooperating with other co-located networks can lead to even further performance improvements.","PeriodicalId":359251,"journal":{"name":"2013 27th International Conference on Advanced Information Networking and Applications Workshops","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 27th International Conference on Advanced Information Networking and Applications Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WAINA.2013.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The number of wirelessly communicating devices increases every day, along with the number of communication standards and technologies that they use to exchange data. Arelatively new form of research is trying to find a way to make all these co-located devices not only capable of detecting each other's presence, but to go one step further - to make them cooperate. One recently proposed way to tackle this problem is to engage into cooperation by activating 'network services'(such as internet sharing, interference avoidance, etc.) that offer benefits for other co-located networks. This approach reduces the problem to the following research topic: how to determine which network services would be beneficial for all the cooperating networks. In this paper we analyze and propose a conceptual solution for this problem using the reinforcement learning technique known as the Least Square Policy Iteration (LSPI). The proposes solution uses a self-learning entity that negotiates between different independent and co-located networks. First, the reasoning entity uses self-learning techniques to determine which service configuration should be used to optimize the network performance of each single network. Afterwards, this performance is used as a reference point and LSPI is used to deduce if cooperating with other co-located networks can lead to even further performance improvements.
认知无线传感器网络中基于LSPI的网络协作强化学习方法
无线通信设备的数量每天都在增加,它们用于交换数据的通信标准和技术的数量也在增加。一种相对较新的研究形式正在试图找到一种方法,使所有这些共存的设备不仅能够探测到彼此的存在,而且更进一步——使它们相互合作。最近提出的一种解决这个问题的方法是通过激活“网络服务”(如互联网共享、避免干扰等)来参与合作,为其他位于同一位置的网络提供好处。该方法将问题简化为以下研究主题:如何确定哪些网络服务对所有合作网络都是有益的。在本文中,我们使用被称为最小二乘策略迭代(LSPI)的强化学习技术分析并提出了这个问题的概念解决方案。提出的解决方案使用一个自学习实体,在不同的独立和共位于的网络之间进行协商。首先,推理实体使用自学习技术来确定应该使用哪个服务配置来优化每个单个网络的网络性能。然后,将此性能用作参考点,并使用LSPI来推断与其他同址网络合作是否可以进一步提高性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信