An LSPI Based Reinforcement Learning Approach to Enable Network Cooperation in Cognitive Wireless Sensor Network

2013 27th International Conference on Advanced Information Networking and Applications Workshops Pub Date : 2013-03-25 DOI:10.1109/WAINA.2013.8

M. Rovcanin, E. D. Poorter, I. Moerman, P. Demeester

{"title":"An LSPI Based Reinforcement Learning Approach to Enable Network Cooperation in Cognitive Wireless Sensor Network","authors":"M. Rovcanin, E. D. Poorter, I. Moerman, P. Demeester","doi":"10.1109/WAINA.2013.8","DOIUrl":null,"url":null,"abstract":"The number of wirelessly communicating devices increases every day, along with the number of communication standards and technologies that they use to exchange data. Arelatively new form of research is trying to find a way to make all these co-located devices not only capable of detecting each other's presence, but to go one step further - to make them cooperate. One recently proposed way to tackle this problem is to engage into cooperation by activating 'network services'(such as internet sharing, interference avoidance, etc.) that offer benefits for other co-located networks. This approach reduces the problem to the following research topic: how to determine which network services would be beneficial for all the cooperating networks. In this paper we analyze and propose a conceptual solution for this problem using the reinforcement learning technique known as the Least Square Policy Iteration (LSPI). The proposes solution uses a self-learning entity that negotiates between different independent and co-located networks. First, the reasoning entity uses self-learning techniques to determine which service configuration should be used to optimize the network performance of each single network. Afterwards, this performance is used as a reference point and LSPI is used to deduce if cooperating with other co-located networks can lead to even further performance improvements.","PeriodicalId":359251,"journal":{"name":"2013 27th International Conference on Advanced Information Networking and Applications Workshops","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 27th International Conference on Advanced Information Networking and Applications Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WAINA.2013.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The number of wirelessly communicating devices increases every day, along with the number of communication standards and technologies that they use to exchange data. Arelatively new form of research is trying to find a way to make all these co-located devices not only capable of detecting each other's presence, but to go one step further - to make them cooperate. One recently proposed way to tackle this problem is to engage into cooperation by activating 'network services'(such as internet sharing, interference avoidance, etc.) that offer benefits for other co-located networks. This approach reduces the problem to the following research topic: how to determine which network services would be beneficial for all the cooperating networks. In this paper we analyze and propose a conceptual solution for this problem using the reinforcement learning technique known as the Least Square Policy Iteration (LSPI). The proposes solution uses a self-learning entity that negotiates between different independent and co-located networks. First, the reasoning entity uses self-learning techniques to determine which service configuration should be used to optimize the network performance of each single network. Afterwards, this performance is used as a reference point and LSPI is used to deduce if cooperating with other co-located networks can lead to even further performance improvements.

查看原文本刊更多论文

认知无线传感器网络中基于LSPI的网络协作强化学习方法

无线通信设备的数量每天都在增加，它们用于交换数据的通信标准和技术的数量也在增加。一种相对较新的研究形式正在试图找到一种方法，使所有这些共存的设备不仅能够探测到彼此的存在，而且更进一步——使它们相互合作。最近提出的一种解决这个问题的方法是通过激活“网络服务”(如互联网共享、避免干扰等)来参与合作，为其他位于同一位置的网络提供好处。该方法将问题简化为以下研究主题:如何确定哪些网络服务对所有合作网络都是有益的。在本文中，我们使用被称为最小二乘策略迭代(LSPI)的强化学习技术分析并提出了这个问题的概念解决方案。提出的解决方案使用一个自学习实体，在不同的独立和共位于的网络之间进行协商。首先，推理实体使用自学习技术来确定应该使用哪个服务配置来优化每个单个网络的网络性能。然后，将此性能用作参考点，并使用LSPI来推断与其他同址网络合作是否可以进一步提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 27th International Conference on Advanced Information Networking and Applications Workshops

自引率

0.00%

发文量