{"title":"递归神经网络预测问题的状态形成和转移限制研究","authors":"A. Kennedy, C. MacNish","doi":"10.1145/1378279.1378303","DOIUrl":null,"url":null,"abstract":"Recurrent neural networks are able to store information about previous as well as current inputs. This \"memory\" allows them to solve temporal problems such as language recognition and sequence prediction, and provide memory elements for larger cognitive networks. It is generally understood that there is an (increasing) relationship between the number of nodes (and connections) in a network, the capabilities of the network, and the amount of training required. However the specifics of this relationship are less well understood. In particular, given that the state of a recurrent network is encoded as a real-valued vector of activation levels, even for small networks there are infinitely many states to choose from. What then determines, or limits, the capabilities of the network? \n \nIn this paper we use dynamical systems techniques to examine this question in regard to temporal lag. We show that for simple delay problems that the network is unable to solve, the system is able to learn sufficient state representations, but appears to be unable to create transitions that allow it to access those states in the correct order (or equivalently, is unable to arrange its states to suit the transitions that it can support).","PeriodicalId":136130,"journal":{"name":"Australasian Computer Science Conference","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An investigation of the state formation and transition limitations for prediction problems in recurrent neural networks\",\"authors\":\"A. Kennedy, C. MacNish\",\"doi\":\"10.1145/1378279.1378303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recurrent neural networks are able to store information about previous as well as current inputs. This \\\"memory\\\" allows them to solve temporal problems such as language recognition and sequence prediction, and provide memory elements for larger cognitive networks. It is generally understood that there is an (increasing) relationship between the number of nodes (and connections) in a network, the capabilities of the network, and the amount of training required. However the specifics of this relationship are less well understood. In particular, given that the state of a recurrent network is encoded as a real-valued vector of activation levels, even for small networks there are infinitely many states to choose from. What then determines, or limits, the capabilities of the network? \\n \\nIn this paper we use dynamical systems techniques to examine this question in regard to temporal lag. 
We show that for simple delay problems that the network is unable to solve, the system is able to learn sufficient state representations, but appears to be unable to create transitions that allow it to access those states in the correct order (or equivalently, is unable to arrange its states to suit the transitions that it can support).\",\"PeriodicalId\":136130,\"journal\":{\"name\":\"Australasian Computer Science Conference\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Australasian Computer Science Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1378279.1378303\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Australasian Computer Science Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1378279.1378303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An investigation of the state formation and transition limitations for prediction problems in recurrent neural networks
Recurrent neural networks are able to store information about previous as well as current inputs. This "memory" allows them to solve temporal problems such as language recognition and sequence prediction, and to provide memory elements for larger cognitive networks. It is generally understood that there is an (increasing) relationship between the number of nodes (and connections) in a network, the capabilities of the network, and the amount of training required. However, the specifics of this relationship are less well understood. In particular, given that the state of a recurrent network is encoded as a real-valued vector of activation levels, even for small networks there are infinitely many states to choose from. What then determines, or limits, the capabilities of the network?
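The point that a recurrent network's state is a real-valued activation vector can be made concrete with a minimal sketch. The code below is illustrative only and assumes a generic Elman-style recurrence with arbitrarily chosen sizes and weights; it is not the specific architecture studied in the paper.

```python
# Minimal sketch (assumed Elman-style recurrence, not the paper's network):
# the network's "state" is simply a real-valued vector of hidden activation
# levels carried forward from one time step to the next.
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hidden = 1, 4                    # sizes chosen only for illustration

W_in = rng.normal(scale=0.5, size=(n_hidden, n_in))        # input weights
W_rec = rng.normal(scale=0.5, size=(n_hidden, n_hidden))   # recurrent weights
b = np.zeros(n_hidden)

def step(state, x):
    """One state transition: the new activations depend on the current input
    and on the previous state (the network's memory of earlier inputs)."""
    return np.tanh(W_in @ x + W_rec @ state + b)

state = np.zeros(n_hidden)               # initial state
for x_t in ([1.0], [0.0], [1.0]):        # a short input sequence
    state = step(state, np.array(x_t))
    print(state)                         # a point in [-1, 1]^4: a continuum of possible states
```

Because the state lives in a continuous space, the number of distinct states is not the binding constraint; the question the abstract poses is what does constrain the network.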
In this paper we use dynamical systems techniques to examine this question in regard to temporal lag. We show that for simple delay problems that the network is unable to solve, the system is able to learn sufficient state representations, but appears to be unable to create transitions that allow it to access those states in the correct order (or equivalently, is unable to arrange its states to suit the transitions that it can support).
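As a hedged illustration of the kind of delay problem referred to above (the exact task definitions and the dynamical-systems analysis are in the paper itself, not reproduced here), a temporal-lag prediction task can be set up so that the target at each step is the input seen a fixed number of steps earlier. The network must therefore hold that value in its state and release it in the correct order.

```python
# Hypothetical delay (temporal-lag) task, for illustration only:
# targets[t] = inputs[t - lag], so solving it requires both storing the
# earlier input in the state and transitioning through states in order.
import numpy as np

def make_delay_task(length=20, lag=3, seed=1):
    """Return (inputs, targets) where targets[t] = inputs[t - lag] (0 before that)."""
    rng = np.random.default_rng(seed)
    inputs = rng.integers(0, 2, size=length).astype(float)  # random binary stream
    targets = np.zeros(length)
    targets[lag:] = inputs[:-lag]                           # shifted copy: the delayed signal
    return inputs, targets

x, y = make_delay_task()
print(x[:8])
print(y[:8])  # same sequence, delayed by `lag` steps
```

On tasks of this general form, the abstract's finding is that the trained system can represent the required states yet fails to form transitions that visit them in the right order.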