赢家通吃的视觉手写字符识别神经网络

Thirteenth National Radio Science Conference. NRSC '96 Pub Date : 1996-03-19 DOI:10.1109/NRSC.1996.551115

M. Tayel, H. Shalaby, H. Saleh

{"title":"赢家通吃的视觉手写字符识别神经网络","authors":"M. Tayel, H. Shalaby, H. Saleh","doi":"10.1109/NRSC.1996.551115","DOIUrl":null,"url":null,"abstract":"A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.","PeriodicalId":127585,"journal":{"name":"Thirteenth National Radio Science Conference. NRSC '96","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Winner-take-all neural network for visual handwritten character recognition\",\"authors\":\"M. Tayel, H. Shalaby, H. Saleh\",\"doi\":\"10.1109/NRSC.1996.551115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.\",\"PeriodicalId\":127585,\"journal\":{\"name\":\"Thirteenth National Radio Science Conference. NRSC '96\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1996-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Thirteenth National Radio Science Conference. NRSC '96\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NRSC.1996.551115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Thirteenth National Radio Science Conference. NRSC '96","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NRSC.1996.551115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

一种视觉模式识别的神经网络模型，称为Necognitron，先前由福岛国彦(1988)提出。经过训练，它可以识别输入模式，而不受大小变化或位置移动的影响。福岛模型拥有几个吸引人的类别，因此它的应用不会局限于模式识别过程。如果对其细节进行适当修改，它可以应用于许多其他领域。该模型具有对变形模式的选择性注意、获得控制和完美回忆的能力，作为一种联想记忆模型，将有可能赋予数据压缩问题。本文提出了一种将Karhunen-Loeve (K-L)变换基技术嵌入到基于福岛的神经网络结构中的学习方案，以压缩阿拉伯字母模式数据并降低网络训练的输入维数。尽管如此，输入模式仍然可以通过一些局部特征进行识别和重构。该方案将连接权向量收敛到主特征向量，将阿拉伯字母模式集合中包含的最大信息保留为几个重要的局部特征，并减少了感知层输入之间存在的冗余。这一学习过程不仅提高了网络的数据压缩和重构效率，而且提高了网络的特征提取和阿拉伯字母识别能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Winner-take-all neural network for visual handwritten character recognition

A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Thirteenth National Radio Science Conference. NRSC '96

自引率

0.00%

发文量