赢家通吃的视觉手写字符识别神经网络

M. Tayel, H. Shalaby, H. Saleh
{"title":"赢家通吃的视觉手写字符识别神经网络","authors":"M. Tayel, H. Shalaby, H. Saleh","doi":"10.1109/NRSC.1996.551115","DOIUrl":null,"url":null,"abstract":"A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.","PeriodicalId":127585,"journal":{"name":"Thirteenth National Radio Science Conference. NRSC '96","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1996-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Winner-take-all neural network for visual handwritten character recognition\",\"authors\":\"M. Tayel, H. Shalaby, H. Saleh\",\"doi\":\"10.1109/NRSC.1996.551115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.\",\"PeriodicalId\":127585,\"journal\":{\"name\":\"Thirteenth National Radio Science Conference. NRSC '96\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1996-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Thirteenth National Radio Science Conference. NRSC '96\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NRSC.1996.551115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Thirteenth National Radio Science Conference. NRSC '96","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NRSC.1996.551115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

一种视觉模式识别的神经网络模型,称为Necognitron,先前由福岛国彦(1988)提出。经过训练,它可以识别输入模式,而不受大小变化或位置移动的影响。福岛模型拥有几个吸引人的类别,因此它的应用不会局限于模式识别过程。如果对其细节进行适当修改,它可以应用于许多其他领域。该模型具有对变形模式的选择性注意、获得控制和完美回忆的能力,作为一种联想记忆模型,将有可能赋予数据压缩问题。本文提出了一种将Karhunen-Loeve (K-L)变换基技术嵌入到基于福岛的神经网络结构中的学习方案,以压缩阿拉伯字母模式数据并降低网络训练的输入维数。尽管如此,输入模式仍然可以通过一些局部特征进行识别和重构。该方案将连接权向量收敛到主特征向量,将阿拉伯字母模式集合中包含的最大信息保留为几个重要的局部特征,并减少了感知层输入之间存在的冗余。这一学习过程不仅提高了网络的数据压缩和重构效率,而且提高了网络的特征提取和阿拉伯字母识别能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Winner-take-all neural network for visual handwritten character recognition
A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信