{"title":"赢家通吃的视觉手写字符识别神经网络","authors":"M. Tayel, H. Shalaby, H. Saleh","doi":"10.1109/NRSC.1996.551115","DOIUrl":null,"url":null,"abstract":"A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.","PeriodicalId":127585,"journal":{"name":"Thirteenth National Radio Science Conference. NRSC '96","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Winner-take-all neural network for visual handwritten character recognition\",\"authors\":\"M. Tayel, H. Shalaby, H. Saleh\",\"doi\":\"10.1109/NRSC.1996.551115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.\",\"PeriodicalId\":127585,\"journal\":{\"name\":\"Thirteenth National Radio Science Conference. NRSC '96\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1996-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Thirteenth National Radio Science Conference. NRSC '96\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NRSC.1996.551115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Thirteenth National Radio Science Conference. NRSC '96","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NRSC.1996.551115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Winner-take-all neural network for visual handwritten character recognition
A neural network model of visual pattern recognition, called the Necognitron, was previously proposed by Kunihiko Fukushima (1988). After training, it can recognize the input pattern without being affected by a change in size or a shift in position. The Fukushima model possesses several appealing categories so that its application would not be restricted to the process of pattern recognition. It can be applied to many other fields if its details are modified properly. With the model's ability of selective attention, gain control and perfect recall for deformed patterns, as a model of associative memory, it would be possible to endow the data compression problem. This article proposes a learning scheme that embeds the Karhunen-Loeve (K-L) transform basis technique into the structure of a Fukushima based neural network to compress the Arabic alphabetic patterns data as well as to reduce the input dimensionality for the network training. Despite that, the input pattern can still be recognized and reconstructed from a few local features. The proposed scheme converges the connections weight vectors to the principal eigenvectors, that retains the maximum information contained in the Arabic alphabetic patterns set into a few significant local features and reduces the redundancies present among the inputs to the perceptual layer. The learning process not only leads to efficient data compression and reconstruction but also enhances the network's ability of feature extraction and Arabic alphabetic recognition.