生物驱动的字符识别神经网络分析

conference on Analysis of Neural Network Applications Pub Date : 1991-05-29 DOI:10.1145/106965.106967

M. Garris, R. A. Wilkinson, Charles L. Wilson

{"title":"生物驱动的字符识别神经网络分析","authors":"M. Garris, R. A. Wilkinson, Charles L. Wilson","doi":"10.1145/106965.106967","DOIUrl":null,"url":null,"abstract":"A neural network architecture for size-invariant and local shape-invariant digit recognition has been developed. The network is based on known biological data on the structure of vertebrate vision but is implemented using more conventional numerical methods for image feature extraction and pattern classification. The input receptor field structure of the network uses Gabor function feature selection. The classification section of the network uses back-propagation. Using these features as neurode inputs, an implementation of back-propagation on a serial machine achieved 100% accuracy when trained and tested on a single font size and style while classifying at a rate of 2 ms per character. Taking the same trained network, recognition greater than 99.9% accuracy was achieved when tested with digits of different font sizes. A network trained on multiple font styles when tested achieved greater than 99.9% accuracy and, when tested with digits of different font sizes, achieved greater than 99.8% accuracy. These networks, trained only with good quality prototypes, recognized images degraded with 15% random noise with an accuracy of 89%. In addition to raw recognition results, a study was conducted where activation distributions of correct responses from the network were compared against activation distributions of incorrect responses. By establishing a threshold between these two distributions, a reject mechanism was developed to minimize substitutional errors. This allowed substitutional errors on images degraded with 10% random noise to be reduced from 2.08% to 0.25%.","PeriodicalId":359315,"journal":{"name":"conference on Analysis of Neural Network Applications","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Analysis of a biologically motivated neural network for character recognition\",\"authors\":\"M. Garris, R. A. Wilkinson, Charles L. Wilson\",\"doi\":\"10.1145/106965.106967\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A neural network architecture for size-invariant and local shape-invariant digit recognition has been developed. The network is based on known biological data on the structure of vertebrate vision but is implemented using more conventional numerical methods for image feature extraction and pattern classification. The input receptor field structure of the network uses Gabor function feature selection. The classification section of the network uses back-propagation. Using these features as neurode inputs, an implementation of back-propagation on a serial machine achieved 100% accuracy when trained and tested on a single font size and style while classifying at a rate of 2 ms per character. Taking the same trained network, recognition greater than 99.9% accuracy was achieved when tested with digits of different font sizes. A network trained on multiple font styles when tested achieved greater than 99.9% accuracy and, when tested with digits of different font sizes, achieved greater than 99.8% accuracy. These networks, trained only with good quality prototypes, recognized images degraded with 15% random noise with an accuracy of 89%. In addition to raw recognition results, a study was conducted where activation distributions of correct responses from the network were compared against activation distributions of incorrect responses. By establishing a threshold between these two distributions, a reject mechanism was developed to minimize substitutional errors. This allowed substitutional errors on images degraded with 10% random noise to be reduced from 2.08% to 0.25%.\",\"PeriodicalId\":359315,\"journal\":{\"name\":\"conference on Analysis of Neural Network Applications\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1991-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"conference on Analysis of Neural Network Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/106965.106967\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"conference on Analysis of Neural Network Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/106965.106967","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

摘要

提出了一种用于尺寸不变和局部形状不变数字识别的神经网络体系结构。该网络基于已知的脊椎动物视觉结构的生物数据，但使用更传统的数值方法来实现图像特征提取和模式分类。网络的输入受体场结构采用Gabor函数特征选择。网络的分类部分使用反向传播。使用这些特征作为神经输入，在串行机器上的反向传播实现在以每个字符2毫秒的速度对单个字体大小和样式进行训练和测试时实现了100%的准确率。使用相同的训练网络，对不同字体大小的数字进行测试，识别准确率超过99.9%。在测试时，在多种字体样式上训练的网络达到了超过99.9%的准确率，在测试不同字体大小的数字时，达到了超过99.8%的准确率。这些网络仅使用高质量的原型进行训练，识别出带有15%随机噪声的图像，准确率达到89%。除了原始的识别结果外，还进行了一项研究，将来自网络的正确反应的激活分布与错误反应的激活分布进行比较。通过在这两个分布之间建立一个阈值，开发了一种拒绝机制来最小化替代误差。这使得10%随机噪声退化图像上的替换误差从2.08%降低到0.25%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Analysis of a biologically motivated neural network for character recognition

A neural network architecture for size-invariant and local shape-invariant digit recognition has been developed. The network is based on known biological data on the structure of vertebrate vision but is implemented using more conventional numerical methods for image feature extraction and pattern classification. The input receptor field structure of the network uses Gabor function feature selection. The classification section of the network uses back-propagation. Using these features as neurode inputs, an implementation of back-propagation on a serial machine achieved 100% accuracy when trained and tested on a single font size and style while classifying at a rate of 2 ms per character. Taking the same trained network, recognition greater than 99.9% accuracy was achieved when tested with digits of different font sizes. A network trained on multiple font styles when tested achieved greater than 99.9% accuracy and, when tested with digits of different font sizes, achieved greater than 99.8% accuracy. These networks, trained only with good quality prototypes, recognized images degraded with 15% random noise with an accuracy of 89%. In addition to raw recognition results, a study was conducted where activation distributions of correct responses from the network were compared against activation distributions of incorrect responses. By establishing a threshold between these two distributions, a reject mechanism was developed to minimize substitutional errors. This allowed substitutional errors on images degraded with 10% random noise to be reduced from 2.08% to 0.25%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

conference on Analysis of Neural Network Applications

自引率

0.00%

发文量