神经网络能像学生一样识别模式吗?

2022 20th International Conference on Emerging eLearning Technologies and Applications (ICETA) Pub Date : 2022-10-20 DOI:10.1109/ICETA57911.2022.9974725

Jaroslav Kopčan, M. Klimo, O. Škvarek

{"title":"神经网络能像学生一样识别模式吗?","authors":"Jaroslav Kopčan, M. Klimo, O. Škvarek","doi":"10.1109/ICETA57911.2022.9974725","DOIUrl":null,"url":null,"abstract":"We compare the similarity of humans and one of the current convolutional deep neural network (CNN) LeNet-5 on handwritten digits (MNIST database) recognition under stricter formulated recognition tasks and image degradations caused by a generative adversarial network (GAN). First, a conditional GAN neural network was trained on the MNIST database and generator seeds with normal probability distribution were used. The images for recognition were obtained by the generator that is part of this trained GAN network. Different image degradations were obtained by scaling the length of the input random vector for the generator. Second, as a criterion for the recognition task, we asked the question: Did a person write this digit (without thinning, thickening, protrusions or interruption of the line, no additional image artefacts which a human cannot write with a smooth stroke of the pen)? Human is more robust in most cases of recognizing degraded images. This is since a person who learns to recognize images throughout his life can better comply with the decision criterion than a neural network, which has a much simpler structure than the human brain and learns to recognize only on a limited training set. On the other hand, if we focus recognition criteria on the handwriting of digits, human is more precise, and CNN generalizes the recognised digit more behind this limitation. Our similarity graphs between CNNs and humans can be further used to find the boundaries of the handwritten digits domain and to create a system for recognizing anomalous images.","PeriodicalId":151344,"journal":{"name":"2022 20th International Conference on Emerging eLearning Technologies and Applications (ICETA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Do Neural Networks Recognize Patterns as well as Students?\",\"authors\":\"Jaroslav Kopčan, M. Klimo, O. Škvarek\",\"doi\":\"10.1109/ICETA57911.2022.9974725\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We compare the similarity of humans and one of the current convolutional deep neural network (CNN) LeNet-5 on handwritten digits (MNIST database) recognition under stricter formulated recognition tasks and image degradations caused by a generative adversarial network (GAN). First, a conditional GAN neural network was trained on the MNIST database and generator seeds with normal probability distribution were used. The images for recognition were obtained by the generator that is part of this trained GAN network. Different image degradations were obtained by scaling the length of the input random vector for the generator. Second, as a criterion for the recognition task, we asked the question: Did a person write this digit (without thinning, thickening, protrusions or interruption of the line, no additional image artefacts which a human cannot write with a smooth stroke of the pen)? Human is more robust in most cases of recognizing degraded images. This is since a person who learns to recognize images throughout his life can better comply with the decision criterion than a neural network, which has a much simpler structure than the human brain and learns to recognize only on a limited training set. On the other hand, if we focus recognition criteria on the handwriting of digits, human is more precise, and CNN generalizes the recognised digit more behind this limitation. Our similarity graphs between CNNs and humans can be further used to find the boundaries of the handwritten digits domain and to create a system for recognizing anomalous images.\",\"PeriodicalId\":151344,\"journal\":{\"name\":\"2022 20th International Conference on Emerging eLearning Technologies and Applications (ICETA)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 20th International Conference on Emerging eLearning Technologies and Applications (ICETA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICETA57911.2022.9974725\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 20th International Conference on Emerging eLearning Technologies and Applications (ICETA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETA57911.2022.9974725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们比较了当前卷积深度神经网络(CNN) LeNet-5在手写体数字(MNIST数据库)识别上的相似性，以及生成对抗网络(GAN)导致的图像退化。首先，在MNIST数据库上训练条件GAN神经网络，使用正态概率分布的生成器种子;用于识别的图像由生成器获得，该生成器是该训练GAN网络的一部分。通过缩放生成器的输入随机向量的长度来获得不同的图像退化。其次，作为识别任务的一个标准，我们提出了这样一个问题:一个人是否写了这个数字(没有变薄、变粗、突出或中断的线条，没有人类无法用流畅的笔触书写的额外图像伪影)?在大多数情况下，人类对退化图像的识别更加稳健。这是因为一个终生学习识别图像的人比神经网络更符合决策标准，而神经网络的结构比人脑简单得多，只能在有限的训练集上学习识别。另一方面，如果我们将识别标准集中在数字的笔迹上，人类更精确，CNN在这一限制下更能概括识别的数字。我们在cnn和人类之间的相似图可以进一步用来找到手写数字域的边界，并创建一个识别异常图像的系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Do Neural Networks Recognize Patterns as well as Students?

We compare the similarity of humans and one of the current convolutional deep neural network (CNN) LeNet-5 on handwritten digits (MNIST database) recognition under stricter formulated recognition tasks and image degradations caused by a generative adversarial network (GAN). First, a conditional GAN neural network was trained on the MNIST database and generator seeds with normal probability distribution were used. The images for recognition were obtained by the generator that is part of this trained GAN network. Different image degradations were obtained by scaling the length of the input random vector for the generator. Second, as a criterion for the recognition task, we asked the question: Did a person write this digit (without thinning, thickening, protrusions or interruption of the line, no additional image artefacts which a human cannot write with a smooth stroke of the pen)? Human is more robust in most cases of recognizing degraded images. This is since a person who learns to recognize images throughout his life can better comply with the decision criterion than a neural network, which has a much simpler structure than the human brain and learns to recognize only on a limited training set. On the other hand, if we focus recognition criteria on the handwriting of digits, human is more precise, and CNN generalizes the recognised digit more behind this limitation. Our similarity graphs between CNNs and humans can be further used to find the boundaries of the handwritten digits domain and to create a system for recognizing anomalous images.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 20th International Conference on Emerging eLearning Technologies and Applications (ICETA)

自引率

0.00%

发文量