基于高斯滤波的神经网络大写光学字符识别

E. Z. Astuti, C. A. Sari, Mutiara Syabilla, Hendra Sutrisno, E. H. Rachmawanto, Mohamed Doheir
{"title":"基于高斯滤波的神经网络大写光学字符识别","authors":"E. Z. Astuti, C. A. Sari, Mutiara Syabilla, Hendra Sutrisno, E. H. Rachmawanto, Mohamed Doheir","doi":"10.15294/sji.v10i3.43438","DOIUrl":null,"url":null,"abstract":"Purpose: As digital technology advances, society needs to convert physical text into digital text. There are now many methods available for doing this. One of them is OCR (Optical Character Recognition), which can scan images [1]–[4] containing writing and turn them into digital text, making it easier to copy written text from an image. Text recognition in images is complex due to variations in text size, color, font, orientation, background, and lighting conditions.Methods: The technique of text recognition or optical character recognition (OCR) in images can be done using several methods, one of which is a neural network or artificial neural network. The artificial neural network method can help a computer make intelligent decisions with limited human assistance. Intelligent decisions can be made because the neural network can learn and model the relationship between nonlinear and complex input and output data. In this research, the scaled conjugated gradient is applied for optimization. SCG is very effective in finding the minimum value of a complex function, but it takes longer than some other optimization algorithms.Result/Findings: The dataset used is an image with a size of 28 x 28 which is changed in dimension to 784 x 1. This research uses 4000 epochs and obtained the best validation result at epoch 3506 with a value of 0.0087446. Results: From the statistical test results, the effect of perceived usefulness on ease of use has the highest level of influence, obtaining a test value of 3.6. Furthermore, the effect of the attitude towards using on the behavioral intention to use has the lowest level of influence, which obtained a test value of 1.2.Novelty:  In this article, Gaussian filter is used as feature extraction to improve yield. Character detection results using a Gaussian filter are known to be almost 10% higher than those using only a neural network. The result with the Neural Network alone is 82.2%, while the Neural Network-Gaussian Filter produces 92.1%.","PeriodicalId":30781,"journal":{"name":"Scientific Journal of Informatics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Capital Optical Character Recognition Using Neural Network Based on Gaussian Filter\",\"authors\":\"E. Z. Astuti, C. A. Sari, Mutiara Syabilla, Hendra Sutrisno, E. H. Rachmawanto, Mohamed Doheir\",\"doi\":\"10.15294/sji.v10i3.43438\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: As digital technology advances, society needs to convert physical text into digital text. There are now many methods available for doing this. One of them is OCR (Optical Character Recognition), which can scan images [1]–[4] containing writing and turn them into digital text, making it easier to copy written text from an image. Text recognition in images is complex due to variations in text size, color, font, orientation, background, and lighting conditions.Methods: The technique of text recognition or optical character recognition (OCR) in images can be done using several methods, one of which is a neural network or artificial neural network. The artificial neural network method can help a computer make intelligent decisions with limited human assistance. Intelligent decisions can be made because the neural network can learn and model the relationship between nonlinear and complex input and output data. In this research, the scaled conjugated gradient is applied for optimization. SCG is very effective in finding the minimum value of a complex function, but it takes longer than some other optimization algorithms.Result/Findings: The dataset used is an image with a size of 28 x 28 which is changed in dimension to 784 x 1. This research uses 4000 epochs and obtained the best validation result at epoch 3506 with a value of 0.0087446. Results: From the statistical test results, the effect of perceived usefulness on ease of use has the highest level of influence, obtaining a test value of 3.6. Furthermore, the effect of the attitude towards using on the behavioral intention to use has the lowest level of influence, which obtained a test value of 1.2.Novelty:  In this article, Gaussian filter is used as feature extraction to improve yield. Character detection results using a Gaussian filter are known to be almost 10% higher than those using only a neural network. The result with the Neural Network alone is 82.2%, while the Neural Network-Gaussian Filter produces 92.1%.\",\"PeriodicalId\":30781,\"journal\":{\"name\":\"Scientific Journal of Informatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Journal of Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15294/sji.v10i3.43438\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Journal of Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15294/sji.v10i3.43438","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:随着数字技术的进步,社会需要将实体文本转化为数字文本。现在有很多方法可以做到这一点。其中之一是OCR(光学字符识别),它可以扫描包含文字的图像[1]-[4],并将其转换为数字文本,从而更容易从图像中复制书面文本。由于文本大小、颜色、字体、方向、背景和照明条件的变化,图像中的文本识别是复杂的。方法:图像中的文本识别或光学字符识别(OCR)技术可以使用几种方法来实现,其中一种方法是神经网络或人工神经网络。人工神经网络方法可以帮助计算机在有限的人工辅助下做出智能决策。由于神经网络可以学习和建模非线性和复杂输入和输出数据之间的关系,因此可以做出智能决策。在本研究中,采用缩放共轭梯度进行优化。SCG在寻找复杂函数的最小值方面非常有效,但比其他一些优化算法需要更长的时间。结果/发现:使用的数据集是一张大小为28 × 28的图像,尺寸更改为784 × 1。本研究使用4000个epoch,在3506 epoch得到了最好的验证结果,其值为0.0087446。结果:从统计检验结果来看,感知有用性对易用性的影响程度最高,检验值为3.6。使用态度对行为使用意向的影响程度最低,检验值为1.2。新颖之处:本文采用高斯滤波作为特征提取,提高了提取率。已知使用高斯滤波器的字符检测结果比仅使用神经网络的字符检测结果高出近10%。单独使用神经网络的结果为82.2%,而神经网络-高斯滤波器的结果为92.1%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Capital Optical Character Recognition Using Neural Network Based on Gaussian Filter
Purpose: As digital technology advances, society needs to convert physical text into digital text. There are now many methods available for doing this. One of them is OCR (Optical Character Recognition), which can scan images [1]–[4] containing writing and turn them into digital text, making it easier to copy written text from an image. Text recognition in images is complex due to variations in text size, color, font, orientation, background, and lighting conditions.Methods: The technique of text recognition or optical character recognition (OCR) in images can be done using several methods, one of which is a neural network or artificial neural network. The artificial neural network method can help a computer make intelligent decisions with limited human assistance. Intelligent decisions can be made because the neural network can learn and model the relationship between nonlinear and complex input and output data. In this research, the scaled conjugated gradient is applied for optimization. SCG is very effective in finding the minimum value of a complex function, but it takes longer than some other optimization algorithms.Result/Findings: The dataset used is an image with a size of 28 x 28 which is changed in dimension to 784 x 1. This research uses 4000 epochs and obtained the best validation result at epoch 3506 with a value of 0.0087446. Results: From the statistical test results, the effect of perceived usefulness on ease of use has the highest level of influence, obtaining a test value of 3.6. Furthermore, the effect of the attitude towards using on the behavioral intention to use has the lowest level of influence, which obtained a test value of 1.2.Novelty:  In this article, Gaussian filter is used as feature extraction to improve yield. Character detection results using a Gaussian filter are known to be almost 10% higher than those using only a neural network. The result with the Neural Network alone is 82.2%, while the Neural Network-Gaussian Filter produces 92.1%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
13
审稿时长
24 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信