Improving Optical Character Recognition(OCR) Accuracy using Multi-Layer Perceptron(MLP)

M. Krishnamoorthi, Karthi P Sri Ram, M. Sathyan, T. Vasanth
{"title":"Improving Optical Character Recognition(OCR) Accuracy using Multi-Layer Perceptron(MLP)","authors":"M. Krishnamoorthi, Karthi P Sri Ram, M. Sathyan, T. Vasanth","doi":"10.1109/ICOEI56765.2023.10125603","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) is a cutting-edge application that has been made possible due to advances in technology. Optical Character Recognition involves using deep learning algorithms such as Multi-Layer Perceptron and Support Vector Machine to create a system that can recognize characters in an image. The Optical Character Recognition system first segments each character individually as part of the pre-processing step, after which it post-processes the image to compare each character to the pre-processed data. In the modern world, Optical Character Recognition is a crucial technology used in many industries to shorten turnaround times while increasing accuracy. However, one of the challenges faced with Optical Character Recognition is that it takes longer to match exact results and produces results with lower precision and character misreadings. To address this challenge, the pre-processed data used by Optical Character Recognition systems now includes practically all fonts. This means that the Optical Character Recognition system has to match the characters with a much larger dataset, making the process more time-consuming and less accurate. To improve the accuracy and speed of Optical Character Recognition systems, several methods have been employed. For instance, post-processing techniques such as error correction algorithms can be used to detect and correct errors in the Optical Character Recognition output. Additionally, the use of deep learning techniques such as Conventional Neural Networks has been shown to improve the accuracy of Optical Character Recognition systems. By employing these methods, Optical Character Recognition can become an even more powerful tool that can shorten turnaround times while increasing accuracy.","PeriodicalId":168942,"journal":{"name":"2023 7th International Conference on Trends in Electronics and Informatics (ICOEI)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 7th International Conference on Trends in Electronics and Informatics (ICOEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOEI56765.2023.10125603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Optical Character Recognition (OCR) is a cutting-edge application that has been made possible due to advances in technology. Optical Character Recognition involves using deep learning algorithms such as Multi-Layer Perceptron and Support Vector Machine to create a system that can recognize characters in an image. The Optical Character Recognition system first segments each character individually as part of the pre-processing step, after which it post-processes the image to compare each character to the pre-processed data. In the modern world, Optical Character Recognition is a crucial technology used in many industries to shorten turnaround times while increasing accuracy. However, one of the challenges faced with Optical Character Recognition is that it takes longer to match exact results and produces results with lower precision and character misreadings. To address this challenge, the pre-processed data used by Optical Character Recognition systems now includes practically all fonts. This means that the Optical Character Recognition system has to match the characters with a much larger dataset, making the process more time-consuming and less accurate. To improve the accuracy and speed of Optical Character Recognition systems, several methods have been employed. For instance, post-processing techniques such as error correction algorithms can be used to detect and correct errors in the Optical Character Recognition output. Additionally, the use of deep learning techniques such as Conventional Neural Networks has been shown to improve the accuracy of Optical Character Recognition systems. By employing these methods, Optical Character Recognition can become an even more powerful tool that can shorten turnaround times while increasing accuracy.
利用多层感知器提高光学字符识别(OCR)精度
光学字符识别(OCR)是由于技术的进步而成为可能的前沿应用。光学字符识别涉及使用多层感知机和支持向量机等深度学习算法来创建一个可以识别图像中的字符的系统。光学字符识别系统首先将每个字符单独分割作为预处理步骤的一部分,然后对图像进行后处理,将每个字符与预处理数据进行比较。在现代世界,光学字符识别是一项关键技术,用于许多行业缩短周转时间,同时提高准确性。然而,光学字符识别面临的挑战之一是需要更长的时间来匹配精确的结果,并且产生精度较低和字符误读的结果。为了应对这一挑战,光学字符识别系统使用的预处理数据现在几乎包括所有字体。这意味着光学字符识别系统必须与更大的数据集匹配字符,这使得这个过程更耗时,更不准确。为了提高光学字符识别系统的精度和速度,采用了几种方法。例如,诸如纠错算法之类的后处理技术可用于检测和纠正光学字符识别输出中的错误。此外,使用深度学习技术,如传统神经网络,已被证明可以提高光学字符识别系统的准确性。通过采用这些方法,光学字符识别可以成为一个更强大的工具,可以缩短周转时间,同时提高准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信