数学方程与任意语言光学字符识别模型的比较研究

IF 3.2 Q3 Mathematics
Sofi.A. Francis, M. Sangeetha
{"title":"数学方程与任意语言光学字符识别模型的比较研究","authors":"Sofi.A. Francis,&nbsp;M. Sangeetha","doi":"10.1016/j.rico.2025.100532","DOIUrl":null,"url":null,"abstract":"<div><div>Optical Character Recognition[OCR] is a technology that makes use of artificial intelligence and machine learning to extract readable text from documents, images, tags or any other type of sources. It allows one to convert characters and text objects into digital data that can be easily processed, analyzed, and modified. OCR can be applied to various types of languages in both written and spoken format. It can process everything from hand-written documents to typed-out text, making it a highly versatile technology. OCR makes use of a variety of algorithms and methods to process images, and then produces readable output, whatever language it is used for. This technology has the potential to be used for industries, banking, the medical field, security, and document storage among others. OCR faces significant challenges in accurately predicting language and mathematical expressions due to variations in handwriting styles, complex layouts, and the ambiguity of symbols. In this research, we propose assessing the results of different models that have been trained to identify an improved OCR system. The best OCR model is With the help of a decision tree model chosen.</div></div>","PeriodicalId":34733,"journal":{"name":"Results in Control and Optimization","volume":"18 ","pages":"Article 100532"},"PeriodicalIF":3.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparison study on optical character recognition models in mathematical equations and in any language\",\"authors\":\"Sofi.A. Francis,&nbsp;M. Sangeetha\",\"doi\":\"10.1016/j.rico.2025.100532\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Optical Character Recognition[OCR] is a technology that makes use of artificial intelligence and machine learning to extract readable text from documents, images, tags or any other type of sources. It allows one to convert characters and text objects into digital data that can be easily processed, analyzed, and modified. OCR can be applied to various types of languages in both written and spoken format. It can process everything from hand-written documents to typed-out text, making it a highly versatile technology. OCR makes use of a variety of algorithms and methods to process images, and then produces readable output, whatever language it is used for. This technology has the potential to be used for industries, banking, the medical field, security, and document storage among others. OCR faces significant challenges in accurately predicting language and mathematical expressions due to variations in handwriting styles, complex layouts, and the ambiguity of symbols. In this research, we propose assessing the results of different models that have been trained to identify an improved OCR system. The best OCR model is With the help of a decision tree model chosen.</div></div>\",\"PeriodicalId\":34733,\"journal\":{\"name\":\"Results in Control and Optimization\",\"volume\":\"18 \",\"pages\":\"Article 100532\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Results in Control and Optimization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666720725000189\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Control and Optimization","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666720725000189","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

摘要

光学字符识别(OCR)是一种利用人工智能和机器学习从文档、图像、标签或任何其他类型的来源中提取可读文本的技术。它允许将字符和文本对象转换为易于处理、分析和修改的数字数据。OCR可以应用于各种语言的书面和口头格式。它可以处理从手写文档到打印文本的所有内容,使其成为一项高度通用的技术。OCR使用各种算法和方法来处理图像,然后产生可读的输出,无论它用于什么语言。这项技术有潜力用于工业、银行、医疗领域、安全和文档存储等领域。由于书写风格的变化、复杂的布局和符号的模糊性,OCR在准确预测语言和数学表达式方面面临着重大挑战。在这项研究中,我们建议评估不同模型的结果,这些模型已被训练以识别改进的OCR系统。最好的OCR模型是借助决策树模型选择的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A comparison study on optical character recognition models in mathematical equations and in any language
Optical Character Recognition[OCR] is a technology that makes use of artificial intelligence and machine learning to extract readable text from documents, images, tags or any other type of sources. It allows one to convert characters and text objects into digital data that can be easily processed, analyzed, and modified. OCR can be applied to various types of languages in both written and spoken format. It can process everything from hand-written documents to typed-out text, making it a highly versatile technology. OCR makes use of a variety of algorithms and methods to process images, and then produces readable output, whatever language it is used for. This technology has the potential to be used for industries, banking, the medical field, security, and document storage among others. OCR faces significant challenges in accurately predicting language and mathematical expressions due to variations in handwriting styles, complex layouts, and the ambiguity of symbols. In this research, we propose assessing the results of different models that have been trained to identify an improved OCR system. The best OCR model is With the help of a decision tree model chosen.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Results in Control and Optimization
Results in Control and Optimization Mathematics-Control and Optimization
CiteScore
3.00
自引率
0.00%
发文量
51
审稿时长
91 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信