数学方程与任意语言光学字符识别模型的比较研究

IF 3.2 Q3 Mathematics

Results in Control and Optimization Pub Date : 2025-02-01 DOI:10.1016/j.rico.2025.100532

Sofi.A. Francis, M. Sangeetha

{"title":"数学方程与任意语言光学字符识别模型的比较研究","authors":"Sofi.A. Francis, M. Sangeetha","doi":"10.1016/j.rico.2025.100532","DOIUrl":null,"url":null,"abstract":"<div><div>Optical Character Recognition[OCR] is a technology that makes use of artificial intelligence and machine learning to extract readable text from documents, images, tags or any other type of sources. It allows one to convert characters and text objects into digital data that can be easily processed, analyzed, and modified. OCR can be applied to various types of languages in both written and spoken format. It can process everything from hand-written documents to typed-out text, making it a highly versatile technology. OCR makes use of a variety of algorithms and methods to process images, and then produces readable output, whatever language it is used for. This technology has the potential to be used for industries, banking, the medical field, security, and document storage among others. OCR faces significant challenges in accurately predicting language and mathematical expressions due to variations in handwriting styles, complex layouts, and the ambiguity of symbols. In this research, we propose assessing the results of different models that have been trained to identify an improved OCR system. The best OCR model is With the help of a decision tree model chosen.</div></div>","PeriodicalId":34733,"journal":{"name":"Results in Control and Optimization","volume":"18 ","pages":"Article 100532"},"PeriodicalIF":3.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparison study on optical character recognition models in mathematical equations and in any language\",\"authors\":\"Sofi.A. Francis, M. Sangeetha\",\"doi\":\"10.1016/j.rico.2025.100532\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Optical Character Recognition[OCR] is a technology that makes use of artificial intelligence and machine learning to extract readable text from documents, images, tags or any other type of sources. It allows one to convert characters and text objects into digital data that can be easily processed, analyzed, and modified. OCR can be applied to various types of languages in both written and spoken format. It can process everything from hand-written documents to typed-out text, making it a highly versatile technology. OCR makes use of a variety of algorithms and methods to process images, and then produces readable output, whatever language it is used for. This technology has the potential to be used for industries, banking, the medical field, security, and document storage among others. OCR faces significant challenges in accurately predicting language and mathematical expressions due to variations in handwriting styles, complex layouts, and the ambiguity of symbols. In this research, we propose assessing the results of different models that have been trained to identify an improved OCR system. The best OCR model is With the help of a decision tree model chosen.</div></div>\",\"PeriodicalId\":34733,\"journal\":{\"name\":\"Results in Control and Optimization\",\"volume\":\"18 \",\"pages\":\"Article 100532\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Results in Control and Optimization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666720725000189\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Control and Optimization","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666720725000189","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

摘要

光学字符识别（OCR）是一种利用人工智能和机器学习从文档、图像、标签或任何其他类型的来源中提取可读文本的技术。它允许将字符和文本对象转换为易于处理、分析和修改的数字数据。OCR可以应用于各种语言的书面和口头格式。它可以处理从手写文档到打印文本的所有内容，使其成为一项高度通用的技术。OCR使用各种算法和方法来处理图像，然后产生可读的输出，无论它用于什么语言。这项技术有潜力用于工业、银行、医疗领域、安全和文档存储等领域。由于书写风格的变化、复杂的布局和符号的模糊性，OCR在准确预测语言和数学表达式方面面临着重大挑战。在这项研究中，我们建议评估不同模型的结果，这些模型已被训练以识别改进的OCR系统。最好的OCR模型是借助决策树模型选择的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A comparison study on optical character recognition models in mathematical equations and in any language

Optical Character Recognition[OCR] is a technology that makes use of artificial intelligence and machine learning to extract readable text from documents, images, tags or any other type of sources. It allows one to convert characters and text objects into digital data that can be easily processed, analyzed, and modified. OCR can be applied to various types of languages in both written and spoken format. It can process everything from hand-written documents to typed-out text, making it a highly versatile technology. OCR makes use of a variety of algorithms and methods to process images, and then produces readable output, whatever language it is used for. This technology has the potential to be used for industries, banking, the medical field, security, and document storage among others. OCR faces significant challenges in accurately predicting language and mathematical expressions due to variations in handwriting styles, complex layouts, and the ambiguity of symbols. In this research, we propose assessing the results of different models that have been trained to identify an improved OCR system. The best OCR model is With the help of a decision tree model chosen.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Results in Control and Optimization Mathematics-Control and Optimization

CiteScore

3.00

自引率

0.00%

发文量

审稿时长

91 days