Google Tesseract:使用机器视觉的HDD / SSD标签上的光学字符识别(OCR

Vernon Estrada Bugayong, J. Villaverde, N. Linsangan
{"title":"Google Tesseract:使用机器视觉的HDD / SSD标签上的光学字符识别(OCR","authors":"Vernon Estrada Bugayong, J. Villaverde, N. Linsangan","doi":"10.1109/ICCAE55086.2022.9762440","DOIUrl":null,"url":null,"abstract":"This paper is designed to have an optical character recognition system capable of interpreting captured images of hard disk drive and solid-state drive labels with high accuracy. Manual checking of the disk capacity size and part number found on the labels is time consuming, more prone to errors and utilizes more manpower. Automating the inspection through optical character recognition using image pre-processing and machine vision contributes to an easier inspection process, better management of records and faster cycle time. The images captured using a vision camera went through different stages of image pre-processing via OpenCV-Python and recognition through Google Tesseract. Different categorical variables including exposure time and location of texts in a captured image were used to determine and improve the overall recognition accuracy. By improving the lighting condition through the addition of light sources, the developed OCR system was able to achieve a character recognition accuracy of 99.375%.","PeriodicalId":294641,"journal":{"name":"2022 14th International Conference on Computer and Automation Engineering (ICCAE)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Google Tesseract: Optical Character Recognition (OCR) on HDD / SSD Labels Using Machine Vision\",\"authors\":\"Vernon Estrada Bugayong, J. Villaverde, N. Linsangan\",\"doi\":\"10.1109/ICCAE55086.2022.9762440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper is designed to have an optical character recognition system capable of interpreting captured images of hard disk drive and solid-state drive labels with high accuracy. Manual checking of the disk capacity size and part number found on the labels is time consuming, more prone to errors and utilizes more manpower. Automating the inspection through optical character recognition using image pre-processing and machine vision contributes to an easier inspection process, better management of records and faster cycle time. The images captured using a vision camera went through different stages of image pre-processing via OpenCV-Python and recognition through Google Tesseract. Different categorical variables including exposure time and location of texts in a captured image were used to determine and improve the overall recognition accuracy. By improving the lighting condition through the addition of light sources, the developed OCR system was able to achieve a character recognition accuracy of 99.375%.\",\"PeriodicalId\":294641,\"journal\":{\"name\":\"2022 14th International Conference on Computer and Automation Engineering (ICCAE)\",\"volume\":\"86 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Computer and Automation Engineering (ICCAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCAE55086.2022.9762440\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Computer and Automation Engineering (ICCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAE55086.2022.9762440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

本文设计了一种光学字符识别系统,能够对硬盘驱动器和固态驱动器标签捕获的图像进行高精度的判读。手动检查标签上的磁盘容量大小和部件号耗时,更容易出错,并且需要更多的人力。通过使用图像预处理和机器视觉的光学字符识别自动化检查有助于更轻松的检查过程,更好的记录管理和更快的周期时间。使用视觉摄像机拍摄的图像通过OpenCV-Python进行图像预处理,并通过Google Tesseract进行识别。使用不同的分类变量,包括曝光时间和捕获图像中的文本位置,来确定和提高整体识别精度。通过增加光源改善光照条件,所开发的OCR系统字符识别准确率达到99.375%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Google Tesseract: Optical Character Recognition (OCR) on HDD / SSD Labels Using Machine Vision
This paper is designed to have an optical character recognition system capable of interpreting captured images of hard disk drive and solid-state drive labels with high accuracy. Manual checking of the disk capacity size and part number found on the labels is time consuming, more prone to errors and utilizes more manpower. Automating the inspection through optical character recognition using image pre-processing and machine vision contributes to an easier inspection process, better management of records and faster cycle time. The images captured using a vision camera went through different stages of image pre-processing via OpenCV-Python and recognition through Google Tesseract. Different categorical variables including exposure time and location of texts in a captured image were used to determine and improve the overall recognition accuracy. By improving the lighting condition through the addition of light sources, the developed OCR system was able to achieve a character recognition accuracy of 99.375%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信