一种用于药品标签快速识别和关键信息提取的双引擎融合光学字符识别方法

IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY
Siyu Wu, Feng Chang
{"title":"一种用于药品标签快速识别和关键信息提取的双引擎融合光学字符识别方法","authors":"Siyu Wu,&nbsp;Feng Chang","doi":"10.1016/j.aej.2025.05.037","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of smart healthcare and information-driven drug supervision, the automatic recognition and extraction of drug label information presents a significant challenge. Traditional Optical Character Recognition (OCR) methods often struggle with complex backgrounds, diverse fonts, and mixed languages. This paper proposes a dual-engine fusion OCR method combining EasyOCR and CnOCR to enhance recognition accuracy. The method integrates IoT-based data collection for real-time drug information monitoring, utilizing multi-threaded parallel recognition for efficiency and an image preprocessing pipeline (including tilt correction, deblurring, and contrast enhancement). Additionally, a field area positioning and template matching mechanism ensures the precise extraction of key information such as drug name, ingredients, specifications, and expiration date. The approach achieves over 92% accuracy across various real-world scenarios, demonstrating improved robustness and promising potential for digital drug management, as well as IoT-based drug traceability and supervision.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"128 ","pages":"Pages 1027-1036"},"PeriodicalIF":6.8000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A dual-engine fusion optical character recognition method for fast identification and key information extraction of drug labels\",\"authors\":\"Siyu Wu,&nbsp;Feng Chang\",\"doi\":\"10.1016/j.aej.2025.05.037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In the context of smart healthcare and information-driven drug supervision, the automatic recognition and extraction of drug label information presents a significant challenge. Traditional Optical Character Recognition (OCR) methods often struggle with complex backgrounds, diverse fonts, and mixed languages. This paper proposes a dual-engine fusion OCR method combining EasyOCR and CnOCR to enhance recognition accuracy. The method integrates IoT-based data collection for real-time drug information monitoring, utilizing multi-threaded parallel recognition for efficiency and an image preprocessing pipeline (including tilt correction, deblurring, and contrast enhancement). Additionally, a field area positioning and template matching mechanism ensures the precise extraction of key information such as drug name, ingredients, specifications, and expiration date. The approach achieves over 92% accuracy across various real-world scenarios, demonstrating improved robustness and promising potential for digital drug management, as well as IoT-based drug traceability and supervision.</div></div>\",\"PeriodicalId\":7484,\"journal\":{\"name\":\"alexandria engineering journal\",\"volume\":\"128 \",\"pages\":\"Pages 1027-1036\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"alexandria engineering journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S111001682500657X\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S111001682500657X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

在智能医疗和信息驱动的药品监管背景下,药品标签信息的自动识别和提取是一个重大挑战。传统的光学字符识别(OCR)方法通常难以处理复杂的背景、不同的字体和混合的语言。为了提高识别精度,提出了一种结合EasyOCR和cocr的双引擎融合OCR方法。该方法集成了基于物联网的实时药物信息监测数据采集,利用多线程并行识别提高效率和图像预处理管道(包括倾斜校正、去模糊和对比度增强)。此外,领域区域定位和模板匹配机制确保了药品名称、成分、规格、有效期等关键信息的精确提取。该方法在各种现实场景中实现了92%以上的准确性,展示了数字药物管理以及基于物联网的药物可追溯性和监管的鲁棒性和潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A dual-engine fusion optical character recognition method for fast identification and key information extraction of drug labels
In the context of smart healthcare and information-driven drug supervision, the automatic recognition and extraction of drug label information presents a significant challenge. Traditional Optical Character Recognition (OCR) methods often struggle with complex backgrounds, diverse fonts, and mixed languages. This paper proposes a dual-engine fusion OCR method combining EasyOCR and CnOCR to enhance recognition accuracy. The method integrates IoT-based data collection for real-time drug information monitoring, utilizing multi-threaded parallel recognition for efficiency and an image preprocessing pipeline (including tilt correction, deblurring, and contrast enhancement). Additionally, a field area positioning and template matching mechanism ensures the precise extraction of key information such as drug name, ingredients, specifications, and expiration date. The approach achieves over 92% accuracy across various real-world scenarios, demonstrating improved robustness and promising potential for digital drug management, as well as IoT-based drug traceability and supervision.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
alexandria engineering journal
alexandria engineering journal Engineering-General Engineering
CiteScore
11.20
自引率
4.40%
发文量
1015
审稿时长
43 days
期刊介绍: Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification: • Mechanical, Production, Marine and Textile Engineering • Electrical Engineering, Computer Science and Nuclear Engineering • Civil and Architecture Engineering • Chemical Engineering and Applied Sciences • Environmental Engineering
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信