{"title":"A dual-engine fusion optical character recognition method for fast identification and key information extraction of drug labels","authors":"Siyu Wu, Feng Chang","doi":"10.1016/j.aej.2025.05.037","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of smart healthcare and information-driven drug supervision, the automatic recognition and extraction of drug label information presents a significant challenge. Traditional Optical Character Recognition (OCR) methods often struggle with complex backgrounds, diverse fonts, and mixed languages. This paper proposes a dual-engine fusion OCR method combining EasyOCR and CnOCR to enhance recognition accuracy. The method integrates IoT-based data collection for real-time drug information monitoring, utilizing multi-threaded parallel recognition for efficiency and an image preprocessing pipeline (including tilt correction, deblurring, and contrast enhancement). Additionally, a field area positioning and template matching mechanism ensures the precise extraction of key information such as drug name, ingredients, specifications, and expiration date. The approach achieves over 92% accuracy across various real-world scenarios, demonstrating improved robustness and promising potential for digital drug management, as well as IoT-based drug traceability and supervision.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"128 ","pages":"Pages 1027-1036"},"PeriodicalIF":6.8000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S111001682500657X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In the context of smart healthcare and information-driven drug supervision, the automatic recognition and extraction of drug label information presents a significant challenge. Traditional Optical Character Recognition (OCR) methods often struggle with complex backgrounds, diverse fonts, and mixed languages. This paper proposes a dual-engine fusion OCR method combining EasyOCR and CnOCR to enhance recognition accuracy. The method integrates IoT-based data collection for real-time drug information monitoring, utilizing multi-threaded parallel recognition for efficiency and an image preprocessing pipeline (including tilt correction, deblurring, and contrast enhancement). Additionally, a field area positioning and template matching mechanism ensures the precise extraction of key information such as drug name, ingredients, specifications, and expiration date. The approach achieves over 92% accuracy across various real-world scenarios, demonstrating improved robustness and promising potential for digital drug management, as well as IoT-based drug traceability and supervision.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering