Doanh C. Bui, Dung Truong, Nguyen D. Vo, Khang Nguyen
{"title":"MC-OCR Challenge 2021: Deep Learning Approach for Vietnamese Receipts OCR","authors":"Doanh C. Bui, Dung Truong, Nguyen D. Vo, Khang Nguyen","doi":"10.1109/RIVF51545.2021.9642128","DOIUrl":null,"url":null,"abstract":"Receipts OCR has made a significant improvement on accounting, which has attracted much attention of the research community in the field of computer vision as well as natural language processing. In this paper, we solve the problem of extracting pieces of information on Vietnamese receipts including seller, address, timestamp, and total cost. We divided this into two problems: detecting locations of information and using an OCR model to recognize texts. In this paper, we propose a pipeline that employs Faster R-CNN as an information location detector and training a Transformer model for text recognition. Through experiments, we achieved CER 32.19%, which is 9.65% higher than previous method CRNN, while pointing out the remaining statements and challenges of this problem.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"35 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RIVF51545.2021.9642128","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Receipts OCR has made a significant improvement on accounting, which has attracted much attention of the research community in the field of computer vision as well as natural language processing. In this paper, we solve the problem of extracting pieces of information on Vietnamese receipts including seller, address, timestamp, and total cost. We divided this into two problems: detecting locations of information and using an OCR model to recognize texts. In this paper, we propose a pipeline that employs Faster R-CNN as an information location detector and training a Transformer model for text recognition. Through experiments, we achieved CER 32.19%, which is 9.65% higher than previous method CRNN, while pointing out the remaining statements and challenges of this problem.