An End-to-End Optical Character Recognition Pipeline for Indonesian Identity Card
Andrea Chandra, Ruben Stefanus
2021 9th International Conference on Information and Communication Technology (ICoICT), 2021-08-03
DOI: 10.1109/ICoICT52021.2021.9527436 (https://doi.org/10.1109/ICoICT52021.2021.9527436)
Citations: 0
Abstract
Optical Character Recognition (OCR) has been studied extensively, yet extracting specific pieces of information from document images remains a challenge. This study aims to build an end-to-end OCR pipeline for the Indonesian identity card. The final pipeline takes a deep learning approach, consisting of Faster R-CNN for text detection, YOLOv5 for character detection, and a Support Vector Machine for character recognition. The proposed pipeline achieved strong results on both the identity number and the full name fields. It offers an effective and efficient tool for automatic form filling and identity verification.
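The three stages named in the abstract can be read as a simple composition: find the text regions, find the characters inside each region, then classify each character. The sketch below shows only that control flow, under stated assumptions. It is not the authors' implementation: the names `FieldRegion`, `crop`, and `read_card` are hypothetical, the three stage functions are passed in as plain callables standing in for Faster R-CNN, YOLOv5, and the SVM, and characters are assumed to sit on a single line so that sorting boxes left to right recovers the reading order.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

Box = Tuple[int, int, int, int]    # (x_min, y_min, x_max, y_max), max exclusive
Image = Sequence[Sequence[int]]    # grayscale image as rows of pixel values


@dataclass
class FieldRegion:
    """A detected text region and the field it belongs to (hypothetical type)."""
    label: str   # e.g. "identity_number" or "full_name" (assumed labels)
    box: Box


def crop(image: Image, box: Box) -> List[List[int]]:
    """Return the sub-image covered by box."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def read_card(
    image: Image,
    detect_fields: Callable[[Image], List[FieldRegion]],  # stand-in for text detection
    detect_chars: Callable[[Image], List[Box]],           # stand-in for character detection
    classify_char: Callable[[Image], str],                # stand-in for character recognition
) -> Dict[str, str]:
    """Run the three stages in order and return one string per detected field."""
    result: Dict[str, str] = {}
    for region in detect_fields(image):
        field_image = crop(image, region.box)
        # Assumption: one text line per field, so x_min gives reading order.
        char_boxes = sorted(detect_chars(field_image), key=lambda b: b[0])
        result[region.label] = "".join(
            classify_char(crop(field_image, b)) for b in char_boxes
        )
    return result
```

Passing the stages in as callables keeps the sketch independent of any particular detection or classification library. A real system would wrap trained models behind the same three signatures.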