Basheer Al-Sadawi, Ahmed Hussain, Nabeel Salih Ali
{"title":"High- Performance Printed Arabic Optical Character Recognition System Using ANN Classifier","authors":"Basheer Al-Sadawi, Ahmed Hussain, Nabeel Salih Ali","doi":"10.1109/PICICT53635.2021.00013","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) systems were developed with high accuracy to facilitate transactions and increase human-computer interaction in most levels of government and commerce sectors. OCR has been adopted for diverse languages, but a few efforts have mainly been conducted in Arabic characters, and it has suffered weakness in the Arabic language. Therefore, a new Arabic Optical Character Recognition (AOCR) system is proposed to achieve a highperformance recognition system in printed images. Several steps are conducted to achieve the proposed AOCR system, such as image preprocessing, segmentation (line, words, and character), feature extraction, and classification. After evaluated the recognition system with multi-criteria, the AOCR results have shown accuracy with 95% with different quality document images (spatial resolution); besides, the system was adequate to resist the degradation of the documents, compared to other commercial systems in the literature.","PeriodicalId":308869,"journal":{"name":"2021 Palestinian International Conference on Information and Communication Technology (PICICT)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Palestinian International Conference on Information and Communication Technology (PICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PICICT53635.2021.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Optical Character Recognition (OCR) systems were developed with high accuracy to facilitate transactions and increase human-computer interaction in most levels of government and commerce sectors. OCR has been adopted for diverse languages, but a few efforts have mainly been conducted in Arabic characters, and it has suffered weakness in the Arabic language. Therefore, a new Arabic Optical Character Recognition (AOCR) system is proposed to achieve a highperformance recognition system in printed images. Several steps are conducted to achieve the proposed AOCR system, such as image preprocessing, segmentation (line, words, and character), feature extraction, and classification. After evaluated the recognition system with multi-criteria, the AOCR results have shown accuracy with 95% with different quality document images (spatial resolution); besides, the system was adequate to resist the degradation of the documents, compared to other commercial systems in the literature.