N. Aharrane, A. Dahmouni, K. E. Moutaouakil, K. Satori
{"title":"End-to-end system for printed Amazigh script recognition in document images","authors":"N. Aharrane, A. Dahmouni, K. E. Moutaouakil, K. Satori","doi":"10.1109/ATSIP.2017.8075520","DOIUrl":null,"url":null,"abstract":"In this work, we present an end-to-end system devoted to automatic recognition of printed Amazigh script in complex document images containing different languages such as Web images and natural scene images. To this end, text extraction from images is performed; the extracted text serves as input for a trained convolutional neural network (CNN) to identify its language. Finally, we proceed to the recognition of the Amazigh text script using a developed optical character recognition (OCR) system. The CNN reaches 99,12% of accuracy while the OCR system gets 99,93%. The obtained results seem to be very satisfactory.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"03 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP.2017.8075520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this work, we present an end-to-end system devoted to automatic recognition of printed Amazigh script in complex document images containing different languages such as Web images and natural scene images. To this end, text extraction from images is performed; the extracted text serves as input for a trained convolutional neural network (CNN) to identify its language. Finally, we proceed to the recognition of the Amazigh text script using a developed optical character recognition (OCR) system. The CNN reaches 99,12% of accuracy while the OCR system gets 99,93%. The obtained results seem to be very satisfactory.