Low resolution Arabic recognition with multidimensional recurrent neural networks

MOCR '13 Pub Date : 2013-08-24 DOI:10.1145/2505377.2505385
Sheikh Faisal Rashid, M. Schambach, J. Rottland, Stephan von der Nüll
{"title":"Low resolution Arabic recognition with multidimensional recurrent neural networks","authors":"Sheikh Faisal Rashid, M. Schambach, J. Rottland, Stephan von der Nüll","doi":"10.1145/2505377.2505385","DOIUrl":null,"url":null,"abstract":"OCR of multi-font Arabic text is difficult due to large variations in character shapes from one font to another. It becomes even more challenging if the text is rendered at very low resolution. This paper describes a multi-font, low resolution, and open vocabulary OCR system based on a multidimensional recurrent neural network architecture. For this work, we have developed various systems, trained for single-font/single-size, single-font/multi-size, and multi-font/multi-size data of the well known Arabic printed text image database (APTI). The evaluation tasks from the second Arabic text recognition competition, organized in conjunction with ICDAR 2013, have been adopted. Ten Arabic fonts in six font size categories are used for evaluation. Results show that the proposed method performs very well on the task of printed Arabic text recognition even for very low resolution and small font size images. Overall, the system yields above 99% recognition accuracy at character and word level for most of the printed Arabic fonts.","PeriodicalId":288465,"journal":{"name":"MOCR '13","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MOCR '13","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2505377.2505385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 35

Abstract

OCR of multi-font Arabic text is difficult due to large variations in character shapes from one font to another. It becomes even more challenging if the text is rendered at very low resolution. This paper describes a multi-font, low resolution, and open vocabulary OCR system based on a multidimensional recurrent neural network architecture. For this work, we have developed various systems, trained for single-font/single-size, single-font/multi-size, and multi-font/multi-size data of the well known Arabic printed text image database (APTI). The evaluation tasks from the second Arabic text recognition competition, organized in conjunction with ICDAR 2013, have been adopted. Ten Arabic fonts in six font size categories are used for evaluation. Results show that the proposed method performs very well on the task of printed Arabic text recognition even for very low resolution and small font size images. Overall, the system yields above 99% recognition accuracy at character and word level for most of the printed Arabic fonts.
基于多维递归神经网络的低分辨率阿拉伯语识别
多字体阿拉伯文本的OCR是困难的,因为从一种字体到另一种字体的字符形状有很大的变化。如果文本以非常低的分辨率呈现,则更具挑战性。介绍了一种基于多维递归神经网络结构的多字体、低分辨率、开放词汇OCR系统。为了这项工作,我们开发了各种系统,训练了著名的阿拉伯印刷文本图像数据库(APTI)的单字体/单尺寸、单字体/多尺寸和多字体/多尺寸数据。采用了与ICDAR 2013联合组织的第二届阿拉伯语文本识别竞赛的评估任务。评估使用了6种字体大小类别中的10种阿拉伯字体。结果表明,该方法在低分辨率、小字体图像的打印阿拉伯文文本识别中也能取得很好的效果。总体而言,该系统在字符和单词级别上对大多数印刷阿拉伯字体的识别准确率超过99%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信