Image character through signal and pattern formation

S. Zaman, Kanwal Anwar, Riaz Khan
{"title":"Image character through signal and pattern formation","authors":"S. Zaman, Kanwal Anwar, Riaz Khan","doi":"10.1109/LT.2016.7562865","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) is one of key research areas of Artificial Intelligence (AI), and image text recognition is one of challenging fields of OCR. Presented work offers a character recognition system for cursive script (e.g., Arabic, Urdu, etc.) segmented characters from their images. Presented methodology consists of phases namely (1) Image Acquisition, (2) Preprocessing, (3) Chain Code Formation, (4) Signal Generation, and (5) Pattern Extraction. Image Acquisition takes an image of segmented character and converts it into binary image. Preprocessing finds out unconnected regions from the binary image and separates into one main region and one or more secondary regions. The secondary regions make vector of features whereas main region is further converted to one pixel wide thinned image. Chain Code Formation finds out chain code vector using 8-directions. Finally Pattern Extraction uses a defined algorithm to form qualitative signal pattern describing increase (+1), decrease (-1), and constant (0) pattern of writing layout. Additionally recognition of the character is carried out through Feed-Forward Neural Network (FFNN). The methodology and presented algorithm are evaluated using 1292 images of segmented characters of 271 different ligature classes of printed script. The methodology is tested on MATLAB and overall recognition rate obtained with Mean Squared Error (MSE) of 0.0014 with 60 hidden neurons of FFNN.","PeriodicalId":194801,"journal":{"name":"2016 13th Learning and Technology Conference (L&T)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 13th Learning and Technology Conference (L&T)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LT.2016.7562865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Optical Character Recognition (OCR) is one of key research areas of Artificial Intelligence (AI), and image text recognition is one of challenging fields of OCR. Presented work offers a character recognition system for cursive script (e.g., Arabic, Urdu, etc.) segmented characters from their images. Presented methodology consists of phases namely (1) Image Acquisition, (2) Preprocessing, (3) Chain Code Formation, (4) Signal Generation, and (5) Pattern Extraction. Image Acquisition takes an image of segmented character and converts it into binary image. Preprocessing finds out unconnected regions from the binary image and separates into one main region and one or more secondary regions. The secondary regions make vector of features whereas main region is further converted to one pixel wide thinned image. Chain Code Formation finds out chain code vector using 8-directions. Finally Pattern Extraction uses a defined algorithm to form qualitative signal pattern describing increase (+1), decrease (-1), and constant (0) pattern of writing layout. Additionally recognition of the character is carried out through Feed-Forward Neural Network (FFNN). The methodology and presented algorithm are evaluated using 1292 images of segmented characters of 271 different ligature classes of printed script. The methodology is tested on MATLAB and overall recognition rate obtained with Mean Squared Error (MSE) of 0.0014 with 60 hidden neurons of FFNN.
通过信号和模式形成图像特征
光学字符识别(OCR)是人工智能(AI)的重点研究领域之一,而图像文本识别是OCR的难点之一。提出的工作提供了一个字符识别系统草书(例如,阿拉伯语,乌尔都语等)从他们的图像分割字符。提出的方法包括(1)图像采集,(2)预处理,(3)链码形成,(4)信号生成,(5)模式提取。图像采集将字符分割后的图像转换为二值图像。预处理从二值图像中找出未连通的区域,将其分割成一个主区域和一个或多个副区域。副区域构成特征向量,主区域进一步转换为1像素宽的薄化图像。链码形成利用8个方向找出链码向量。最后,Pattern Extraction使用定义好的算法形成定性的信号模式,描述书写布局的增加(+1)、减少(-1)和恒定(0)模式。另外,通过前馈神经网络(FFNN)对字符进行识别。利用271种不同结扎类型的1292张字符分割图像对该方法和算法进行了评价。在MATLAB上对该方法进行了测试,在60个隐藏神经元的情况下,FFNN的总体识别率均方误差(MSE)为0.0014。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信