Image character through signal and pattern formation

2016 13th Learning and Technology Conference (L&T) Pub Date : 2016-04-10 DOI:10.1109/LT.2016.7562865

S. Zaman, Kanwal Anwar, Riaz Khan

{"title":"Image character through signal and pattern formation","authors":"S. Zaman, Kanwal Anwar, Riaz Khan","doi":"10.1109/LT.2016.7562865","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) is one of key research areas of Artificial Intelligence (AI), and image text recognition is one of challenging fields of OCR. Presented work offers a character recognition system for cursive script (e.g., Arabic, Urdu, etc.) segmented characters from their images. Presented methodology consists of phases namely (1) Image Acquisition, (2) Preprocessing, (3) Chain Code Formation, (4) Signal Generation, and (5) Pattern Extraction. Image Acquisition takes an image of segmented character and converts it into binary image. Preprocessing finds out unconnected regions from the binary image and separates into one main region and one or more secondary regions. The secondary regions make vector of features whereas main region is further converted to one pixel wide thinned image. Chain Code Formation finds out chain code vector using 8-directions. Finally Pattern Extraction uses a defined algorithm to form qualitative signal pattern describing increase (+1), decrease (-1), and constant (0) pattern of writing layout. Additionally recognition of the character is carried out through Feed-Forward Neural Network (FFNN). The methodology and presented algorithm are evaluated using 1292 images of segmented characters of 271 different ligature classes of printed script. The methodology is tested on MATLAB and overall recognition rate obtained with Mean Squared Error (MSE) of 0.0014 with 60 hidden neurons of FFNN.","PeriodicalId":194801,"journal":{"name":"2016 13th Learning and Technology Conference (L&T)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 13th Learning and Technology Conference (L&T)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LT.2016.7562865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Optical Character Recognition (OCR) is one of key research areas of Artificial Intelligence (AI), and image text recognition is one of challenging fields of OCR. Presented work offers a character recognition system for cursive script (e.g., Arabic, Urdu, etc.) segmented characters from their images. Presented methodology consists of phases namely (1) Image Acquisition, (2) Preprocessing, (3) Chain Code Formation, (4) Signal Generation, and (5) Pattern Extraction. Image Acquisition takes an image of segmented character and converts it into binary image. Preprocessing finds out unconnected regions from the binary image and separates into one main region and one or more secondary regions. The secondary regions make vector of features whereas main region is further converted to one pixel wide thinned image. Chain Code Formation finds out chain code vector using 8-directions. Finally Pattern Extraction uses a defined algorithm to form qualitative signal pattern describing increase (+1), decrease (-1), and constant (0) pattern of writing layout. Additionally recognition of the character is carried out through Feed-Forward Neural Network (FFNN). The methodology and presented algorithm are evaluated using 1292 images of segmented characters of 271 different ligature classes of printed script. The methodology is tested on MATLAB and overall recognition rate obtained with Mean Squared Error (MSE) of 0.0014 with 60 hidden neurons of FFNN.

查看原文本刊更多论文

通过信号和模式形成图像特征

光学字符识别(OCR)是人工智能(AI)的重点研究领域之一，而图像文本识别是OCR的难点之一。提出的工作提供了一个字符识别系统草书(例如，阿拉伯语，乌尔都语等)从他们的图像分割字符。提出的方法包括(1)图像采集，(2)预处理，(3)链码形成，(4)信号生成，(5)模式提取。图像采集将字符分割后的图像转换为二值图像。预处理从二值图像中找出未连通的区域，将其分割成一个主区域和一个或多个副区域。副区域构成特征向量，主区域进一步转换为1像素宽的薄化图像。链码形成利用8个方向找出链码向量。最后，Pattern Extraction使用定义好的算法形成定性的信号模式，描述书写布局的增加(+1)、减少(-1)和恒定(0)模式。另外，通过前馈神经网络(FFNN)对字符进行识别。利用271种不同结扎类型的1292张字符分割图像对该方法和算法进行了评价。在MATLAB上对该方法进行了测试，在60个隐藏神经元的情况下，FFNN的总体识别率均方误差(MSE)为0.0014。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 13th Learning and Technology Conference (L&T)

自引率

0.00%

发文量