{"title":"Image character through signal and pattern formation","authors":"S. Zaman, Kanwal Anwar, Riaz Khan","doi":"10.1109/LT.2016.7562865","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) is one of key research areas of Artificial Intelligence (AI), and image text recognition is one of challenging fields of OCR. Presented work offers a character recognition system for cursive script (e.g., Arabic, Urdu, etc.) segmented characters from their images. Presented methodology consists of phases namely (1) Image Acquisition, (2) Preprocessing, (3) Chain Code Formation, (4) Signal Generation, and (5) Pattern Extraction. Image Acquisition takes an image of segmented character and converts it into binary image. Preprocessing finds out unconnected regions from the binary image and separates into one main region and one or more secondary regions. The secondary regions make vector of features whereas main region is further converted to one pixel wide thinned image. Chain Code Formation finds out chain code vector using 8-directions. Finally Pattern Extraction uses a defined algorithm to form qualitative signal pattern describing increase (+1), decrease (-1), and constant (0) pattern of writing layout. Additionally recognition of the character is carried out through Feed-Forward Neural Network (FFNN). The methodology and presented algorithm are evaluated using 1292 images of segmented characters of 271 different ligature classes of printed script. The methodology is tested on MATLAB and overall recognition rate obtained with Mean Squared Error (MSE) of 0.0014 with 60 hidden neurons of FFNN.","PeriodicalId":194801,"journal":{"name":"2016 13th Learning and Technology Conference (L&T)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 13th Learning and Technology Conference (L&T)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LT.2016.7562865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Optical Character Recognition (OCR) is one of key research areas of Artificial Intelligence (AI), and image text recognition is one of challenging fields of OCR. Presented work offers a character recognition system for cursive script (e.g., Arabic, Urdu, etc.) segmented characters from their images. Presented methodology consists of phases namely (1) Image Acquisition, (2) Preprocessing, (3) Chain Code Formation, (4) Signal Generation, and (5) Pattern Extraction. Image Acquisition takes an image of segmented character and converts it into binary image. Preprocessing finds out unconnected regions from the binary image and separates into one main region and one or more secondary regions. The secondary regions make vector of features whereas main region is further converted to one pixel wide thinned image. Chain Code Formation finds out chain code vector using 8-directions. Finally Pattern Extraction uses a defined algorithm to form qualitative signal pattern describing increase (+1), decrease (-1), and constant (0) pattern of writing layout. Additionally recognition of the character is carried out through Feed-Forward Neural Network (FFNN). The methodology and presented algorithm are evaluated using 1292 images of segmented characters of 271 different ligature classes of printed script. The methodology is tested on MATLAB and overall recognition rate obtained with Mean Squared Error (MSE) of 0.0014 with 60 hidden neurons of FFNN.