{"title":"Handwritten Jawi words recognition using Hidden Markov Models","authors":"Remon Redika, K. Omar, Mohammad Faidzul Nasrudin","doi":"10.1109/ITSIM.2008.4631723","DOIUrl":null,"url":null,"abstract":"Handwritten Jawi recognition is a challenging task because of the cursive nature of the writing. In manuscript writings, words are writer-dependent. The recognition task of Jawi Manuscript still opens problem due to the existence of many difficulties, such as the variability of character shape, overlap and presence of ligature in manuscript words. This paper describes a technique of Jawi word recognition using Hidden Markov Model (HMM). The technique of segmentation-free method used to transform word image into sequences of frames. The geometrical features are extracted using sliding window from each observation frame sequence. Besides, baseline parameters of Jawi word are use in the calculation of black pixel density. Vector Quantization clusters these features and assigns them into symbols that will be used as HMM input. Experiments have been conducted on 579 images of 100 words lexicon of Syair Rakis manuscript, and the recognition rate has reached 84 percent recognition.","PeriodicalId":314159,"journal":{"name":"2008 International Symposium on Information Technology","volume":"438 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposium on Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSIM.2008.4631723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Handwritten Jawi recognition is a challenging task because of the cursive nature of the writing. In manuscript writings, words are writer-dependent. The recognition task of Jawi Manuscript still opens problem due to the existence of many difficulties, such as the variability of character shape, overlap and presence of ligature in manuscript words. This paper describes a technique of Jawi word recognition using Hidden Markov Model (HMM). The technique of segmentation-free method used to transform word image into sequences of frames. The geometrical features are extracted using sliding window from each observation frame sequence. Besides, baseline parameters of Jawi word are use in the calculation of black pixel density. Vector Quantization clusters these features and assigns them into symbols that will be used as HMM input. Experiments have been conducted on 579 images of 100 words lexicon of Syair Rakis manuscript, and the recognition rate has reached 84 percent recognition.