{"title":"使用HMM Toolkit (HTK)识别离线阿拉伯手写单词","authors":"Hicham El Moubtahij, A. Halli, K. Satori","doi":"10.1109/CGIV.2016.40","DOIUrl":null,"url":null,"abstract":"There are a lot of difficulties facing a good handwritten Arabic recognition system such as the similarities of different character shapes and the unlimited variants in human handwriting. This paper presents a handwriting Arabic word recognition system. The objective of this approach is to propose an analytical offline recognition method of handwritten Arabic for rapid implementation. The first part in the writing recognition system is the preprocessing phase that prepares the data which serves to introduce and extract a set of simple statistical features by a window sliding along that text line from the right to left, then it injects the resulting feature vectors to the Hidden Markov Model Toolkit (HTK). In the recognition phase, the concatenation of characters to form words is modelled by simple lexical models, each word is modelled by a stochastic finite-state automaton (SFSA). The proposed system is applied to an \"Arabic-Numbers\" data corpus, which contains 47 words and 1905 sentences. These sentences are written by five different peoples.","PeriodicalId":351561,"journal":{"name":"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Recognition of Off-Line Arabic Handwriting Words Using HMM Toolkit (HTK)\",\"authors\":\"Hicham El Moubtahij, A. Halli, K. Satori\",\"doi\":\"10.1109/CGIV.2016.40\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are a lot of difficulties facing a good handwritten Arabic recognition system such as the similarities of different character shapes and the unlimited variants in human handwriting. This paper presents a handwriting Arabic word recognition system. The objective of this approach is to propose an analytical offline recognition method of handwritten Arabic for rapid implementation. The first part in the writing recognition system is the preprocessing phase that prepares the data which serves to introduce and extract a set of simple statistical features by a window sliding along that text line from the right to left, then it injects the resulting feature vectors to the Hidden Markov Model Toolkit (HTK). In the recognition phase, the concatenation of characters to form words is modelled by simple lexical models, each word is modelled by a stochastic finite-state automaton (SFSA). The proposed system is applied to an \\\"Arabic-Numbers\\\" data corpus, which contains 47 words and 1905 sentences. These sentences are written by five different peoples.\",\"PeriodicalId\":351561,\"journal\":{\"name\":\"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CGIV.2016.40\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CGIV.2016.40","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recognition of Off-Line Arabic Handwriting Words Using HMM Toolkit (HTK)
There are a lot of difficulties facing a good handwritten Arabic recognition system such as the similarities of different character shapes and the unlimited variants in human handwriting. This paper presents a handwriting Arabic word recognition system. The objective of this approach is to propose an analytical offline recognition method of handwritten Arabic for rapid implementation. The first part in the writing recognition system is the preprocessing phase that prepares the data which serves to introduce and extract a set of simple statistical features by a window sliding along that text line from the right to left, then it injects the resulting feature vectors to the Hidden Markov Model Toolkit (HTK). In the recognition phase, the concatenation of characters to form words is modelled by simple lexical models, each word is modelled by a stochastic finite-state automaton (SFSA). The proposed system is applied to an "Arabic-Numbers" data corpus, which contains 47 words and 1905 sentences. These sentences are written by five different peoples.