{"title":"A Hybrid Language Model for Handwritten Chinese Sentence Recognition","authors":"Q. He, Shijie Chen, Mingxi Zhao, Wei Lin","doi":"10.1109/ICFHR.2012.157","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a hybrid language model for handwritten Chinese sentence recognition. This hybrid model is integrated from several independent language models, each of which is trained from a distinct type of corpus and models specifically the linguistic behavior for that type of corpus. By inferring the type of the string which the user has already written, we can make this hybrid language model contribute more precisely to the recognition engine. Our experiments show that the hybrid language model performs consistently well among different types of handwritten articles, and the overall performance is significantly better than a single standard language model. We also propose a candidate re-ranking process after recognition by reducing the language scores to improve the recognition accuracy. The experiment result also demonstrates that this re-ranking process effectively improves the performance of the recognition engine in terms of accuracy.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"208 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Frontiers in Handwriting Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2012.157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, we propose a hybrid language model for handwritten Chinese sentence recognition. This hybrid model is integrated from several independent language models, each of which is trained from a distinct type of corpus and models specifically the linguistic behavior for that type of corpus. By inferring the type of the string which the user has already written, we can make this hybrid language model contribute more precisely to the recognition engine. Our experiments show that the hybrid language model performs consistently well among different types of handwritten articles, and the overall performance is significantly better than a single standard language model. We also propose a candidate re-ranking process after recognition by reducing the language scores to improve the recognition accuracy. The experiment result also demonstrates that this re-ranking process effectively improves the performance of the recognition engine in terms of accuracy.