{"title":"用于中文手写识别的深度LSTM网络","authors":"Li Sun, Tonghua Su, Ce Liu, Ruigang Wang","doi":"10.1109/ICFHR.2016.0059","DOIUrl":null,"url":null,"abstract":"Currently two heavy burdens are borne in online Chinese handwriting recognition: a large-scale training data needs to be annotated with the boundaries of each character and effective features should be handcrafted by domain experts. To relieve such issues, the paper presents a novel end-to-end recognition method based on recurrent neural networks. A mixture architecture of deep bidirectional Long Short-Term Memory (LSTM) layers and feed forward subsampling layers is used to encode the long contextual history trajectories. The Connectionist Temporal Classification (CTC) objective function makes it possible to train the model without providing alignment information between input trajectories and output strings. During decoding, a modified CTC beam search algorithm is devised to integrate the linguistic constraints wisely. Our method is evaluated both on test set and competition set of CASIA-OLHWDB 2. x. Comparing with state-of-the-art methods, over 30% relative error reductions are observed on test set in terms of both correct rate and accurate rate. Even to the more challenging competition set, better results can be achieved by our method if the out-of-vocabulary problem can be ignored.","PeriodicalId":194844,"journal":{"name":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":"{\"title\":\"Deep LSTM Networks for Online Chinese Handwriting Recognition\",\"authors\":\"Li Sun, Tonghua Su, Ce Liu, Ruigang Wang\",\"doi\":\"10.1109/ICFHR.2016.0059\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Currently two heavy burdens are borne in online Chinese handwriting recognition: a large-scale training data needs to be annotated with the boundaries of each character and effective features should be handcrafted by domain experts. To relieve such issues, the paper presents a novel end-to-end recognition method based on recurrent neural networks. A mixture architecture of deep bidirectional Long Short-Term Memory (LSTM) layers and feed forward subsampling layers is used to encode the long contextual history trajectories. The Connectionist Temporal Classification (CTC) objective function makes it possible to train the model without providing alignment information between input trajectories and output strings. During decoding, a modified CTC beam search algorithm is devised to integrate the linguistic constraints wisely. Our method is evaluated both on test set and competition set of CASIA-OLHWDB 2. x. Comparing with state-of-the-art methods, over 30% relative error reductions are observed on test set in terms of both correct rate and accurate rate. Even to the more challenging competition set, better results can be achieved by our method if the out-of-vocabulary problem can be ignored.\",\"PeriodicalId\":194844,\"journal\":{\"name\":\"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"29\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFHR.2016.0059\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2016.0059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep LSTM Networks for Online Chinese Handwriting Recognition
Currently two heavy burdens are borne in online Chinese handwriting recognition: a large-scale training data needs to be annotated with the boundaries of each character and effective features should be handcrafted by domain experts. To relieve such issues, the paper presents a novel end-to-end recognition method based on recurrent neural networks. A mixture architecture of deep bidirectional Long Short-Term Memory (LSTM) layers and feed forward subsampling layers is used to encode the long contextual history trajectories. The Connectionist Temporal Classification (CTC) objective function makes it possible to train the model without providing alignment information between input trajectories and output strings. During decoding, a modified CTC beam search algorithm is devised to integrate the linguistic constraints wisely. Our method is evaluated both on test set and competition set of CASIA-OLHWDB 2. x. Comparing with state-of-the-art methods, over 30% relative error reductions are observed on test set in terms of both correct rate and accurate rate. Even to the more challenging competition set, better results can be achieved by our method if the out-of-vocabulary problem can be ignored.