{"title":"阿拉伯语离线文本识别光学建模单元的比较研究","authors":"Mohamed Benzeghiba","doi":"10.1109/ICDAR.2017.170","DOIUrl":null,"url":null,"abstract":"The role of the optical model in a text recognition system is to model the textual information written in image documents. This paper compares the performance of four Arabic optical modeling units in a Multi-Dimensional Long Short-Term Memory based state-of-the-art Arabic text recognition system. These units are: 1) The isolated characters, 2) Extended isolated characters with the different shapes of Lam-Alef (), 3) The character shapes within their contexts and, 4) The recently proposed sub-character units that allow sharing similar patterns in the different character shapes. Experiments are conducted on six tasks using Maurdor and Khatt databases. For a fair comparison, optical models are trained from scratch. The decoding is performed using 1) the predictions of the optical model only and, 2) combined with a 3-gram hybrid word/part-of-Arabic word language model. Results in terms of Word Error Rate show that best results are generally obtained with systems using isolated characters as the basic modeling units, although differences in the performance among different systems are negligible.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Comparative Study on Optical Modeling Units for Off-Line Arabic Text Recognition\",\"authors\":\"Mohamed Benzeghiba\",\"doi\":\"10.1109/ICDAR.2017.170\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The role of the optical model in a text recognition system is to model the textual information written in image documents. This paper compares the performance of four Arabic optical modeling units in a Multi-Dimensional Long Short-Term Memory based state-of-the-art Arabic text recognition system. These units are: 1) The isolated characters, 2) Extended isolated characters with the different shapes of Lam-Alef (), 3) The character shapes within their contexts and, 4) The recently proposed sub-character units that allow sharing similar patterns in the different character shapes. Experiments are conducted on six tasks using Maurdor and Khatt databases. For a fair comparison, optical models are trained from scratch. The decoding is performed using 1) the predictions of the optical model only and, 2) combined with a 3-gram hybrid word/part-of-Arabic word language model. Results in terms of Word Error Rate show that best results are generally obtained with systems using isolated characters as the basic modeling units, although differences in the performance among different systems are negligible.\",\"PeriodicalId\":433676,\"journal\":{\"name\":\"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.2017.170\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2017.170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparative Study on Optical Modeling Units for Off-Line Arabic Text Recognition
The role of the optical model in a text recognition system is to model the textual information written in image documents. This paper compares the performance of four Arabic optical modeling units in a Multi-Dimensional Long Short-Term Memory based state-of-the-art Arabic text recognition system. These units are: 1) The isolated characters, 2) Extended isolated characters with the different shapes of Lam-Alef (), 3) The character shapes within their contexts and, 4) The recently proposed sub-character units that allow sharing similar patterns in the different character shapes. Experiments are conducted on six tasks using Maurdor and Khatt databases. For a fair comparison, optical models are trained from scratch. The decoding is performed using 1) the predictions of the optical model only and, 2) combined with a 3-gram hybrid word/part-of-Arabic word language model. Results in terms of Word Error Rate show that best results are generally obtained with systems using isolated characters as the basic modeling units, although differences in the performance among different systems are negligible.