{"title":"多字体照本阿拉伯语文本的多阶段识别","authors":"Khaled El Gowely, O. El-Dessouki, A. Nazif","doi":"10.1109/ICPR.1990.118196","DOIUrl":null,"url":null,"abstract":"A new system for the recognition of a multifont photoscript Arabic text is introduced. The distinguishing feature of such text is that it is written cursively. This imposes an additional requirement of isolating each character or set of overlapping characters before recognition. The proposed system is composed of three interleaved phases. The segmentation phase attempts to produce an initial set of characters from the connected text according to a set of predefined rules. The output is then passed to a preliminary classification phase that attempts to label the unknown characters into one of ten possible classes according to a set of rules that acquire their parameter values through learning. The last phase contains a more elaborate set of rules that recognize characters within each class. This recognition phase is designed to allow errors during segmentation and/or classification to be rectified through an adaptive recognition technique. The system has been implemented and tested on several fonts with a recognition rate of 130 words/min and an error rate of less than 6%.<<ETX>>","PeriodicalId":135937,"journal":{"name":"[1990] Proceedings. 10th International Conference on Pattern Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1990-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Multi-phase recognition of multifont photoscript Arabic text\",\"authors\":\"Khaled El Gowely, O. El-Dessouki, A. Nazif\",\"doi\":\"10.1109/ICPR.1990.118196\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new system for the recognition of a multifont photoscript Arabic text is introduced. The distinguishing feature of such text is that it is written cursively. This imposes an additional requirement of isolating each character or set of overlapping characters before recognition. The proposed system is composed of three interleaved phases. The segmentation phase attempts to produce an initial set of characters from the connected text according to a set of predefined rules. The output is then passed to a preliminary classification phase that attempts to label the unknown characters into one of ten possible classes according to a set of rules that acquire their parameter values through learning. The last phase contains a more elaborate set of rules that recognize characters within each class. This recognition phase is designed to allow errors during segmentation and/or classification to be rectified through an adaptive recognition technique. The system has been implemented and tested on several fonts with a recognition rate of 130 words/min and an error rate of less than 6%.<<ETX>>\",\"PeriodicalId\":135937,\"journal\":{\"name\":\"[1990] Proceedings. 10th International Conference on Pattern Recognition\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1990-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"[1990] Proceedings. 10th International Conference on Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPR.1990.118196\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1990] Proceedings. 10th International Conference on Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPR.1990.118196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-phase recognition of multifont photoscript Arabic text
A new system for the recognition of a multifont photoscript Arabic text is introduced. The distinguishing feature of such text is that it is written cursively. This imposes an additional requirement of isolating each character or set of overlapping characters before recognition. The proposed system is composed of three interleaved phases. The segmentation phase attempts to produce an initial set of characters from the connected text according to a set of predefined rules. The output is then passed to a preliminary classification phase that attempts to label the unknown characters into one of ten possible classes according to a set of rules that acquire their parameter values through learning. The last phase contains a more elaborate set of rules that recognize characters within each class. This recognition phase is designed to allow errors during segmentation and/or classification to be rectified through an adaptive recognition technique. The system has been implemented and tested on several fonts with a recognition rate of 130 words/min and an error rate of less than 6%.<>