{"title":"孤立阿拉伯字符的快速识别系统","authors":"J. Cowell, F. Hussain","doi":"10.1109/IV.2002.1028844","DOIUrl":null,"url":null,"abstract":"This paper presents a very fast multi-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. The approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. This technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular confusion matrix.","PeriodicalId":308951,"journal":{"name":"Proceedings Sixth International Conference on Information Visualisation","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":"{\"title\":\"A fast recognition system for isolated arabic characters\",\"authors\":\"J. Cowell, F. Hussain\",\"doi\":\"10.1109/IV.2002.1028844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a very fast multi-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. The approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. This technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular confusion matrix.\",\"PeriodicalId\":308951,\"journal\":{\"name\":\"Proceedings Sixth International Conference on Information Visualisation\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"37\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Sixth International Conference on Information Visualisation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IV.2002.1028844\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Sixth International Conference on Information Visualisation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV.2002.1028844","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A fast recognition system for isolated arabic characters
This paper presents a very fast multi-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. The approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. This technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular confusion matrix.