{"title":"利用链码信号一维域的模式匹配压缩二值印刷波斯语和阿拉伯语文本图像","authors":"Esmaeil Shojaei, H. Grailu","doi":"10.1109/ACT.2010.23","DOIUrl":null,"url":null,"abstract":"Pattern Matching is the most conventional method of binary text image compression that has been only used in the 2-D domain of textual image signals. In this paper a pattern matching technique is proposed in the 1-D domain of chain code description signal of printed binary textual Farsi-Arabic images. In printed Farsi-Arabic scripts, contrary to latin scripts, letters usually attach to each other and produce many different patterns. Hence some patterns are fully or partially subsets of others. Detecting such situations and exploiting them to reduce the number of library prototypes has a great effect on the compression efficiency. The Proposed method, contrary to the existing compression methods, has used this property for increasing the compression ratio. For the template matching part of the proposed method, we may use either the cross correlation or a proposed similarity measure which has lower computation time and better results. Experimental results show that the compression performance of the proposed method is as high as 4.5 times that of the conventional one.","PeriodicalId":147311,"journal":{"name":"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Pattern Matching in the 1-D Domain of Chain Code Signals for the Compression of Binary Printed Farsi and Arabic Textual Images\",\"authors\":\"Esmaeil Shojaei, H. Grailu\",\"doi\":\"10.1109/ACT.2010.23\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pattern Matching is the most conventional method of binary text image compression that has been only used in the 2-D domain of textual image signals. In this paper a pattern matching technique is proposed in the 1-D domain of chain code description signal of printed binary textual Farsi-Arabic images. In printed Farsi-Arabic scripts, contrary to latin scripts, letters usually attach to each other and produce many different patterns. Hence some patterns are fully or partially subsets of others. Detecting such situations and exploiting them to reduce the number of library prototypes has a great effect on the compression efficiency. The Proposed method, contrary to the existing compression methods, has used this property for increasing the compression ratio. For the template matching part of the proposed method, we may use either the cross correlation or a proposed similarity measure which has lower computation time and better results. Experimental results show that the compression performance of the proposed method is as high as 4.5 times that of the conventional one.\",\"PeriodicalId\":147311,\"journal\":{\"name\":\"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies\",\"volume\":\"96 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACT.2010.23\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACT.2010.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using Pattern Matching in the 1-D Domain of Chain Code Signals for the Compression of Binary Printed Farsi and Arabic Textual Images
Pattern Matching is the most conventional method of binary text image compression that has been only used in the 2-D domain of textual image signals. In this paper a pattern matching technique is proposed in the 1-D domain of chain code description signal of printed binary textual Farsi-Arabic images. In printed Farsi-Arabic scripts, contrary to latin scripts, letters usually attach to each other and produce many different patterns. Hence some patterns are fully or partially subsets of others. Detecting such situations and exploiting them to reduce the number of library prototypes has a great effect on the compression efficiency. The Proposed method, contrary to the existing compression methods, has used this property for increasing the compression ratio. For the template matching part of the proposed method, we may use either the cross correlation or a proposed similarity measure which has lower computation time and better results. Experimental results show that the compression performance of the proposed method is as high as 4.5 times that of the conventional one.