{"title":"Text segmentation using linear transforms","authors":"N. Chaddha, A. Gupta","doi":"10.1109/ACSSC.1995.540937","DOIUrl":null,"url":null,"abstract":"Block-based linear transforms have found widespread use in image and video compression. However popular compression algorithms using such transforms, such as JPEG, which are very effective in compressing continuous tone images, do not perform well on mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms based on the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can now use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.","PeriodicalId":171264,"journal":{"name":"Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.1995.540937","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
Block-based linear transforms have found widespread use in image and video compression. However popular compression algorithms using such transforms, such as JPEG, which are very effective in compressing continuous tone images, do not perform well on mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms based on the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can now use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.