Text segmentation using linear transforms

Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers Pub Date : 1995-10-30 DOI:10.1109/ACSSC.1995.540937

N. Chaddha, A. Gupta

{"title":"Text segmentation using linear transforms","authors":"N. Chaddha, A. Gupta","doi":"10.1109/ACSSC.1995.540937","DOIUrl":null,"url":null,"abstract":"Block-based linear transforms have found widespread use in image and video compression. However popular compression algorithms using such transforms, such as JPEG, which are very effective in compressing continuous tone images, do not perform well on mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms based on the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can now use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.","PeriodicalId":171264,"journal":{"name":"Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.1995.540937","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

Block-based linear transforms have found widespread use in image and video compression. However popular compression algorithms using such transforms, such as JPEG, which are very effective in compressing continuous tone images, do not perform well on mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms based on the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can now use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.

查看原文本刊更多论文

使用线性变换的文本分割

基于块的线性变换在图像和视频压缩中得到了广泛的应用。然而，使用这种变换的流行压缩算法，如JPEG，在压缩连续色调图像方面非常有效，但在具有大量文本成分的混合模式图像上表现不佳。随着越来越多的应用中出现这样的图像，例如，彩色传真，数字图书馆和教育视频，能够将每个块分类为文本或连续色调的优势。在这种分类下，可以对两类数据采用不同的压缩参数甚至算法，从而在视觉质量损失最小的情况下获得较高的压缩率。本文提出了基于各种线性变换的文本分割算法。基于分割的准确性和鲁棒性对算法进行了分析。我们的结果表明，任何流行的线性变换(DCT, DHT, DFT, WHT, DWT)都可以用于准确和鲁棒的文本分割。我们的结果的一个重要的实际含义是，系统设计者现在可以对分割和压缩使用相同的变换，从而在提高质量的同时获得大量的计算成本节省。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers

自引率

0.00%

发文量