{"title":"Entropy- and complexity-constrained classified quantizer design for distributed image classification","authors":"H. Xie, Antonio Ortega","doi":"10.1109/MMSP.2002.1203252","DOIUrl":null,"url":null,"abstract":"In this paper, we address the issue of feature encoding for distributed image classification systems. Such systems often extract a set of features such as color, texture and shape from the raw multimedia data automatically and store them as content descriptors. This content-based metadata supports a wider variety of queries than text-based metadata and thus provides a promising approach for efficient database access and management. When the size of the database becomes large and the number of clients connected to the server increases, the feature data requires a significant amount of storage space and transmission bandwidth. Thus it is useful to devise techniques to compress the features. In this paper, we propose an optimal design of a classified quantizer in a rate-distortion-complexity optimization framework. A decision tree classifier (DTC) is applied to classify the compressed data. We employ the generalized Breiman, Freidman, Olshen, and Stone (G-BFOS) algorithm to design the optimal pre-classifier, which is a pruned sub-tree of the decision tree, and to perform the optimal bit allocation among classes. The optimization is carried out based not only on a rate budget, but also on a coding complexity constraint. We illustrate this framework by showing a texture classification example. Our results show that by using a classified quantizer to encode the features, we are able to improve the percentage of correct classification also leads to a reduction of the number of images transmitted between server and client.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 IEEE Workshop on Multimedia Signal Processing.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2002.1203252","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
In this paper, we address the issue of feature encoding for distributed image classification systems. Such systems often extract a set of features such as color, texture and shape from the raw multimedia data automatically and store them as content descriptors. This content-based metadata supports a wider variety of queries than text-based metadata and thus provides a promising approach for efficient database access and management. When the size of the database becomes large and the number of clients connected to the server increases, the feature data requires a significant amount of storage space and transmission bandwidth. Thus it is useful to devise techniques to compress the features. In this paper, we propose an optimal design of a classified quantizer in a rate-distortion-complexity optimization framework. A decision tree classifier (DTC) is applied to classify the compressed data. We employ the generalized Breiman, Freidman, Olshen, and Stone (G-BFOS) algorithm to design the optimal pre-classifier, which is a pruned sub-tree of the decision tree, and to perform the optimal bit allocation among classes. The optimization is carried out based not only on a rate budget, but also on a coding complexity constraint. We illustrate this framework by showing a texture classification example. Our results show that by using a classified quantizer to encode the features, we are able to improve the percentage of correct classification also leads to a reduction of the number of images transmitted between server and client.
本文主要研究分布式图像分类系统的特征编码问题。这种系统通常从原始多媒体数据中自动提取一组特征,如颜色、纹理和形状,并将其存储为内容描述符。这种基于内容的元数据比基于文本的元数据支持更广泛的查询,因此为有效的数据库访问和管理提供了一种很有前途的方法。当数据库规模变大,连接到服务器的客户端数量增加时,特征数据需要大量的存储空间和传输带宽。因此,设计压缩特征的技术是有用的。在本文中,我们提出了一个在率-失真-复杂度优化框架下的分类量化器的优化设计。采用决策树分类器(DTC)对压缩数据进行分类。我们采用广义Breiman, Freidman, Olshen, and Stone (G-BFOS)算法来设计最优预分类器,该预分类器是决策树的修剪子树,并在类之间进行最优位分配。优化不仅基于速率预算,而且基于编码复杂度约束。我们通过一个纹理分类的例子来说明这个框架。我们的结果表明,通过使用分类量化器对特征进行编码,我们能够提高正确分类的百分比,并减少服务器和客户端之间传输的图像数量。