{"title":"Entropy- and complexity-constrained classified quantizer design for distributed image classification","authors":"H. Xie, Antonio Ortega","doi":"10.1109/MMSP.2002.1203252","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203252","url":null,"abstract":"In this paper, we address the issue of feature encoding for distributed image classification systems. Such systems often extract a set of features such as color, texture and shape from the raw multimedia data automatically and store them as content descriptors. This content-based metadata supports a wider variety of queries than text-based metadata and thus provides a promising approach for efficient database access and management. When the size of the database becomes large and the number of clients connected to the server increases, the feature data requires a significant amount of storage space and transmission bandwidth. Thus it is useful to devise techniques to compress the features. In this paper, we propose an optimal design of a classified quantizer in a rate-distortion-complexity optimization framework. A decision tree classifier (DTC) is applied to classify the compressed data. We employ the generalized Breiman, Freidman, Olshen, and Stone (G-BFOS) algorithm to design the optimal pre-classifier, which is a pruned sub-tree of the decision tree, and to perform the optimal bit allocation among classes. The optimization is carried out based not only on a rate budget, but also on a coding complexity constraint. We illustrate this framework by showing a texture classification example. Our results show that by using a classified quantizer to encode the features, we are able to improve the percentage of correct classification also leads to a reduction of the number of images transmitted between server and client.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121392156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sindhu Emilda, L. Jacob, O. Daescu, B. Prabhakaran
{"title":"Flexible disk scheduling for multimedia presentation servers","authors":"Sindhu Emilda, L. Jacob, O. Daescu, B. Prabhakaran","doi":"10.1109/MMSP.2002.1203343","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203343","url":null,"abstract":"Objects composing a multimedia presentation need to be delivered based on the temporal relationships specified by the author or authors. Depending on the composition of multimedia presentations, disk access for retrieval may not be needed in every round. A presentation can skip a certain number of disk scheduling rounds before retrieving the next required chunk of data. In this paper, we present a min-max skip round disk access algorithm for the retrieval of multimedia presentations in a flexible manner depending on the presentation composition, client/server buffer, and server-disk/network bandwidths. The proposed min-max skip round disk scheduling algorithm can be used for push, pull, as well as hybrid approaches for delivering multimedia presentations. We consider storage of presentations on an array of disks as well as on multi-zone recording disks. A few simulation results are provided.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"83 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132520527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two novel schemes for opportunistic multi-access","authors":"Ming-Xin Hu, Junshan Zhang","doi":"10.1109/MMSP.2002.1203333","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203333","url":null,"abstract":"We study opportunistic multiuser communications, and propose two novel schemes to address scheduling in asymmetric channels and admission control in such systems, respectively. We first device a relay-aided multiuser diversity (RAMD) scheme, in which a user can choose to communicate with the base station either directly or using relay transmission. We show that the RAMD scheme performs significantly better than Qualcomm's HDR scheme. Next, we propose a traffic-aided \"smooth\" admission control (SAC) scheme that guarantees the quality-of-service (QoS) provisioning. Specifically, building on the weighted proportional fair (WPF) scheduling, we devise a QoS driven weight adaptation algorithm. The weights assigned to new users are increased in a guarded manner. An admission decision is made based on the measured throughput within a time-out window. Our results show that the SAC scheme works well in opportunistic communication systems.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132524013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Watermark embedding in DC components of DCT for binary images","authors":"Haiping Lu, Xuxia Shi, Y. Shi, A. Kot, Lihui Chen","doi":"10.1109/MMSP.2002.1203305","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203305","url":null,"abstract":"This paper investigates the feasibility of watermark embedding in the discrete cosine transform (DCT) domain for binary images. Watermark embedding is known to be difficult for binary images due to their binary nature. For frequency domain approach to binary image watermarking, a post-embedding binarization is a necessary step to ensure that the watermarked image is still a binary image. This step disturbs the watermark embedded and is likely to remove the watermark. We have succeeded in combating this interference by embedding watermarks in the DC components of DCT and employing a biased binarization threshold. This algorithm can be applied to binary images in general and experiments show that the embedding algorithm proposed can not only survive binarization, but also provide some degree of robustness against common image processing.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131393987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Vector quantization-based automatic shot boundary detection","authors":"J. Gomes, M. Carli, S. Mitra, A. Neri","doi":"10.1109/MMSP.2002.1203285","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203285","url":null,"abstract":"A compact video description is required to efficiently browse a multimedia database. A possible solution is the creation of a summary by using a set of frames (key frames) representing the content of each part of the video. To this end, a novel shot boundary detection method, based on a vector quantization approach, is presented. Experimental results show the effectiveness, in terms of computational cost and performance, of the proposed system.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126649689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast DCT-domain filtering using frequency division processing","authors":"Dongge Li, I. Sethi, N. Dimitrova","doi":"10.1109/MMSP.2002.1203244","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203244","url":null,"abstract":"This paper presents a new scheme for the development of efficient DCT-domain algorithms based on frequency division processing (FDP). In this scheme, a manipulation in the DCT domain is decomposed into multiple simple operations, each of which is performed independently on certain sets of DCT coefficients. Using FDP we designed a fast DCT-domain algorithm for two-dimensional convolution. The algorithm, according to both theoretical analysis of computational complexity and experimental results, is more efficient than traditional spatial-domain convolution algorithms as well as other compressed-domain convolution algorithms. The performance is several times higher for kernel size beyond 5 and comparable for smaller kernel size. In addition, convolution performed directly in the compressed domain can achieve a slightly higher precision due to the removal of decompression artifacts.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126669480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A review of algorithms for audio fingerprinting","authors":"P. Cano, E. Batlle, T. Kalker, J. Haitsma","doi":"10.1109/MMSP.2002.1203274","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203274","url":null,"abstract":"An audio fingerprint is a content-based compact signature that summarizes an audio recording. Audio fingerprinting technologies have recently attracted attention since they allow the monitoring of audio independently of its format and without the need of meta-data or watermark embedding. The different approaches to fingerprinting are usually described with different rationales and terminology depending on the background: pattern matching, multimedia (music) information retrieval or cryptography (robust hashing). In this paper, we review different techniques mapping functional parts to blocks of a unified framework.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121694065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semi-regular mesh representation of 3D dynamic objects based on correspondence matching","authors":"Jeong-Hyu Yang, Chang-Su Kim, Sang Uk Lee","doi":"10.1109/MMSP.2002.1203238","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203238","url":null,"abstract":"In this paper, we propose a novel method to represent 3D dynamic objects, which are captured with a laser scanner in successive time instances. The objective is to convert an input mesh sequence into a semi-regular mesh sequence with time-invariant topology information, since it enables the easier manipulation of 3D data using many signal processing techniques. We achieve this objective by performing global motion estimation, local deformation estimation, and correspondence optimization subsequently. Simulation results show that the proposed algorithm reconstructs the original geometry faithfully. Furthermore, the resulting semi-regular mesh sequences can be effectively compressed by adopting wavelet coding schemes.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"225 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123034623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Object based video watermarking scheme using inertia ellipse and shape adaptive DCT","authors":"Guo Jie, Shi Peng-fei, Fang Tao","doi":"10.1109/MMSP.2002.1203307","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203307","url":null,"abstract":"Video object (VO) is an important concept in MPEG-4. For objects can be easily manipulated without visible distortion, the copyright protection of video objects becomes an important issue. This paper presents a watermarking scheme for video objects. Different from other methods, the proposed scheme employed inertia ellipse to achieve fast synchronization recovery in case the object was manipulated. Shape adaptive DCT and visual mask were combined to embed the watermark into the arbitrarily shaped object, which was designed to achieve the trade-off between the invisibility and the robustness. Experiments show that our algorithm is robust to object manipulations such as rotations, translations, scaling and lossy compression. Our scheme can be easily incorporated into the object-based coding framework of MPEG-4.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123657125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the performance of multiplicative spread spectrum watermarking","authors":"M. Barni, F. Bartolini, A. D. Rosa","doi":"10.1109/MMSP.2002.1203311","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203311","url":null,"abstract":"The performance of multiplicative spread spectrum watermarking algorithms are investigated under the simplifying assumption that host features are normally distributed, and attacks take the form of normally distributed additive noise. By relying on the obtained results, multiplicative spread spectrum watermarking is compared to its additive counterpart and to quantization based schemes.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122799158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}