{"title":"A new filtering scheme for processing the chromatic signals of color images: definition and properties","authors":"L. Lucchese, S. Mitra","doi":"10.1109/MMSP.2002.1203256","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203256","url":null,"abstract":"This paper presents a new filtering scheme for processing the chromatic components of color images, expressed by the CIE u' and v' chromaticity coordinates. The new scheme extends to image filtering the center of gravity law of color mixture which describes the mixing of two colors within u'v' chromaticity diagrams. The most interesting property of the proposed filtering framework is the elimination of the annoying hue shifts along edges between bright and dark areas, which would be introduced by the simple linear filtering of the u' and v' signals. Four examples of lowpass filtering reported and discussed in the paper that clearly demonstrate this important feature.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131591955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A joint channel estimation and unequal error protection scheme for image transmission in wireless OFDM systems","authors":"Yan Sun, Charles Pandana, Xiaowen Wang, K. Liu","doi":"10.1109/MMSP.2002.1203325","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203325","url":null,"abstract":"Orthogonal frequency division multiplexing (OFDM) modulation, adopted by the digital video broadcasting (DVB-T) standard, has been recognized for its good performance for high data rate wireless communications. Therefore, the study of the robust transmission of multimedia data over OFDM systems has attracted extensive research interests. In the past, channel estimation, which is an important aspect in OFDM systems, has not been exploited for multimedia transmission. When using the block training based channel estimation, OFDM data blocks experience unequal decoding error rate due to the imprecision of channel estimation. We use this property to provide unequal error protection (UEP) for transmission of SPIHT coded images. Compared with the systems using pilot training channel estimation schemes, which are recommended in the DVB-T standard, the proposed scheme improves the PSNR of reconstructed images by up to 2 dB.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131963249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Real-time shot segmentation of unedited video stream for maintenance work","authors":"F. Tsutsumi","doi":"10.1109/MMSP.2002.1203281","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203281","url":null,"abstract":"We propose a shot segmentation method of live video stream for wearable assistants. This method can automatically divide an unedited video stream into meaningful shots, even if the stream contains shaking, vibration or blurring. The basic idea of the method is dividing video streams into \"stationary\" shots and \"transitive\" shots based on the tendency of visual changes. Two typical conventional methods were compared with our method on the 55 minute maintenance video recorded in a restricted environment. Analyzing the video and computing the recall and precision rate showed the adequacy of out method. Furthermore, the practical effectiveness of the methods was evaluated by another 54 minute patrol video in an outdoor environment. Nine human subjects tried to find randomly selected shots with different segmentation produced by three methods. The results show that our method supported users to find the shots most effectively (89% success in 5 minutes).","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132981633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Color image watermarking based on a color visual model","authors":"Chun-Hsien Chou, Kuo-Cheng Liu","doi":"10.1109/MMSP.2002.1203322","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203322","url":null,"abstract":"To locate the right places for embedding watermark signals, and to set the proper strength of the embedded watermark signal is a critical problem for obtaining a robust and transparent watermark in color images. In this paper, a color visual model and the associated watermarking scheme are proposed for solving this problem. The visual model can estimate the profile of error visibility thresholds for each wavelet subband in each color channel of the host image, by which groups of perceptually significant wavelet coefficients are located for watermark embedding and the quantizer step sizes for quantization index modulation can be appropriately determined. Simulation results show that robust watermarks can be attained while retaining high image quality.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"307 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133919513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video delivery in networks with fluctuating bandwidth","authors":"B. G. Heath, D. Monro","doi":"10.1109/MMSP.2002.1203341","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203341","url":null,"abstract":"A 'client pull' mechanism is described by which applications can reliably transmit point-point video over a network with time varying bit rate. The system is particularly useful using Internet delivery over wireless networks, and is suitable for carrying live video. The problem is considered at the application level, using protocols such as TCP which will guarantee data delivery, however slowly. At a fixed image quality, the client requests frames which the server codes on demand. Latency can be overcome by the client using recent network performance to predict the time at which a frame should be requested.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134019704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A full-text retrieval approach to content-based audio identification","authors":"Andreas Ribbrock, F. Kurth","doi":"10.1109/MMSP.2002.1203280","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203280","url":null,"abstract":"We give an overview on a novel framework for content-based multimedia retrieval. In this paper, we present an implementation for audio identification. This framework consists of an index-based search combining algebraic methods with classical full-text retrieval. In the main part of the paper, we propose several feature extractors which may be used for indexing the PCM audio data. We give an overview on our test results containing performance data (e.g. query response times), memory requirements (e.g., index size), and robustness issues. The size of our index turns out to be only a 1/1000th to about 1/15000th of the original PCM material depending on the required granularity for identifying a piece of audio.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128226335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust watermarking of 3D mesh models","authors":"H. S. Song, N. Cho, Jongweon Kim","doi":"10.1109/MMSP.2002.1203313","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203313","url":null,"abstract":"A robust watermarking algorithm for the 3D mesh models is proposed. The algorithm is based on the watermarking of images from a virtual 3D scanner, which mimics the operation of 3D scanner in the real world. The position of the object in the scanner is determined by the principle component analysis of the vertex points. After obtaining 2D range image from the virtual scanner, we embed the watermark using the conventional 2D image watermarking method based on the DCT. Then, the vertices of the model are moved according to the range values modified by the 2D watermark. For the watermark extraction, the virtual ranging is performed and then the retrieval process of 2D image watermarking is performed. Experimental results show that the proposed algorithm is robust against the attacks such as mesh simplification and Gaussian noise.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124141969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wipe effect detection for video sequences","authors":"P. Campisi, A. Neri, L. Sorgi","doi":"10.1109/MMSP.2002.1203272","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203272","url":null,"abstract":"The design of automatic tools to allow content-based analysis, browsing, and retrieval is of paramount importance due to the wide spread of multimedia databases and to the enormous amount of information they contain. In this paper we present an algorithm tailored to the detection of editing effects such as wipes, which are widely used in television and movies production to emphasize scene changes. In our approach a computationally inexpensive although effective algorithm for wipe detection is presented. It is based on trajectory estimation of the boundary line between two successive frame belonging to the wipe region. The experimental results highlight the effectiveness of the proposed method.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"747 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122967132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Markus Schnell, Michael Küstner, O. Jokisch, R. Hoffmann
{"title":"Text-to-speech for low-resource systems","authors":"Markus Schnell, Michael Küstner, O. Jokisch, R. Hoffmann","doi":"10.1109/MMSP.2002.1203295","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203295","url":null,"abstract":"This article describes the restrictions and requirements low-resource systems to impose on text-to-speech (TTS) software. The most important point is available memory size, but computing time and implementation issues are discussed as well. For each restriction, one or more solutions are presented. The proferred solutions have been implemented by Infineon Technologies AG and the Technical University of Dresden.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122134689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Joint source-channel coding of binary sources with side information at the decoder using IRA codes","authors":"A. Liveris, Zixiang Xiong, C. Georghiades","doi":"10.1109/MMSP.2002.1203246","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203246","url":null,"abstract":"We use systematic irregular repeat accumulate (IRA) codes as source-channel codes for the transmission of an equiprobable memoryless binary source with side information at the decoder. A special case of this problem is joint source-channel coding for a nonequiprobable memoryless binary source. The theoretical limits of this problem are given by combining the Slepian-Wolf theorem, the source entropy in the special case, with the channel capacity. The approach is based on viewing the correlation between the binary source output and the side information as a separate channel or an enhancement of the original channel. The joint source-channel encoding, decoding and code design procedures are explained in detail. The simulated performance results are better than the recently published solutions using turbo codes and very close to the theoretical limit.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"424 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128110805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}