Proceedings. IEEE International Conference on Multimedia and Expo最新文献_第6页

Automatic feedback for content based image retrieval on the Web Web上基于内容的图像检索的自动反馈

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035758

Y. Aslandogan, Clement T. Yu

引用次数: 8

Image retrieval based on multi-scale edge model 基于多尺度边缘模型的图像检索

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035627

P. Bao, Xianjun Zhang

引用次数: 8

Bit-plane error recovery via cross subband for image transmission in JPEG2000 JPEG2000图像传输中基于交叉子带的位面误差恢复

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035740

Pei-Jun Lee, Liang-Gee Chen

{"title":"Bit-plane error recovery via cross subband for image transmission in JPEG2000","authors":"Pei-Jun Lee, Liang-Gee Chen","doi":"10.1109/ICME.2002.1035740","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035740","url":null,"abstract":"For multimedia transmission over noisy channels, the error robustness of JPEG2000 evidently outperforms that of JPEG. Since JPEG2000 is based on the discrete wavelet transform (DWT), traditional error concealment algorithms for still images in the discrete cosine transform (DCT) domain are not suitable for JPEG2000. In JPEG2000, decoding is processed bitplane by bitplane. Any data loss occurring in the bitstream will affect the consequent bitplanes and their wavelet coefficients. To solve this problem, the JPEG2000 VM7.2 program replaces the missing wavelet coefficients by zeros. However, the replacement may affect lots of significant nonzero coefficients such that some high frequency components are lost. In this paper, we present a novel error concealment algorithm for image transmission in the bitplane base. The proposed algorithm recovers the damaged bitplane data according to the cross subband and undamaged bitplane information. The recovered wavelet coefficients are similar with error-free data. The objective results show that the proposed algorithm has 3/spl sim/8dB improvement than those without the error resilient mechanism. From a subjective viewpoint, the proposed algorithm can achieve much smoother edges on the reconstructed image using our concealment algorithm.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"40 1","pages":"149-152 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82292419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Self-optimized spectral correlation method for background music identification 背景音乐识别的自优化谱相关方法

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035786

M. Abe, M. Nishiguchi

引用次数: 8

Optimized video streaming for networks with varying delay 针对不同延迟的网络优化了视频流

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035431

S. Wee, Wai-tian Tan, J. Apostolopoulos, M. Etoh

{"title":"Optimized video streaming for networks with varying delay","authors":"S. Wee, Wai-tian Tan, J. Apostolopoulos, M. Etoh","doi":"10.1109/ICME.2002.1035431","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035431","url":null,"abstract":"This paper presents a method for distortion-optimized streaming of predictively coded video over packet networks with varying delay. In networks with significant delay variations, coded video frames can arrive late at the decoder and miss their respective display deadlines. Furthermore, due to predictive coding, a late frame can also prevent a number of subsequent frames from being displayed properly, where the number of affected frames or degree of distortion depends on the particular coding dependencies of the late frame. In this paper, we present an optimized video streaming strategy based on frame reordering for networks with significant delay variations. This streaming strategy minimizes distortion by exploiting the fact that different late frames result in different degrees of distortion. We model the router-induced delay in a wired network with an analytical PDF and we model the link-layer retransmission delay of a wireless network with the 3GPP specification for W-CDMA radio link control. We compute the distortion for different frame reorderings using the network delay models and a source model that accounts for the prediction dependencies of predictively coded video. Our optimized streaming strategies are shown to reduce the number of late frames by 14 to 23% for the situations examined.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"83 1","pages":"89-92 vol.2"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80650185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 37

Interactive room acoustic rendering in real time 实时互动式房间声学渲染

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035827

L. Savioja, T. Lokki, J. Huopaniemi

引用次数: 1

On model-based clustering of video scenes using scenelets 基于模型的视频场景聚类研究

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035778

Hong Lu, Yap-Peng Tan

引用次数: 5

Universal MPEG content access using compressed-domain system stream editing techniques 使用压缩域系统流编辑技术的通用MPEG内容访问

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035419

Ching-Yung Lin, Belle L. Tseng, John R. Smith

{"title":"Universal MPEG content access using compressed-domain system stream editing techniques","authors":"Ching-Yung Lin, Belle L. Tseng, John R. Smith","doi":"10.1109/ICME.2002.1035419","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035419","url":null,"abstract":"An MPEG system layer compressed-domain editing technique is proposed to facilitate the delivery and integration of multiple segments of MPEG files, residing on remote databases. Various multimedia applications, including retrieval and summarization, split MPEG files into small segments along shot boundaries and store them separately. This traditional method requires extra management and storage payload, provides only fixed segmentations, and may not be play smoothly. In order to solve this problem, our MPEG system-domain editing tool directly extracts video-audio information from the original MPEG sources and combines them to generate a single MPEG file. Manipulated wholly in the system bitstream domain, this method does not require decoding, re-encoding, and re-synchronization of audio and video data. Thus, it operates in real-time and provides great flexibility. This composite MPEG file can be transmitted and displayed through general Web interfaces. The proposed method is applied to our video retrieval, video summarization, and video editing systems, and has shown its great advantages.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"73-76 vol.2"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82918977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

A cost-effective solution for eye-gaze assistive technology 一种具有成本效益的眼球注视辅助技术解决方案

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035632

Fulvio Corno, L. Farinetti, I. Signorile

引用次数: 53

Retrieval of articulate objects from images and video using invariant signatures 使用不变签名从图像和视频中检索清晰的对象

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035757

Ronald-Bryan O. Alferez, Yuan-fang Wang

{"title":"Retrieval of articulate objects from images and video using invariant signatures","authors":"Ronald-Bryan O. Alferez, Yuan-fang Wang","doi":"10.1109/ICME.2002.1035757","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035757","url":null,"abstract":"We propose a new method of retrieving multi-part, articulate objects from images and video. The scheme is particularly well suited for analyzing images and video for objects that can pose differently with possible shape deformation and articulated motion. The scheme involves computing an invariant signature for each segmented region in the image, in a manner that is insensitive to translation, rotation, scale, and shear. Using circular cross-correlation, these signatures can then be efficiently compared with that of user-defined regions of interest. Ambiguities between individual region matches are then resolved through relaxation labeling techniques. A final match is established when a collection of segmented regions conform to the query object, both in terms of local shape description and global structural relation. The scheme thus allows for articulated movement of object parts within the scene. The procedure is easy to implement, yet shows promising results in its ability to isolate interesting regions in images and video, to account for structural and relational constraints among regions, and to integrate both local shape and global structural information for a detailed examination of the scene in a way that is invariant to many visual variations.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"6 1","pages":"217-220 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89260330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0