2005 IEEE 7th Workshop on Multimedia Signal Processing最新文献

筛选
英文 中文
A New Update Step for Reduction of PSNR Fluctuations in Motion-Compensated Lifted Wavelet Video Coding 运动补偿提升小波视频编码中减小PSNR波动的新方法
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248667
Aditya Mavlankar, Sangeun Han, Chuo-Ling Chang, B. Girod
{"title":"A New Update Step for Reduction of PSNR Fluctuations in Motion-Compensated Lifted Wavelet Video Coding","authors":"Aditya Mavlankar, Sangeun Han, Chuo-Ling Chang, B. Girod","doi":"10.1109/MMSP.2005.248667","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248667","url":null,"abstract":"In wavelet video coding, due to motion-compensation entwined in the temporal wavelet transform, the distortion in the temporal subbands propagates in an uneven manner into the reconstructed frames of a group of pictures. This leads to fluctuation of reconstruction quality in time, which can be visually displeasing. We propose a new update step which is derived by taking into account a Lagrangian term for even distribution of distortion among reconstructed frames in addition to minimizing total distortion. Additionally some heuristics for the implementation of this new update step are proposed. Experimental results show reduction of quality fluctuation compared to the conventional update step","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131892299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Comparison Between Multiple Description and Single Description Video Coding With Forward Error Correction 前向纠错下多描述与单描述视频编码的比较
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248546
R. Bernardini, M. Durigon, R. Rinaldo, A. Vitali
{"title":"Comparison Between Multiple Description and Single Description Video Coding With Forward Error Correction","authors":"R. Bernardini, M. Durigon, R. Rinaldo, A. Vitali","doi":"10.1109/MMSP.2005.248546","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248546","url":null,"abstract":"Video streaming over packet switched best-effort networks is a challenging topic, due to low latency, scalability and fault tolerance requirements. Many techniques can be used to deal with delay, loss and the time-varying nature of best-effort networks. In this paper we compare two techniques to improve the performance of video streaming, i.e., a multiple description (MD) scheme based on spatial polyphase downsampling, and a single description (SD) scheme where robustness to packet loss is increased using forward error correcting (FEC) codes. We consider both a single channel scenario and a multiple channel (or multi-path) scenario. We span a large set of channel conditions, to consider the high packet loss probabilities common in wireless communication systems. A H.264/AVC video coding standard with advanced error concealment capabilities is used. Experimental results show that MD can be competitive in practical scenarios with more flexibility and less complexity than the SD+FEC scheme","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133557021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Multiresolution Modeling of 3D Maps 三维地图的多分辨率建模
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248577
I. R. Khan, M. Okuda
{"title":"Multiresolution Modeling of 3D Maps","authors":"I. R. Khan, M. Okuda","doi":"10.1109/MMSP.2005.248577","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248577","url":null,"abstract":"Three-dimensional (3D) urban models, come with huge data size and their simplification is often needed for efficient streaming, rendering and visualization. In this paper, we present a strategy for organization of this data such that the models of reduced resolution can be readily created. Simplification techniques for different types of objects in the urban models are presented and are used to represent them at different levels of detail. Objects are classified based on their importance relative to other objects in the model, and based on this classification, a suitable level of details of each object is determined for inclusion in the model of reduced resolution","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134040253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Content-Based Video Coding Method for Remote Monitoring of Neurosurgery 一种基于内容的神经外科远程监控视频编码方法
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248548
Jian Xu, R. Sclabassi, Qiang Liu, L. Chaparro, Mingui Sun
{"title":"A Content-Based Video Coding Method for Remote Monitoring of Neurosurgery","authors":"Jian Xu, R. Sclabassi, Qiang Liu, L. Chaparro, Mingui Sun","doi":"10.1109/MMSP.2005.248548","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248548","url":null,"abstract":"Transmitting high-quality neurophysiology video via the Internet is a challenging problem in telemedicine. We propose a novel video data processing and compression method based on the discrete wavelet transform (DWT). The DWT is utilized to decompose video frames into subframes, different bandwidth budgets are assigned to important and less important regions in each subframe, and several video encoders are employed in parallel to accelerate computation. When compared with the general-purpose video compression methods, our method allows scalability on real-time network bandwidth allocation, and offers higher video quality within the critical field of neurosurgery","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132494291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Language Identification using Warping and the Shifted Delta Cepstrum 使用扭曲和移位倒谱的语言识别
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248554
Felicity Allen, E. Ambikairajah, J. Epps
{"title":"Language Identification using Warping and the Shifted Delta Cepstrum","authors":"Felicity Allen, E. Ambikairajah, J. Epps","doi":"10.1109/MMSP.2005.248554","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248554","url":null,"abstract":"This paper proposes the novel use of feature warping for automatic language identification, in combination with the shifted delta cepstrum (SDC) and perceptual linear predictive coefficients in a Gaussian mixture model (GMM) based system. Experimental results on various configurations of front-end techniques reported herein demonstrate that, besides providing robustness against channel mismatch and noise as found in existing literature, feature warping is useful more generally as a technique for pre-mapping data for improved compatibility with a GMM back-end. The configuration reported in this paper provides a language identification performance of 76.4% using the OGI/NIST database, a 46.5% relative reduction in error rate when compared with a benchmark system employing Mel frequency cepstral coefficients and the SDC","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132749430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
Hierarchical Sound Classification using Mpeg-7 使用Mpeg-7的分层声音分类
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248606
H. Crysandt
{"title":"Hierarchical Sound Classification using Mpeg-7","authors":"H. Crysandt","doi":"10.1109/MMSP.2005.248606","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248606","url":null,"abstract":"Due to the increasing amount of multimedia contents such as images, audio signals and videos in digital form the need for automatic or semi-automatic classification applications become more and more important. This paper describes a new sound classification technique based on the sound classification algorithm included in the MPEG-7 standard without extending or modifying it. There sequential classification is turned into a hierarchical. Thereby it is possible to use more linear transformations for (lossy) feature vector compression. Thus it is possible to work out differences between sound classes more precisely. This paper also gives a detailed view on how the algorithm is implemented using a XML database to store and request content information of the audio signals and model descriptions of sound classes using the MPEG-7 standard","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122404468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Streaming Media Delivery with Proxy Cache for Heterogeneous Clients 基于代理缓存的异构客户端流媒体交付
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248582
Yunqiang Liu, Songyu Yu
{"title":"Streaming Media Delivery with Proxy Cache for Heterogeneous Clients","authors":"Yunqiang Liu, Songyu Yu","doi":"10.1109/MMSP.2005.248582","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248582","url":null,"abstract":"Efficient streaming media delivery scheme used multimedia service can largely increase system service ability by reducing the resource requirement. Because of the heterogeneity in the underlying network environments, streaming media delivery scheme should provide different, appropriate video quality to serve clients with different bandwidth. In this paper, we present an efficient streaming media delivery scheme with the aid of proxy caching to delivery layered encoded video for heterogeneous clients. The threshold-based multicast technique is used to delivery video streams form server to proxy via backbone network. In order to reduce the bandwidth requirement of the backbone network, we develop an effective approach to determine which layers of which videos should be cached according to the request rates. A simple and efficient replacement algorithm is presented to deal with the varying request rates. Simulation results demonstrate that our scheme achieve significantly bandwidth reduction","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122776852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Moment Features in Directional Subband Domain for Rotation Invariant Texture Classification 旋转不变纹理分类的方向子带域矩特征
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248633
H. Man, Rong Duan
{"title":"Moment Features in Directional Subband Domain for Rotation Invariant Texture Classification","authors":"H. Man, Rong Duan","doi":"10.1109/MMSP.2005.248633","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248633","url":null,"abstract":"This paper presents a study on moment features in directional subband domain for rotation invariant texture image classification. The directional subband decomposition is obtained through a biorthogonal angular filter bank. Moment features are extracted from each directional subband. Two rotation invariant feature generation techniques are examined, including eigenanalysis of covariance matrix and DFT encoding. Feature vectors are further classified by multi-class linear discriminant analysis (LDA). LDA training is based on feature vectors collected from non-rotated training images, and test is performed on images rotated at various angles. Experimental results are provided to demonstrate the effectiveness of directional subband domain feature extraction method for rotation invariant classification. Performance of various feature sets are compared, and the best feature combination is presented","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127403570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Detecting New Stable Objects In Surveillance Video 在监控视频中检测新的稳定物体
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248578
R. Mathew, Zhenghua Yu, Jian Zhang
{"title":"Detecting New Stable Objects In Surveillance Video","authors":"R. Mathew, Zhenghua Yu, Jian Zhang","doi":"10.1109/MMSP.2005.248578","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248578","url":null,"abstract":"We describe a novel method to detect new stable objects in video. This includes detecting new objects that appear in a scene and remain stationary for a period of time. Examples include detecting a dropped bag or a parked car. Our method utilizes the state transition history (or a record of the \"life cycle\") of individual Gaussian distributions in a Gaussian Mixture Model (GMM) used to model the background. In typical implementations of the GMM, this state transition information is ignored however we show that by observing and retaining the history of state transitions of individual distributions, it is possible to detect long term changes in a scene. In particular we identify changes to the most probable background distribution and impose certain conditions on the characteristics and temporal behavior of this distribution. Results presented in this paper illustrate the success of the proposed method and its relevance to surveillance applications","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121573862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 37
Quality Assessment of Panorama Video for Videoconferencing Applications 视频会议应用全景视频的质量评价
2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI: 10.1109/MMSP.2005.248624
Simone Leorin, L. Lucchese, Ross Cutler
{"title":"Quality Assessment of Panorama Video for Videoconferencing Applications","authors":"Simone Leorin, L. Lucchese, Ross Cutler","doi":"10.1109/MMSP.2005.248624","DOIUrl":"https://doi.org/10.1109/MMSP.2005.248624","url":null,"abstract":"New video-conference devices based on omnidirectional multi-camera systems have been emerging in the last few years. These devices require innovative and automated video quality assessment in the earlier stages of their design in order to guarantee competitive product development and quality monitoring. Current quality assessment techniques are not adequate since they are mostly tailored to single video cameras. Even if these techniques are capable of assessing the quality of each video stream separately, the overall quality of a composite video stream generated with the outputs of multiple cameras stitched together presents strong deviations from the results of subjective quality tests. In this paper, we present new strategies for assessing the quality of composite video streams with specific emphasis on the following problems: noticeable calibration differences between adjacent cameras, concentration of motion in limited regions of the panoramic scene, combined vignetting problems, non-uniformity of the surfaces in the seam region between two adjacent cameras. Combining different features from high and low level vision, we evaluate the proposed perceptual quality metric using a set of customized test sequences and verify its correlation with subjective quality tests","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131286471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信