2012 Visual Communications and Image Processing最新文献

筛选
英文 中文
Intra coding for depth maps using adaptive boundary location 使用自适应边界位置的深度图内部编码
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410744
Lulu Chen, M. Hannuksela, Houqiang Li
{"title":"Intra coding for depth maps using adaptive boundary location","authors":"Lulu Chen, M. Hannuksela, Houqiang Li","doi":"10.1109/VCIP.2012.6410744","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410744","url":null,"abstract":"Depth maps, an essential part in the new generation of 3D video coding, allow rendering of arbitrary viewpoints of a video scene. Depth maps are characterized by sharp object boundaries, which significantly affect the rendering quality and account for the most bitrate for depth map coding. This paper proposes a novel intra coding method for depth maps based on a two-step adaptive boundary location process. By extracting a series of sub-blocks along a depth boundary and refining the boundary within sub-blocks, accurate predictions for blocks with arbitrary edge shapes can be realized. Experimental results show that the proposed scheme achieves bitrate reductions of up to 28% and 13% on average for seven test sequences of MPEG 3DV compared to original intra coding of H.264/AVC considering the same quality of synthesized views. Besides, subjective quality of virtual views is improved owning to well preserved boundary information.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125802393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimal intra coding of HEVC by structured set prediction mode with discriminative learning 基于判别学习的结构化集预测模式的HEVC优化内部编码
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410818
Wenrui Dai, H. Xiong
{"title":"Optimal intra coding of HEVC by structured set prediction mode with discriminative learning","authors":"Wenrui Dai, H. Xiong","doi":"10.1109/VCIP.2012.6410818","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410818","url":null,"abstract":"This paper proposes a novel model on intra-coding for high efficiency video coding (HEVC), which can simultaneously make the set of prediction for block of pixels in an optimal rate-distortion sense. It not only utilizes the spatial statistical correlation for the optimal prediction based on 2-D contexts, but also formulates the data-driven structural interdependencies to make the prediction error coherent with the probability distribution which is favorable for subsequent transform and coding. The so-called structured set prediction model incorporates max-margin Markov network to regulate and reason the multiple prediction in the blocks. The model parameters are learned by discriminating the actual pixel value from the other possible estimates to the maximal margin. Distinguished from the existing methods concerning the minimal prediction error, the Markov network is adaptively derived to maintain the coherence of set of prediction. To be concrete, the proposed model seeks the concurrent optimization of the set of prediction by relating the loss function to the probability distribution of subsequent DCT coefficients. The prediction error is demonstrated to be asymptotically upper bounded by the training error under the decomposable loss function. For validation, we integrate the proposed model into HEVC intra coding and experimental results show obvious improvement of coding performance in terms of BD-rate.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126043801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-quality image interpolation via local autoregressive and nonlocal 3-D sparse regularization 基于局部自回归和非局部三维稀疏正则化的高质量图像插值
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410749
Xinwei Gao, Jian Zhang, F. Jiang, Xiaopeng Fan, Siwei Ma, Debin Zhao
{"title":"High-quality image interpolation via local autoregressive and nonlocal 3-D sparse regularization","authors":"Xinwei Gao, Jian Zhang, F. Jiang, Xiaopeng Fan, Siwei Ma, Debin Zhao","doi":"10.1109/VCIP.2012.6410749","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410749","url":null,"abstract":"In this paper, we propose a novel image interpolation algorithm, which is formulated via combining both the local autoregressive (AR) model and the nonlocal adaptive 3-D sparse model as regularized constraints under the regularization framework. Estimating the high-resolution image by the local AR regularization is different from these conventional AR models, which weighted calculates the interpolation coefficients without considering the rough structural similarity between the low-resolution (LR) and high-resolution (HR) images. Then the nonlocal adaptive 3-D sparse model is formulated to regularize the interpolated HR image, which provides a way to modify these pixels with the problem of numerical stability caused by AR model. In addition, a new Split-Bregman based iterative algorithm is developed to solve the above optimization problem iteratively. Experiment results demonstrate that the proposed algorithm achieves significant performance improvements over the traditional algorithms in terms of both objective quality and visual perception.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130893067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Edge modeling prediction for computed tomography images 计算机断层扫描图像的边缘建模预测
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410795
Andreas Weinlich, P. Amon, A. Hutter, André Kaup
{"title":"Edge modeling prediction for computed tomography images","authors":"Andreas Weinlich, P. Amon, A. Hutter, André Kaup","doi":"10.1109/VCIP.2012.6410795","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410795","url":null,"abstract":"Predictive coding is applied in many state-of-the-art lossless image compression algorithms like JPEG-LS, CALIC, or least-squares-based methods. We propose a new approach for accurate intensity prediction in pixel-predictive coding of computed tomography (CT) images. Exploiting their particular edge characteristic, the method only relies on a small twelve-pixel context. It does neither require adaptation to larger-region image characteristics nor the transmission of side-information and therefore may be particularly suitable for compression of small images like in region-of-interest coding. While applying simple linear prediction with fixed weights in homogeneous regions, a Gauss error model-function is fit to given contexts in edge regions and then sampled at the position corresponding to the pixel to be predicted in order to obtain prediction values. By the example of CALIC, it is shown that for CT data the edge modeling prediction (EMP) approach can yield an even smaller prediction error than other methods relying on context modeling.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134223939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Multi-view video streaming over wireless networks with RD-optimized scheduling of network coded packets 无线网络上的多视图视频流,具有网络编码数据包的rd优化调度
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410748
I. Nemoianu, C. Greco, Marco Cagnazzo, B. Pesquet-Popescu
{"title":"Multi-view video streaming over wireless networks with RD-optimized scheduling of network coded packets","authors":"I. Nemoianu, C. Greco, Marco Cagnazzo, B. Pesquet-Popescu","doi":"10.1109/VCIP.2012.6410748","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410748","url":null,"abstract":"Multi-view video streaming is an emerging video paradigm that enables new interactive services, such as free viewpoint television and immersive teleconferencing. However, it comes with a high bandwidth cost, as the equivalent of many single-view streams has to be transmitted. Network coding (NC) can improve the performance of the network by allowing nodes to combine received packets before retransmission. Several works have shown NC to be beneficiai in wireless networks, but the delay introduced by buffering before decoding raises a problem in real-time streaming applications. Here, we propose to use Expanding Window NC (EWNC) for multi-view streaming to allow immediate decoding of the received packets. The order in which the packets are included in the coding window is chosen via RD-optimization for the current sending opportunity. Results show that our approach consistently outperforms both classical NC applied on each view independently and transmission without NC.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131649110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Adaptive rate control for High Efficiency Video Coding 高效视频编码的自适应速率控制
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410769
Junjun Si, Siwei Ma, Xinfeng Zhang, Wen Gao
{"title":"Adaptive rate control for High Efficiency Video Coding","authors":"Junjun Si, Siwei Ma, Xinfeng Zhang, Wen Gao","doi":"10.1109/VCIP.2012.6410769","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410769","url":null,"abstract":"A frame level adaptive rate control scheme for the emerging High Efficiency Video Coding (HEVC) standard is proposed in this paper, where both rate model and distortion model are provided. For rate modeling, a new rate model is proposed based on the weighted complexity estimation of the previously encoded frames. For distortion modeling, the distortion is modeled as an exponential function of the sum of absolute transformed difference (SATD) and the quantization parameter of the current frame. Moreover, a quality smoothing method based on the distortion model is proposed to reduce the quality fluctuation. The proposed rate control algorithm is implemented into HM5.0. The proposed scheme can control the bitrate accurately with smoothing quality, and the coding gain compared with state-of-the-art technique is up to 0.64dB for LB HE & LB LC, 0.33dB, 0.31dB, 0.42dB, 0.44dB for LP HE, LP LC, RA HE and RA LC respectively.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132137813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
IMAPS: A smart phone based real-time framework for prediction of affect in natural dyadic conversation IMAPS:一个基于智能手机的实时框架,用于预测自然二元对话中的情感
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410828
A. Rahman, Md. Iftekhar Tanveer, A. Anam, M. Yeasin
{"title":"IMAPS: A smart phone based real-time framework for prediction of affect in natural dyadic conversation","authors":"A. Rahman, Md. Iftekhar Tanveer, A. Anam, M. Yeasin","doi":"10.1109/VCIP.2012.6410828","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410828","url":null,"abstract":"The lack of ability to perceive emotions and affective states is a setback for people who are blind or visually impaired in professional and social communications. Towards developing assistive technology solution in facilitating natural dyadic conversations for people with such disability, this paper describes the development of a smart phone based system called interactive mobile affect perception system (iMAPS) for prediction of affective dimensions (valence-arousal-dominance). The proposed solution utilizes an Android platform in conjunction with a wireless network to build a fully integrated iMAPS. Empirical analyses were conducted to measure the efficacy and utility of the proposed solution. It was found that the proposed framework can predict affect dimensions with good accuracy (Maximum Correlation Coefficient for valence: 0.68, arousal: 0.71, and dominance: 0.67) in natural dyadic conversation. The overall minimum and maximum response times are (181 milliseconds) and (500 milliseconds), respectively.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132138055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An efficient surveillance coding method based on a timely and bit-saving background updating model 一种基于实时且节省比特的背景更新模型的高效监控编码方法
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410781
Wei Chen, Xianguo Zhang, Yonghong Tian, Tiejun Huang
{"title":"An efficient surveillance coding method based on a timely and bit-saving background updating model","authors":"Wei Chen, Xianguo Zhang, Yonghong Tian, Tiejun Huang","doi":"10.1109/VCIP.2012.6410781","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410781","url":null,"abstract":"Background modeling is an important pre-processing step for object detection in surveillance video analysis systems. Recently, it has been proved to be useful for high-efficiency surveillance video coding. In existing works, the modeling background frame often needs to be high-quality encoded so as to achieve a large bit-rate saving. However, the high-quality background frame requires lots of bits in the code stream, so it is infeasible to update the background frame too frequently. Therefore, a better bit-allocation method is desirable to facilitate in-time background updating and bit-saving background coding. In this paper, we firstly build up a background updating model from a detailed analysis of results on surveillance video. Following this, we propose a bit-saving and quality-maintaining background frame coding method. In our method, the background frame can be updated more timely, consequently leading to the better coding efficiency. Experimental results show that our method can achieve more than 15% bit-rate decrease compared with three state-of-art methods.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132755618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A fast multiscale framework for data in high-dimensions: Measure estimation, anomaly detection, and compressive measurements 高维数据的快速多尺度框架:测量估计、异常检测和压缩测量
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410789
Guangliang Chen, M. Iwen, S. Chin, M. Maggioni
{"title":"A fast multiscale framework for data in high-dimensions: Measure estimation, anomaly detection, and compressive measurements","authors":"Guangliang Chen, M. Iwen, S. Chin, M. Maggioni","doi":"10.1109/VCIP.2012.6410789","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410789","url":null,"abstract":"Data sets are often modeled as samples from some probability distribution lying in a very high dimensional space. In practice, they tend to exhibit low intrinsic dimensionality, which enables both fast construction of efficient data representations and solving statistical tasks such as regression of functions on the data, or even estimation of the probability distribution from which the data is generated. In this paper we introduce a novel multiscale density estimator for high dimensional data and apply it to the problem of detecting changes in the distribution of dynamic data, or in a time series of data sets. We also show that our data representations, which are not standard sparse linear expansions, are amenable to compressed measurements. Finally, we test our algorithms on both synthetic data and a real data set consisting of a times series of hyperspectral images, and demonstrate their high accuracy in the detection of anomalies.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129301989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
An open source MPEG DASH evaluation suite 一个开源的MPEG DASH评估套件
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410860
Stefan Lederer, Christopher Müller, Benjamin Rainer, Markus Waltl, C. Timmerer
{"title":"An open source MPEG DASH evaluation suite","authors":"Stefan Lederer, Christopher Müller, Benjamin Rainer, Markus Waltl, C. Timmerer","doi":"10.1109/VCIP.2012.6410860","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410860","url":null,"abstract":"In this paper we demonstrate our MPEG-DASH evaluation suite, which comprises several components on the client side as well as on the server side. The major client components are the VLC DASH plugin, libDASH, and DASH-JS, a JavaScript-based DASH client. These tools enable performance tests on various platforms, e.g., Windows and Linux as well as mobile platforms such as Android. Moreover, due to their flexible structure it is possible to integrate adaptation logics and evaluate them under consistent conditions. On the server side we provide the content generation tool DASHEncoder, our MPEG-DASH datasets well as the MPEG-DASH conformance validator.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129728552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信