{"title":"New bounds on image denoising: Viewpoint of sparse representation and non-local averaging","authors":"Jianzhou Feng, Li Song, X. Huo, Xiaokang Yang, Wenjun Zhang","doi":"10.1109/VCIP.2012.6410785","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410785","url":null,"abstract":"Image denoising plays a fundamental role in many image processing applications. Utilizing sparse representation and nonlocal averaging together is such a successful framework that leads to considerable progress in denoising. Almost all the newly proposed denoising algorithms are built base on it, different in detailed implementation, and the denoising performance seems converging. What is the denoising bound of this framework turns into a key question. In this paper, we assume all the possible algorithms under the framework can be approximated by a fixed two steps denoising process with different parameters. Step one cluster geometric similar image patches into groups so that patches within each group could be sparse represented under the basis of the group. Step two use the atoms of the group basis and radiometric similar patches of each patch for non-local averaging. The parameters of the process are the cluster number, the atoms and the number of radiometric similar patches for estimating each patch. Finally, the bound is derived as the minimum denoising error of all the possible parameters. Comparing with previous bounds, the new one is image specific and more practical. Experiment results show that there still exists room to improve the denoising performance for natural images.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127243287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiple sign bits hiding for High Efficiency Video Coding","authors":"Jing Wang, Xiang Yu, Dake He, F. Henry, G. Clare","doi":"10.1109/VCIP.2012.6410753","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410753","url":null,"abstract":"High Efficiency Video Coding (HEVC) is the next-generation video coding standard currently under development, which has demonstrated substantial bit savings (rate reduction by approximately half) compared to H.264/AVC. This paper presents the multiple sign bits hiding scheme that was adopted into the committee draft of HEVC at the 8th JCT-VC meeting. In HEVC, the quantized transform coefficients are entropy-coded in groups of 16 coefficients for each transform unit. With multiple sign bits hiding, for coefficient groups that satisfy certain conditions, the sign of the first non-zero coefficient along the scanning path is not explicitly transmitted in the bitstream and instead is inferred from the parity of the sum of all non-zero coefficients in that coefficient group at the decoder. To ensure the matching between the hidden sign and the parity of the sum of all non-zero coefficients, a parity adjustment method is employed at the encoder based on rate-distortion optimization or distortion minimization. Compared with conventional video coding schemes where quantization and coefficient coding are separately designed, the multiple sign bits hiding scheme in HEVC represents a joint quantization and coefficient coding design and provides consistent rate-distortion performance gains for all standard test sequences under standard test conditions.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125513488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simplified AMVP for High Efficiency Video Coding","authors":"Liang Zhao, Xun Guo, S. Lei, Siwei Ma, Debin Zhao","doi":"10.1109/VCIP.2012.6410747","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410747","url":null,"abstract":"In High Efficiency Video Coding (HEVC), advanced motion vector prediction (AMVP) is adopted to predict current motion vector by utilizing a competition-based scheme from a given candidate set, which include both the spatial and temporal motion vectors. In order to enhance the practicability of the AMVP, a simplified AMVP is proposed. Firstly, by analyzing the importance of the spatial and temporal candidates, we reduce the number of the candidates involved in the competition set and simplify the redundancy checking process, which will decrease the complexity of the decoder as well as improve the robustness of the decoder. Secondly, we simplify the zero motion adding process which will occur only when the number of existing candidates is less than the predefined number. Experimental results show that the proposed scheme provides no loss in random access and low delay conditions. These two simplifications have been proposed and adopted into the HEVC standard.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127845142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Segmentation-based view synthesis for three-dimensional video","authors":"Maziar Loghman, Joohee Kim","doi":"10.1109/VCIP.2012.6410810","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410810","url":null,"abstract":"This paper investigates the use of segmentation in view synthesis for three-dimensional video. View synthesis is the process of generating novel views of a scene, using a set of views as the reference. Recently, several techniques that use depth maps for rendering virtual views have been suggested. However, inaccuracy in depth maps causes annoying visual artifacts in depth-based view synthesis. This paper presents an efficient depth image-based rendering technique based on segmentation using multi-level thresholding. In the proposed algorithm, first all the images are segmented according to the depth and the pixels belonging to different objects are warped and blended independently. Based on multi-level thresholding, an algorithm for finding the ghost contour pixels is provided which simplifies the computations. A novel inpainting method for disocclusions has been introduced which uses the segmented images to find the associated background boundary pixels. The experimental results show that the proposed algorithm improves the PSNR of the synthesized views up to 0.68 dB for the multi-view video test sequences and eliminates the annoying visual artifacts.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131629688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Assessing photo quality with geo-context and crowdsourced photos","authors":"Wenyuan Yin, Tao Mei, Chang Wen Chen","doi":"10.1109/VCIP.2012.6410821","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410821","url":null,"abstract":"Automatic photo quality assessment emerged as a hot topic in recent years for its potential in numerous applications. Most existing approaches to photo quality assessment have predominantly focused on image content itself, while ignoring various contexts such as the associated geo-location and timestamp. However, such a universal aesthetic assessment model may not work well with significantly different contexts, since the photography rules are always scene and context dependent. In real cases, professional photographers use different photography knowledge when shooting various scenes in different places. Motivated by this observation, we leverage the geo-context information associated with photos for visual quality assessment. Specifically, we propose in this paper a Scene-Dependent Aesthetic Model (SDAM) to assess photo quality, by jointly leveraging the geo-context and visual content. Geo-contextual leveraged searching is performed to obtain relevant images with similar content to discover the scene-dependent photography principles for accurate photo quality assessment. To overcome the problem that in many cases the number of the contextually searched images is insufficient for learning the SDAM, we adopt transfer learning to utilize auxiliary photos within the same scene category from other locations for learning photography rules. Extensive experiments shows that the proposed SDAM scheme indeed improves the photo quality assessment accuracy via leveraging photo geo-contexts, compared with traditional universal aesthetic models.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134204549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D tubular structure extraction using kernel-based superellipsoid model with Gaussian process regression","authors":"Qingxiang Zhu, Dayu Zheng, H. Xiong","doi":"10.1109/VCIP.2012.6410763","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410763","url":null,"abstract":"To analyze the tubular structure correctly and obtain a record of the centerlines has become significantly more challenging and infers countless applications in a large amount of fields. Hence, a robust and automated technique for extracting the centerlines of the tubular structure is required. To address complicated 3D tubular objects, a novel kernel-based modeling approach with regard to minimizing tracking energy is presented in this paper. The 3D tubular structure can be demonstrated as a kernel-based superellipsoid model with non-uniform weights. To improve the performance, Gaussian process is also introduced to update the parameters of the kernel-based model, especially for the complicated structure with cross sections, varying radii, and complicated branches. At last, the extensive experimental results on 3D tubular data demonstrate that our proposed method deals effectively with complicated tubular structure.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127897322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Low bit-rate video coding via mode-dependent adaptive regression for wireless visual communications","authors":"Xianming Liu, Xiaolin Wu, Xinwei Gao, Debin Zhao, Wen Gao","doi":"10.1109/VCIP.2012.6410852","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410852","url":null,"abstract":"In this paper, a practical video coding scheme is developed to realize state-of-the-art video coding efficiency with lower encoder complexity at low bit-rate, while supporting standard compliance and error resilience. Such an architecture is particularly attractive for wireless visual communications. At the encoder, multiple descriptions of a video sequence are generated in the spatio-temporal domain by temporal multiplexing and spatial adaptive downsampling. The resulting side descriptions are interleaved with each other in temporal domain, and still with conventional square sample grids in spatial domain. As such, each side description can be compressed without any change to existing video coding standards. At the decoder, each side description is first decompressed, and then reconstructed to original resolution with the help of the other side description. In this procedure, the decoder recover the original video sequence in a constrained least squares regression process, using 2D or 3D piecewise autoregressive model according to different prediction modes. In this way, the spatial and temporal correlation is sufficiently explored to achieve superior quality. Experiment results demonstrate the proposed video coding scheme outperforms H.264 in rate-distortion performance at low bit-rates and achieves superior visual quality at medium bit-rates as well.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127974358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Histogram-Based stereo matching under varying illumination conditions","authors":"Il-Lyong Jung, Jae-Young Sim, Chang-Su Kim","doi":"10.1109/VCIP.2012.6410819","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410819","url":null,"abstract":"A histogram-based matching algorithm for stereo images captured under different illumination conditions is proposed in this work. The cumulative histogram of an image represents the ranks of relative pixel brightness, which are robust to illumination changes. Therefore, we design the matching cost based on the similarity of the cumulative histograms of stereo images. As an optional mode, the proposed algorithm can evaluate the histograms for foreground objects and the background separately to alleviate occlusion artifacts. To determine the disparity of each pixel, the proposed algorithm adaptively aggregates matching costs based on the color similarity and the geometric proximity of neighboring pixels. Then, it refines false disparities at occluded pixels using more reliable disparities of non-occluded pixels. Experimental results demonstrate that the proposed algorithm provides higher quality disparity maps than the conventional methods under varying illumination conditions.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116976474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video processing techniques for 3D television","authors":"Yo-Sung Ho","doi":"10.1109/VCIP.2012.6410853","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410853","url":null,"abstract":"In recent years, various multimedia services have become available and the demand for three-dimensional television (3DTV) is growing rapidly. Since 3DTV is considered as the next generation broadcasting service that can deliver real and immersive experiences by supporting user-friendly interactions, a number of advanced three-dimensional video technologies have been studied. Among them, multi-view video coding is the key technology for various applications including free-viewpoint video, free-viewpoint television, 3DTV, immersive teleconference, and surveillance systems. In this tutorial lecture, we are going to cover the current state-of-the-art technologies for 3D video: representation of 3D scenes, acquisition of 3D video contents, illumination compensation and color correction, camera calibration and image rectification, depth map modeling and enhancement, 3-D warping and depth map refinement, coding of multi-view video and depth map, hole filling for occluded objects, and view synthesis using homography. After defining the basic requirements for realistic 3D broadcasting services, we will cover various multi-modal immersive media processing technologies.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115831991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Intra prediction based on statistical modeling of images","authors":"Fatih Kamisli","doi":"10.1109/VCIP.2012.6410803","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410803","url":null,"abstract":"Intra prediction is an important part of intra-frame coding. A number of approaches have been proposed to improve intra prediction including a general linear prediction approach in which a weighted sum of all available neighbor pixels is used to predict each block pixel. An important part of this approach is the determination of the used weights. One method to determine the weights is to use the least-squares solution of an overdetermined linear system of weights. In this paper, we present an alternative approach where the weights are determined based on statistical modeling of image pixels. This approach results in an analytical expression for the weights and can achieve similar coding gains as methods based on least-squares solutions of overdetermined systems, while having several benefits such as reduced storage or computations.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114765317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}