2012 Visual Communications and Image Processing最新文献

筛选
英文 中文
Intra mode coding in HEVC standard 内模式编码在HEVC标准
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410750
Ximin Zhang, Shan Liu, S. Lei
{"title":"Intra mode coding in HEVC standard","authors":"Ximin Zhang, Shan Liu, S. Lei","doi":"10.1109/VCIP.2012.6410750","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410750","url":null,"abstract":"New High Efficiency Video Coding (HEVC) standard is designed to provide substantial coding efficiency improvement compared to H.264/AVC. Latest subjective testing shows 50% improvement has been achieved. Many new technologies contribute to the overall improvement. Intra prediction with 35 modes is one of the key improvements. Associated with that, there is a new intra mode coding method to efficiently signal the selected modes. This paper presents this new intra mode coding method that has been adopted by HEVC. In this method, the 35 intra modes are divided into two categories. One category includes 3 most probable modes (MPMs) and another category includes 32 remaining modes. In doing so, shorter codeword is used for coding MPMs and fixed length coding is used to code the remaining modes. Experimental results show the 3MPMs based method improve the coding efficiency compared to the prior art method used in H.264/AVC.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125183968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Vision Guided Compression on low-bit rate channels 低比特率信道的视觉引导压缩
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410859
S. Chai, Yingxue Li, M. Isnardi, A. Kopansky
{"title":"Vision Guided Compression on low-bit rate channels","authors":"S. Chai, Yingxue Li, M. Isnardi, A. Kopansky","doi":"10.1109/VCIP.2012.6410859","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410859","url":null,"abstract":"Full motion video (FMV) in unreliable, low-bit rate network channels suffers from quality issues such as jitter and block artifacts. In this paper, we introduce Vision Guided Compression (VGC), as a pre-processing technology that can be coupled with standards-based video coding, to provide FMV at low-bit rates. VGC utilizes computer vision algorithms to track salient features and keep them sharp, while non-salient features are lowpass filtered. With this approach, VGC provides an additional spatial parameter to gracefully tune the QoS, while providing FMV and preserving salient visual information.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121660355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient image/video deblocking via sparse representation 通过稀疏表示有效的图像/视频块化
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410838
Yi-Wen Chiou, C. Yeh, Li-Wei Kang, Chia-Wen Lin, Shu-Jhen Fan-Jiang
{"title":"Efficient image/video deblocking via sparse representation","authors":"Yi-Wen Chiou, C. Yeh, Li-Wei Kang, Chia-Wen Lin, Shu-Jhen Fan-Jiang","doi":"10.1109/VCIP.2012.6410838","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410838","url":null,"abstract":"Blocking artifact, characterized by visually noticeable changes in pixel values along block boundaries, is a common problem in block-based image/video compression, especially at low bitrate coding. Various post-processing techniques have been proposed to reduce blocking artifacts, but they usually introduce excessive blurring or ringing effects. This paper proposes a self-learning-based image/ video deblocking framework via properly formulating deblocking as an MCA (morphological component analysis)-based image decomposition problem via sparse representation. The proposed method first decomposes an image/video frame into the low-frequency and high-frequency parts by applying BM3D (block-matching and 3D filtering) algorithm. The high-frequency part is then decomposed into a “blocking component” and a “non-blocking component” by performing dictionary learning and sparse coding based on MCA. As a result, the blocking component can be removed from the image/video frame successfully while preserving most original image/video details. Experimental results demonstrate the efficacy of the proposed algorithm.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121891347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A comparison of fractional-pel interpolation filters in HEVC and H.264/AVC HEVC和H.264/AVC中分数像素插值滤波器的比较
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410767
Hao Lv, Ronggang Wang, Xiaodong Xie, Huizhu Jia, Wen Gao
{"title":"A comparison of fractional-pel interpolation filters in HEVC and H.264/AVC","authors":"Hao Lv, Ronggang Wang, Xiaodong Xie, Huizhu Jia, Wen Gao","doi":"10.1109/VCIP.2012.6410767","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410767","url":null,"abstract":"The fractional-pel interpolation filter adopted in H.264/AVC improves motion compensation greatly. Recently, a new DCT-based fractional-pel interpolation filter is adopted in the oncoming standard HEVC. We are interested in the differences between these two types of fractional-pel interpolation filters. In this paper we describe the derivations of fractional-pel interpolation filters in HEVC and H.264/AVC in detail, and compare them on properties of frequency responses. We find that the half-pel interpolation filters in HEVC and H.264/AVC are very similar, but the low-pass properties of quarter-pel interpolation filters in HEVC are much better than those in H.264/AVC. Experimental results validate this phenomenon, the fractional-pel interpolation in H.264/AVC tends to increase BD-rates by more than 10% compared with that in HEVC, and this performance loss mainly comes from quarter-pel interpolation filters. On the other hand, the complexity of fractional-pel interpolation filtering in HEVC is greatly increased than that in H.264/AVC.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121513792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
A patch-based framework for detecting abnormal activities with a PTZ camera 用于检测PTZ相机异常活动的基于补丁的框架
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410827
Yisi Tao, Yuanzhe Chen, Weiyao Lin, Xintong Han, Hongxiang Li, Zheng Lu
{"title":"A patch-based framework for detecting abnormal activities with a PTZ camera","authors":"Yisi Tao, Yuanzhe Chen, Weiyao Lin, Xintong Han, Hongxiang Li, Zheng Lu","doi":"10.1109/VCIP.2012.6410827","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410827","url":null,"abstract":"In this paper, a novel patch-based (PB) framework is proposed for detecting abnormal activities using a Pan-Tilt-Zoom (PTZ) camera. We first propose a new scene-patch-based (SSB) algorithm which can efficiently extract the target object's global trajectory from the PTZ camera. Furthermore, we propose an extended network-based (ENB) algorithm for detecting abnormal activities. The proposed ENB algorithm models the entire scene as a network where each node in the network corresponds to a patch of the scene and each edge between nodes corresponds to the activity correlation between the scene patchs. Based on this network, a recursive training strategy is proposed to train the edge weights in the network such that abnormal activities can be effectively detected through these trained edge weights. Experimental results demonstrate the effectiveness of our proposed framework.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131691979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Edge-based method for sharp region extraction from low depth of field images 基于边缘的低景深图像锐利区域提取方法
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410778
N. Neverova, H. Konik
{"title":"Edge-based method for sharp region extraction from low depth of field images","authors":"N. Neverova, H. Konik","doi":"10.1109/VCIP.2012.6410778","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410778","url":null,"abstract":"This paper presents a method for extracting blur/sharp regions of interest (ROI) that benefits of using a combination of edge and region based approaches. It can be considered as a preliminary step for many vision applications tending to focus only on the most salient areas in low depth-of-field images. To localize focused regions, we first classify each edge as either sharp or blurred based on gradient profile width estimation. Then a mean shift oversegmentation allows to label each region using the density of marked edge pixels inside. Finally, the proposed algorithm is tested on a dataset of high resolution images and the results are compared with the manually established ground truth. It is shown that the given method outperforms known state-of-the-art techniques in terms of F-measure. The robustness of the method is confirmed by means of additional experiments on images with different values of defocus degree.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129912980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
SEQM: Edge quality assessment based on structural pixel matching SEQM:基于结构像素匹配的边缘质量评估
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410731
Won-Dong Jang, Chang-Su Kim
{"title":"SEQM: Edge quality assessment based on structural pixel matching","authors":"Won-Dong Jang, Chang-Su Kim","doi":"10.1109/VCIP.2012.6410731","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410731","url":null,"abstract":"A novel quality metric for binary edge maps, called the structural edge quality metric (SEQM), is proposed in this work. First, we define the matching cost between an edge pixel in a detected edge map and its candidate matching pixel in the ground-truth edge map. The matching cost includes a structural term, as well as a positional term, to measure the discrepancy between the local structures around the two pixels. Then, we determine the optimal matching pairs of pixels using the graph-cut optimization, in which a smoothness term is employed to take into account global edge structures in the matching. Finally, we sum up the matching costs of all edge pixels to determine the quality index of the detected edge map. Simulation results demonstrate that the proposed SEQM provides more faithful and reliable quality indices than conventional metrics.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132963464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Optimization of resource reconfiguration for cloud-based multimedia applications 基于云的多媒体应用的资源重构优化
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410755
Xiaoming Nan, Yifeng He, L. Guan
{"title":"Optimization of resource reconfiguration for cloud-based multimedia applications","authors":"Xiaoming Nan, Yifeng He, L. Guan","doi":"10.1109/VCIP.2012.6410755","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410755","url":null,"abstract":"The cloud-based multimedia applications have been increasingly adopted in recent years. The key challenge for the application providers is how to optimally reconfigure resources to cope with the time-varying workload. In this paper, we study the optimal resource reconfiguration for cloud-based multimedia applications to minimize the average round-trip-time (RTT). Specifically, we propose the optimal resource reconfiguration schemes for single-site cloud and multi-site cloud, respectively. In each case, we formulate and solve the average RTT minimization problem. Simulation results demonstrate that the proposed optimal resource reconfiguration schemes can optimally utilize cloud resources to achieve the minimal average RTT.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130745584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Object-based coding for Kinect depth and color videos Kinect深度和颜色视频的基于对象的编码
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410823
Cuiling Lan, Jizheng Xu, Feng Wu
{"title":"Object-based coding for Kinect depth and color videos","authors":"Cuiling Lan, Jizheng Xu, Feng Wu","doi":"10.1109/VCIP.2012.6410823","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410823","url":null,"abstract":"Simultaneously capturing of color and depth videos, e.g. with Kinect, favors many applications and has become very popular. Efficient representation and compression of such data is important yet challenging. In this paper, we have designed an object-based coding system to compress Kinect-like depth and color videos. Segmentation is first conducted to obtain different object planes, where a mask image is utilized to identify them. We compress depth and color images respectively using the proposed object-based coding codec, which is designed based on High Efficiency Video Coding (HEVC). The mask image is losslessly compressed by adding a new context-based mode to HEVC. To assure the alignment of object boundaries on the depth image and those on the color image, a pre-processing is conducted over the depth image. The separate coding of the different object planes for the depth image can avoid the inefficiency coding of edges blocks at object boundaries and thus bring obvious coding gain. Moreover, the attractive functionality of “content-based” coding which permits the transmission of the interested object planes rather than an entire image provides a practical way to decrease the bitrate.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114308276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Optimized nested protection for video Region of Interest with Raptor codes 利用猛禽代码优化了视频感兴趣区域的嵌套保护
2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI: 10.1109/VCIP.2012.6410766
Zhengyi Luo, Li Song, Shibao Zheng, N. Ling
{"title":"Optimized nested protection for video Region of Interest with Raptor codes","authors":"Zhengyi Luo, Li Song, Shibao Zheng, N. Ling","doi":"10.1109/VCIP.2012.6410766","DOIUrl":"https://doi.org/10.1109/VCIP.2012.6410766","url":null,"abstract":"Due to the best effort feature of many existing transmission channels, video streams often suffer from inevitable transmission errors. In this paper, we propose a scheme of robust video transmission based on the state-of-the-art Raptor codes, whose applications are in full swing now. And considering Region of Interest (ROI) often draws much attention in images, the scheme adopts a nested protection framework to show partialities to ROI areas for better protection. Different from many existing Raptor codes based UEP methods, our scheme is developed based on the easy-to-use standardized Raptor codes. Experimental results show that significant robustness can be obtained for the video streams, especially for the ROI areas.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134426699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信