28th Picture Coding Symposium: Latest Papers

Influences of frame delay and packet loss between left and right frames in stereoscopic video communications
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702550
Shuliang Lin, Yuichiro Sawa, Norishige Fukushima, Y. Ishibashi
Abstract: This paper analyzes the influences of frame delay and packet loss on stereoscopic vision when stereoscopic video is transferred over an IP network. We employ live-action videos that are transferred to a head-mounted display (HMD) and assess stereoscopic perception. As a result, we found that the speed and movement direction of the object of attention play a major role in the deterioration when frame delay and packet loss exist.
Citations: 0
On-line statistical analysis based fast mode decision for multi-view video coding
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702541
G. Chan, Jheng-Ping Lin, A. Tang
Abstract: The high computational complexity of multi-view video codecs makes it necessary to speed them up before they can be realized in consumer electronics. Since fast encoding algorithms are expected to adapt to different video sequences, this paper proposes a fast algorithm that consists of fast mode decision and fast disparity estimation for multi-view video coding (MVC). The fast mode decision algorithm applies to both temporal and inter-view predictions. The candidates for mode decision are reduced based on a set of thresholds. Unlike previous fast mode decision algorithms for MVC, this scheme determines the thresholds according to an online statistical analysis of the motion and disparity costs of the first GOP in each view. Since inter-view prediction is time consuming, we propose a fast disparity estimation algorithm to save encoding time. Experimental results show that our proposed scheme reduces the computational complexity significantly with negligible degradation of coding efficiency.
Citations: 4
Super-resolution decoding of JPEG-compressed image data with the shrinkage in the redundant DCT domain
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702436
T. Komatsu, Yasutaka Ueda, T. Saito
Abstract: Alter, Durand and Froment introduced the total-variation (TV) minimization approach to artifact-free JPEG decoding, which is referred to as the ADF decoding method [1]. They formulated the decoding problem as a constrained TV restoration problem, in which the TV seminorm of the restored color image is minimized under the constraint that each DCT coefficient of the restored color image should lie in the quantization interval of its corresponding DCT coefficient of the JPEG-compressed data. This paper proposes a new restoration approach to JPEG decoding. Instead of TV regularization, our new JPEG-decoding method employs a shrinkage operation in the redundant DCT domain to mitigate degradations caused by JPEG coding. Our new method not only can selectively suppress ringing artifacts near color edges, but also can efficiently eliminate blocking artifacts in originally smoothly-varying image regions, where the blocking artifacts are very noticeable. Through decoding simulations, we experimentally show that our new decoding method can reduce JPEG-coding artifacts more effectively than the ADF decoding method.
Citations: 7
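The quantization-interval constraint mentioned in this abstract can be illustrated with a small sketch. It assumes uniform round-to-nearest quantization with step `step`, so a stored index `k` corresponds to the interval [(k − 0.5)·step, (k + 0.5)·step]. The function names are hypothetical, and the soft-threshold form of the shrinkage is an assumption, since the abstract does not say which shrinkage rule is used.

```python
def quantization_interval(index, step):
    # Interval of original coefficient values that round to `index`
    # under uniform round-to-nearest quantization (assumed here).
    return (index - 0.5) * step, (index + 0.5) * step


def project_to_interval(coeff, index, step):
    # Clamp a restored DCT coefficient back into its quantization interval,
    # so the restored image stays consistent with the compressed data.
    lo, hi = quantization_interval(index, step)
    return min(max(coeff, lo), hi)


def soft_shrink(coeff, threshold):
    # Soft-threshold shrinkage: an assumed, commonly used shrinkage rule,
    # not necessarily the one used in the paper.
    magnitude = max(abs(coeff) - threshold, 0.0)
    return magnitude if coeff >= 0 else -magnitude
```

In a restoration loop, a coefficient would typically be shrunk first and then projected, so that denoising never moves it outside the range the JPEG data allows.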
Compressive video sensing based on user attention model
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702586
Jie Xu, Jianwei Ma, Dongming Zhang, Yongdong Zhang, Shouxun Lin
Abstract: We propose a compressive video sensing scheme based on a user attention model (UAM) for the acquisition of real video sequences. In this work, for every group of consecutive video frames, we set the first frame as the reference frame and build a UAM with visual rhythm analysis (VRA) to automatically determine the region of interest (ROI) for non-reference frames. The determined ROI usually has significant movement and attracts more attention. Each frame of the video sequence is divided into non-overlapping blocks of 16×16 pixels. Compressive video sampling is conducted block by block on each frame through a single operator, and over the whole region on the ROIs through a different operator. Our video reconstruction algorithm involves an alternating direction ℓ1-norm minimization algorithm (ADM) for the frame difference of non-ROI blocks and a minimum total-variation (TV) method for the ROIs. Experimental results showed that our method could significantly enhance the quality of the reconstructed video and reduce the errors accumulated during reconstruction.
Citations: 10
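As a rough illustration of the block-by-block sampling step described in this abstract, the sketch below splits a frame into non-overlapping 16×16 blocks and applies one shared measurement operator to each vectorized block. The Gaussian random operator, the measurement count, and all function names are assumptions made for illustration; the abstract only states that a single operator is used for the blocks.

```python
import random

BLOCK = 16  # block size stated in the abstract


def split_blocks(frame, block=BLOCK):
    # Split a frame (list of rows) into non-overlapping block x block tiles,
    # each flattened row by row into a vector of block * block samples.
    rows, cols = len(frame), len(frame[0])
    blocks = []
    for r in range(0, rows - block + 1, block):
        for c in range(0, cols - block + 1, block):
            blocks.append([frame[r + i][c + j]
                           for i in range(block) for j in range(block)])
    return blocks


def make_operator(m, n, seed=0):
    # Hypothetical m x n Gaussian measurement matrix (an assumed choice).
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) / m ** 0.5 for _ in range(n)]
            for _ in range(m)]


def sample_blocks(blocks, phi):
    # Compute y = phi * x for every block x, using the same operator phi.
    return [[sum(a * x for a, x in zip(row, blk)) for row in phi]
            for blk in blocks]
```

With `m = 64` measurements per 256-sample block, each block is represented by a quarter as many numbers as it has pixels; reconstruction (ADM or TV minimization in the abstract) is not shown here.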
Content-based retrieval by multiple image examples for sign board retrieval
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702447
A. Yoshitaka, Terumasa Hyoudou
Abstract: In image database retrieval, one promising approach is to retrieve images by specifying an example image. However, a single example image is not always sufficient to get a satisfactory result, since one example does not give comprehensive ranges of values that reflect the various aspects of the object to be retrieved. In this paper, we propose a method of retrieving images by specifying multiple example images, designed for retrieving sign boards. Features of color, shape, and the spatial relation of color regions are extracted from the example images, and they are clustered so as to obtain proper ranges of values. Compared with query-by-example (QBE) systems that accept only a single image as the query condition, MIERS (Multi-Image Example-based Retrieval System) returns better retrieval results; the experimental results showed that specifying more examples helps to improve recall with little deterioration of precision.
Citations: 0
A hierarchical variable-sized block transform coding scheme for coding efficiency improvement on H.264/AVC
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702553
Bumshik Lee, Jae-il Kim, Sangsoo Ahn, Munchurl Kim, Hui-Yong Kim, Jong-Ho Kim, J. Choi
Abstract: In this paper, a rate-distortion optimized variable block transform coding scheme is proposed based on a hierarchically structured transform for macroblock (MB) coding, with a set of the order-4 and order-8 integer cosine transform (ICT) kernels of H.264/AVC as well as a new order-16 ICT kernel. The set of order-4, order-8 and order-16 ICT kernels is applied for inter-predictive coding in square (4×4, 8×8 or 16×16) or non-square (16×8 or 8×16) transforms for each MB in a hierarchically structured manner. The proposed hierarchical variable-sized block transform scheme using the order-16 ICT kernel achieves a significant bitrate reduction of up to 15% compared to the High profile of H.264/AVC. Even though the number of candidate transform types increases, the encoding time can be reduced by an average of 4–6% relative to H.264/AVC.
Citations: 2
Multiscale recurrent pattern matching approach for depth map coding
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702490
Danilo B. Graziosi, Nuno M. M. Rodrigues, C. Pagliari, E. Silva, S. Faria, Marcelo M. Perez, M. Carvalho
Abstract: In this article we propose to compress depth maps using a coding scheme based on multiscale recurrent pattern matching and evaluate its impact on depth image based rendering (DIBR). Depth maps are usually converted into gray scale images and compressed like a conventional luminance signal. However, using traditional transform-based encoders to compress depth maps may result in undesired artifacts at sharp edges due to the quantization of high frequency coefficients. The Multidimensional Multiscale Parser (MMP) is a pattern matching-based encoder that is able to preserve and efficiently encode high frequency patterns, such as edge information. This ability is critical for encoding depth map images. Experimental results for encoding depth maps show that MMP is much more efficient in a rate-distortion sense than standard image compression techniques such as JPEG2000 or H.264/AVC. In addition, the depth maps compressed with MMP generate reconstructed views of higher quality than those of all the other tested compression algorithms.
Citations: 9
An epipolar resticted inter-mode selection for stereoscopic video encoding
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702502
Guolei Yang, Luhong Liang, Wen Gao
Abstract: Fast stereoscopic video encoding has become a highly desired technique because stereoscopic video is now realizable for applications like TV broadcasting and consumer electronics. Stereoscopic video has high inter-view dependency subject to the epipolar restriction, which can be used to reduce the encoding complexity. In this paper, we propose a fast inter-prediction mode selection algorithm for stereoscopic video encoding. Unlike methods using disparity estimation, candidate modes are generated by sliding a window along the macroblock line restricted by the epipolar constraint. Then the motion information is utilized to rectify the candidate modes. A selection failure handling algorithm is also proposed to preserve coding quality. The proposed algorithm is evaluated using independent H.264/AVC encoders for the left and right views and can be extended to MVC. Experimental results show that the encoding times of one view are reduced by 41.4% and 24.4% for HD and VGA videos, respectively, with little quality loss.
Citations: 3
Two-dimensional Chebyshev polynomials for image fusion
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702526
Z. Omar, N. Mitianoudis, T. Stathaki
Abstract: This report documents in detail the research carried out by the author throughout his first year. The paper presents a novel method for fusing images in a domain concerning multiple sensors and modalities. Using Chebyshev polynomials as basis functions, the image is decomposed to perform fusion at the feature level. Results show favourable performance compared to previous efforts on image fusion, namely ICA and DT-CWT, in noise-affected images. The work presented here aims at providing a novel framework for future studies in image analysis and may introduce innovations in the fields of surveillance, medical imaging and remote sensing.
Citations: 14
Avoidance of singular point in reversible KLT
28th Picture Coding Symposium Pub Date: 2010-12-01 DOI: 10.1109/PCS.2010.5702435
M. Iwahashi, H. Kiya
Abstract: In this report, permutations of the order and sign of signals are introduced to avoid the singular point problem of a reversible transform. When a transform is implemented in the lifting structure, it can be "reversible" in spite of the rounding operations inside the transform. It has therefore been applied to lossless coding of digital signals. However, some coefficient values of the transform have singular points (SP). Around an SP, rounding errors are greatly magnified and the coding efficiency decreases. In this report, we analyze the SP of a three-point KLT for the RGB color components of an image signal, and introduce permutations of the order and sign of signals to avoid the SP problem. It was experimentally confirmed that the proposed method improved PSNR by approximately 15 dB compared to the worst case.
Citations: 2
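The reversibility claim in this abstract can be seen in a minimal sketch. It uses a two-point rotation factored into three lifting steps, not the paper's three-point KLT, so it is only an illustration of the principle. The lifting coefficient p = (cos θ − 1)/sin θ = −tan(θ/2) grows without bound as θ approaches π, which is one example of the kind of singular point the abstract refers to. Function names are hypothetical.

```python
import math


def lifting_coeffs(theta):
    # Rotation by theta as three lifting steps with coefficients p, s, p,
    # where p = (cos(theta) - 1) / sin(theta) = -tan(theta / 2), s = sin(theta).
    return -math.tan(theta / 2.0), math.sin(theta)


def rnd(value):
    # Rounding inside each lifting step (round half up).
    return math.floor(value + 0.5)


def forward(x, y, theta):
    # Integer-to-integer approximation of the rotation.
    p, s = lifting_coeffs(theta)
    x = x + rnd(p * y)
    y = y + rnd(s * x)
    x = x + rnd(p * y)
    return x, y


def inverse(x, y, theta):
    # Undo the steps in reverse order; each rounded term is recomputed
    # from the same integer input, so it cancels exactly.
    p, s = lifting_coeffs(theta)
    x = x - rnd(p * y)
    y = y - rnd(s * x)
    x = x - rnd(p * y)
    return x, y
```

For a moderate angle such as θ = 0.7 the coefficient p stays below 1 in magnitude, while for θ = 3.1 it exceeds 10, so each rounding error is amplified far more in the output even though the transform remains exactly invertible.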