F. Pan, Xiao Lin, S. Rahardja, K. Lim, Zhengguo Li, Dajun Wu, Si Wu, C. All, W. Ye, Z. Liang
{"title":"Fast intra mode decision algorithm for H.264/AVC video coding","authors":"F. Pan, Xiao Lin, S. Rahardja, K. Lim, Zhengguo Li, Dajun Wu, Si Wu, C. All, W. Ye, Z. Liang","doi":"10.1109/ICIP.2004.1419414","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1419414","url":null,"abstract":"The emerging H.264-AVC video coding standard aims to significantly improve compression performance compared to all existing video coding standards. In order to achieve this, a robust rate-distortion optimization (RDO) technique is employed to select the best coding mode and reference frame for each macroblock. As a result, the complexity and computation load increase drastically. This paper presents a fast mode decision algorithm for H.264 intra prediction based on local edge information. Prior to intra prediction, an edge map is created and a local edge direction histogram is then established for each sub-block. Based on the distribution of the edge direction histogram, only a small part of intra prediction modes are chosen for RDO calculation. Experimental results show that the last intra mode decision scheme increases the speed of intra coding significantly with negligible loss of PSNR.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"158 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125655903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Processing of wavelet transform data for improved image compression","authors":"T. Muzaffar, T. Choi","doi":"10.1109/ICIP.2004.1419486","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1419486","url":null,"abstract":"This paper presents a novel preprocessing technique of wavelet transform data to link all the significant coefficients together, in order to facilitate the image coding algorithms for increased compression. The proposed algorithm finds the isolated significant coefficients in the wavelet transformed data for current threshold value. All the insignificant coefficients in the wavelet tree that lies above that isolated significant coefficient (i.e., its insignificant parents) are changed to significant coefficients and their location is saved, they are then treated just like other significant coefficients. Coding algorithms have been modified to accommodate the processed output. In the end, algorithm indicates converted coordinates that are used to change the value back to original during reconstruction. Noticeably high compression ratio is achieved for most of the images, when the proposed method is used with the modified image codecs.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115149805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Han, Xiaoyan Sun, Feng Wu, Shipeng Li, Zhaoyang Lu
{"title":"Variable block-size transform and entropy coding at the enhancement layer of FGS","authors":"J. Han, Xiaoyan Sun, Feng Wu, Shipeng Li, Zhaoyang Lu","doi":"10.1109/ICIP.2004.1418795","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1418795","url":null,"abstract":"This paper proposes a variable block-size transform and context-based entropy coding techniques for the enhancement layer of FGS (fine granularity scalable) video coding. First, the variable block-size transform is introduced into the enhancement layer to improve the performance of FGS in terms of both visual quality and PSNR. Different from that used in the traditional single layer coding, an R-D selection algorithm is proposed to optimally decide the transform size of each block, under consideration of consistent performance at a range of bit rates. Furthermore, to fully take advantage of the characteristics and correlations of symbols coded in the FGS enhancement layer, different context models are designed for the arithmetic coding according to symbol type and transform size. Experimental results show that the coding efficiency of FGS can be increased by 0.2-0.90 dB with the proposed techniques.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115042758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detection and tracking of moving objects in image sequences with varying illumination","authors":"Min Xu, R. Niu, P. Varshney","doi":"10.1109/ICIP.2004.1421634","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421634","url":null,"abstract":"Change detection is known to be a significant and difficult research problem in automated surveillance systems. In this paper, we propose a new change detection approach based on the least squares method, which is robust to changes in illumination and shadow conditions. This new approach is employed to design our detection and tracking system that is shown to successfully detect a moving object in a complex outdoor environment.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127265643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Layered unequal loss protection with pre-interleaving for progressive image transmission over packet loss channels","authors":"Jianfei Cai, Xiangjun Li, C. Chen","doi":"10.1109/ICIP.2004.1421619","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421619","url":null,"abstract":"Most existing ULP (unequal loss protection) schemes do not consider the minimum quality requirement and usually have high computation complexity. Previously, we proposed a layered ULP (L-ULP) scheme to solve the mentioned problems at the cost of performance degradation. In this paper, we propose to combine the L-ULP with the preinterleaving, which is able to delay the occurrence of the first unrecoverable loss in the source data bitstream while still keeping the original priorities among different layers. Experimental results show that the proposed joint L-ULP and pre-interleaving scheme is able to achieve as good performance as that of the ULP while the complexity is much lower.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127345351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A 2-level domain decomposition algorithm for inverse diffuse optical tomography","authors":"Il-Young Son, M. Guven, Xavier Intes, B. Yazıcı","doi":"10.1109/ICIP.2004.1421823","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421823","url":null,"abstract":"In this paper, we explore domain decomposition algorithms for the inverse DOT problem in order to reduce the computational complexity and accelerate the convergence of the optical image reconstruction. We propose a combination of a two-level multigrid algorithm with a modified multiplicative Schwarz algorithm, where a conjugate gradient is used as an accelerator to solve each sub-problem formulated on each of the partitioned sub-domains. For our experiments, simulated phantom configuration with two rectangular inclusions is used as a testbed to measure the computational efficiency of our algorithms. No a priori information about the configuration is assumed except for the source and detector locations. For the application of our modified Schwarz algorithm alone, we observe an increase in efficiency of 100% as compared to the conjugate gradient solution obtained for the full domain. With the addition of the coarse grid, this efficiency rises to 400%. The coarse grid also serves to improve the overall appearance of the reconstructed image at the boundaries of the inclusions.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"354 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133029205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image scale and rotation from the phase-only bispectrum","authors":"J. Heikkilä","doi":"10.1109/ICIP.2004.1421420","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421420","url":null,"abstract":"This paper deals with the problem of aligning two images under translation, rotation and sealing. The method described utilizes the shift invariance property of the bispectrum to eliminate the effect of the translation component. Only the phase information is preserved from the bispectrum in order to achieve better resilience against nonuniform illumination changes. The scale and the rotation parameters are estimated from the remaining log-polar sampled spectrum using cross-correlation. The examples shown in the paper indicate that the method is quite robust against background clutter and occlusions.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123572613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Bayesian framework for recursive object removal in movie post-production","authors":"A. Kokaram, B. Collis, Simon Robinson","doi":"10.1109/ICIP.2003.1247118","DOIUrl":"https://doi.org/10.1109/ICIP.2003.1247118","url":null,"abstract":"Some of the most convincing film and video effects are created in digital post-production by removing apparatus that supports or manipulates actors and objects. Wires and people, for instance, can be removed by digitally painting them out of the scene provided some 'clean plate' image is available for pasting in the missing regions. This paper addresses the problem when no such plate is available. Object removal requires the estimation of the motion of the hidden material and then the reconstruction of the missing image data. Using the notion of temporal motion smoothness, this paper articulates the two problems using a Bayesian framework and so develops a unique tool for automated object removal. The tool is currently being tested in the film effects industry and initial feedback is very positive.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124121250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast view interpolation of stereo images using image gradient and disparity triangulation","authors":"J. Park, H. W. Park","doi":"10.1109/ICIP.2003.1246978","DOIUrl":"https://doi.org/10.1109/ICIP.2003.1246978","url":null,"abstract":"The paper proposes a fast view interpolation method based on image gradient and disparity triangulation. The image gradient is used to select the node points for triangulation and each node point is evaluated by its matching errors and cross correspondence of the disparity values. To model the abrupt changes of disparity on the object boundaries, new node points are added along the image gradient direction. In addition, some node points are removed by consideration of unreliable matching conditions. To construct the intermediate-view images, Delaunay triangulation and image warping are performed. The experimental results show that the proposed algorithm is fast and overcomes the drawbacks of previous methods.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"EC-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126553112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Highly scalable video compression using a lifting-based 3D wavelet transform with deformable mesh motion compensation","authors":"Andrew Secker, D. Taubman","doi":"10.1109/ICIP.2002.1039080","DOIUrl":"https://doi.org/10.1109/ICIP.2002.1039080","url":null,"abstract":"This paper continues the development of a new framework for the construction of motion-compensated wavelet transforms for highly scalable video compression. The current authors recently proposed a motion adaptive wavelet transform based on motion-compensated lifting steps. This approach overcomes several limitations of existing methods. In particular, frame warping and block displacement methods cannot efficiently exploit complex motion without sacrificing invertibility. By contrast, the motion-compensated lifting transform remains invertible regardless of the motion model. The previous work was primarily in the context of a block motion model. However, block motion models inevitably yield discontinuous motion fields, which poorly represent complex motion in real video sequences. In this paper we consider the benefits of a continuous motion field, by incorporating a deformable mesh motion model into the existing framework. Experimental results show that this leads to improved compression performance. In addition, we show that the invertibility of continuous motion fields allows greater potential for compactly representing the motion information.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125690522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}