{"title":"A fluid model for error propagation characterization in video coding","authors":"Xiaoming Sun, C.-C. Jay Kuo","doi":"10.1109/ICIP.2004.1418719","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1418719","url":null,"abstract":"An error corruption model (ECM) to describe the interframe error propagation phenomenon in a motion-compensated predictive video codec using fluid flow characteristics is proposed. First, we derive a diffusion differential equation and discuss its solution, which captures the damping effect of error propagation. Then, we propose a tracking quadrilateral (TQ) mechanism to capture the shaping and drilling effects of error propagation. Finally, we integrate these building blocks to form an adaptive fluid-based ECM (F-ECM). The accuracy of the proposed F-ECM is verified by experimental results.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124592767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimal multiresolution polygonal approximation","authors":"Alexander Kolesnikov, P. Fränti","doi":"10.1109/ICIP.2004.1421753","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421753","url":null,"abstract":"We propose optimal and near-optimal algorithms for multiresolution polygonal approximation of digital curves. The solution with the minimum number of segments is constructed as the shortest path in a weighted graph where the weights are recursively defined as the number of segments of all embedded layers.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124751942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semantic-based traffic video retrieval using activity pattern analysis","authors":"Dan Xie, Weiming Hu, T. Tan, Junyi Peng","doi":"10.1109/ICIP.2004.1418849","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1418849","url":null,"abstract":"A semantic based retrieval framework for traffic video sequences is proposed. In order to estimate the low-level motion data, a cluster tracking algorithm is developed. A novel hierarchical self-organizing map is applied to learn the activity patterns. By using activity pattern analysis and semantic concepts assignment, a set of activity models is generated, which is used as the indexing key for accessing video clips and individual vehicles in the semantic level. The proposed retrieval framework supports various queries including query by keywords, query by sketch and multiple object queries.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129687484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Edge detection based on decision-level information fusion and its application in hybrid image filtering","authors":"Jia Li, Xiaojun Jing","doi":"10.1109/ICIP.2004.1418737","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1418737","url":null,"abstract":"A new edge detection method, based on decision-level information fusion, is proposed to classify image pixels into edge and non-edge categories. Traditional edge detection algorithms make the detection decision under a single criterion, which may perform inefficiently with a change of noise model. We use fusion entropy as a criterion to integrate decisions from different classifiers in order to improve the edge detection accuracy. The proposed decision fusion based edge detection method is applied to image filtering and leads to a weighted hybrid-filtering algorithm. Simulation results show that the new edge detection method has better performance than the single criterion edge detection methods.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130303138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Annoyance of spatio-temporal artifacts in segmentation quality assessment [video sequences]","authors":"E. Gelasca, T. Ebrahimi, Mylène C. Q. Farias, M. Carli, S. Mitra","doi":"10.1109/ICIP.2004.1418761","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1418761","url":null,"abstract":"This paper describes the results of a series of subjective experiments that investigated the annoyance caused by the most common artifacts present in segmented video sequences. Various types of artifacts were inserted into a reference segmented video, considered as ideal, and shown to our test subjects. The artifacts varied in their location, size, appearance and duration. The annoyance of segmentation artifacts is found to be tied to their intrinsic characteristics (e.g., size, position) but only weakly related to the video content. The results identify the characteristics that should be taken into account in the design of a perceptually driven objective metric.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"346 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126678071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On decoder-latency versus performance tradeoffs in differential predictive coding","authors":"P. Ishwar, K. Ramchandran","doi":"10.1109/ICIP.2004.1419494","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1419494","url":null,"abstract":"Theoretical analysis of differential predictive coding (DPC) has almost exclusively focused on scalar quantizers and the high-rate regime for tractability reasons. As a result, the role of noncausal decoding in improving the quality has been largely ignored in the literature. In this work we conduct a rigorous performance analysis of DPC-based schemes under a simple independent, vector-Gaussian, AR-1 source model and large-block (as opposed to high-rate) asymptotics. This analysis reveals that noncausal decoding can offer a significant relative improvement in the mean squared error (by as much as 3 dB) at medium to low rates (0.1-0.5 bit per sample) for sources having strong temporal correlation. Furthermore, most of this relative improvement can be attained with a modest decoder-latency. At very high and very low rates, the gains are negligible.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126799568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image segmentation by cooperative optimization","authors":"Xiaofei Huang","doi":"10.1109/ICIP.2004.1419456","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1419456","url":null,"abstract":"This paper presents the application of a new cooperative optimization algorithm for image segmentation. In our experiments, it significantly outperforms graph cuts, an emerging powerful optimization algorithm for image processing and computer vision. Compared to graph cuts, it is 10 times faster, much less restrictive on energy function forms, has an error rate two to three times smaller and does not need extra memory, while graph cuts allocated 22 Mbytes more for a 384×288 image. Its operations are simple and fully parallel, and can be implemented in a system of agents (e.g., neurons). Also, it has a solid theoretical foundation on its computational properties.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129294975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Joint dense 3D interpretation and multiple motion segmentation of temporal image sequences: a variational framework with active curve evolution and level sets","authors":"H. Sekkati, A. Mitiche","doi":"10.1109/ICIP.2004.1418814","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1418814","url":null,"abstract":"The aim of this study is to introduce a novel method for the simultaneous motion segmentation and dense 3D interpretation of temporal sequences of monocular images. The problem is to recover simultaneously 3D structure, 3D motion, and a motion-based segmentation from the image sequence spatio-temporal variations. Motion in space is considered relative to the viewing system so that both the viewing system and environmental objects are allowed to move. The problem is stated as a 3D motion segmentation problem with simultaneous depth estimation within the regions of segmentation. The Euler-Lagrange equations of minimization of the objective functional lead to curve evolution PDE implemented via level sets.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121296802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiscale surface representation and rendering for point clouds","authors":"Sang-Btan Park, Sang Uk Lee, Hyeokho Choi","doi":"10.1109/ICIP.2004.1421459","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421459","url":null,"abstract":"We introduce a new multiscale geometry representation based on tree-structured piecewise plane approximation of 3-D surface geometry. Dyadic division and planar approximation of points within each dyadic cube result in a tree-structured multiscale representation of the 3-D surface described by point cloud data. We then perform a complexity-regularized tree pruning to obtain a compact representation of the surface geometry. Based on its adaptivity and multiscale structure, the proposed representation scheme provides a desirable framework for efficient geometry modelling, fast processing, and geometry coding. We apply the proposed geometry models to point cloud rendering to demonstrate the superiority of our efficient geometry representation.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116195662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Motion estimation from motion smear - a system identification approach","authors":"O. J. Omer, Sameer Kumar, Rajeev Bajpai, K. Venkatesh, Sumana Gupta","doi":"10.1109/ICIP.2004.1421438","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421438","url":null,"abstract":"Motion smear, which arises because of fast motion relative to the shutter time of a camera, is generally considered as an artifact. Little work has been done to use motion smear as a visual cue for motion estimation or image restoration. Here, we present a new approach to estimate motion from two successive frames of smeared images. The blurring system is modeled as temporal integration of instantaneous images and has been estimated using system identification theory. Motion parameters have been extracted from the estimated system. As compared to earlier approaches having a similar objective, no edge detection or optical flow analysis is required. Our approach establishes a trade off between signal to noise ratio (SNR) and computational complexity. Highly accurate results have been observed with SNR as low as 12 dB. Experimental results with both simulated and real images are shown.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116218798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}