{"title":"Layered unequal loss protection with preinterleaving for progressive image transmission over packet loss channels","authors":"Jianfei Cai, Xiangjun Li, C. Chen","doi":"10.1145/1111604.1111606","DOIUrl":"https://doi.org/10.1145/1111604.1111606","url":null,"abstract":"Most existing ULP (unequal loss protection) schemes do not consider the minimum quality requirement and usually have high computation complexity. Previously, we proposed a layered ULP (L-ULP) scheme to solve the mentioned problems at the cost of performance degradation. In this paper, we propose to combine the L-ULP with the preinterleaving, which is able to delay the occurrence of the first unrecoverable loss in the source data bitstream while still keeping the original priorities among different layers. Experimental results show that the proposed joint L-ULP and pre-interleaving scheme is able to achieve as good performance as that of the ULP while the complexity is much lower.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"490 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116170189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Filter bank selection for the ownership verification of wavelet based digital image watermarking","authors":"Min-Jen Tsai","doi":"10.1109/ICIP.2004.1421848","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421848","url":null,"abstract":"Wavelet transform has been widely used in many signal processing applications. Advanced researches have intensively investigated the characteristics of wavelet filters, adaptive wavelet decomposition structure for coding optimization and different combination of mathematical operations for digital image watermarking implementations. Among these studies, there are researches shown that the combination of wavelet filters or filter bank decomposition structure can be implemented as the function of the keys for the watermarking. Even this property provides the flexibility for practical usage, the false alarm of the rightful ownership verification is essentially existed since the resolution requirement for the transformed coefficients can be less demanded than other applications like compression or encryption. Therefore, this study has investigated and discussed the issues and proposed a solution which utilizes the distinguishing index as the selection criterion for wavelet based digital image watermarking in order to increase the difficulty of guessing the right filter banks, in the mean time, to reduce the probability of the misjudgment.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123742072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Junqing Chen, T. Pappas, A. Mojsilovic, B. Rogowitz
{"title":"Perceptually-tuned multiscale color-texture segmentation","authors":"Junqing Chen, T. Pappas, A. Mojsilovic, B. Rogowitz","doi":"10.1109/ICIP.2004.1419450","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1419450","url":null,"abstract":"We present a perceptually-tuned multiscale image segmentation algorithm that is based on spatially adaptive color and texture features. The proposed algorithm extends a previously proposed approach to include multiple texture scales. The determination of the multiscale texture features is based on perceptual considerations. We also examine the perceptual tuning of the algorithm and how it is affected by the presence of different texture scales. The multiscale extension is necessary for segmenting higher resolution images and is particularly effective in segmenting objects shown in different perspectives. The performance of the proposed algorithm is demonstrated in the domain of photographic images.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128944821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Win-bin Huang, Wei-Chen Chang, Yen-Wei Lu, A. Su, Y. Kuo
{"title":"Halftone/contone conversion using neural networks","authors":"Win-bin Huang, Wei-Chen Chang, Yen-Wei Lu, A. Su, Y. Kuo","doi":"10.1109/ICIP.2004.1421882","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421882","url":null,"abstract":"A novel neural network based method for halftoning and inverse halftoning of digital images is presented. We first start from inverse half-toning of images produced from error diffusion methods using an RBF network plus an MLP network. The restored contone images have had good quality already. Then, an SLP neural network is used to refine the halftoning processing and the training process of the inverse half-toning network is also involved. The combined training procedure produces half-tone images and the corresponding continuous tone images at the same time. It is found that these contone images have even better PSNR performance. Furthermore, the resulted half-tone images are visually sharper and clearer, too. The proposed inverse half-toning method is also compared to the well-known LUT method.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133347154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel visual distortion sensitivity analysis for video encoder bit allocation","authors":"Chih-Wei Tang, Ching-Ho Chen, Ya-Hui Yu, Chun-Jen Tsai","doi":"10.1109/ICIP.2004.1421800","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421800","url":null,"abstract":"A novel video bit allocation technique adopting a visual distortion sensitivity model for better rate-visual distortion coding control is proposed in this paper. Instead of applying complicated semantic understanding, the proposed automatic distortion sensitivity analysis process analyzes both the motion and the texture structures in the video sequences in order to achieve better bit allocation for rate-constrained video coding. This analysis evaluates the tolerable perceptual distortions on a macroblock basis, and allocates fewer bits to regions permitting large perceptual distortions for rate reduction. The proposed algorithm can be incorporated into any existing video coding rate control schemes to achieve same visual quality at greatly reduced bitrate. Experiments based on H.264 show that this technique achieves bit-rate saving of up to 40% with no perceptual quality degradations. The experiments also demonstrate the inadequacy of using PSNR as a distortion measure in a video coding framework.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127909443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
F. Verdicchio, Y. Andreopoulos, T. Clerckx, J. Barbarien, A. Munteanu, J. Cornelis, P. Schelkens
{"title":"Scalable video coding based on motion-compensated temporal filtering: complexity and functionality analysis","authors":"F. Verdicchio, Y. Andreopoulos, T. Clerckx, J. Barbarien, A. Munteanu, J. Cornelis, P. Schelkens","doi":"10.1109/ICIP.2004.1421705","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421705","url":null,"abstract":"Video coding techniques yielding state-of-the-art compression performance require large amount of computational resources, hence practical implementations, which target a broad market, often tend to trade-off coding efficiency and flexibility for reduced complexity. Scalable video coding instead, not only provides seamless adaptation to bit-rate variation, but also allows the end user to trim down the resources he needs to perform real-time decoding by limiting the process to a subset of the original content. Hence, by choosing the quality, frame-rate and/or resolution of the reconstructed sequence, each decoder can meet its hardware limitations without affecting the encoding process of the media provider. This paper proposes a preliminary analysis of the memory-access behavior of a fully scalable video decoder and investigates the capability of selecting the operational settings in order to adapt to the available hardware resources on the target device.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126629856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimal sensor selection for video-based target tracking in a wireless sensor network","authors":"P. Pahalawatta, T. Pappas, A. Katsaggelos","doi":"10.1109/ICIP.2004.1421762","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421762","url":null,"abstract":"The use of wireless sensor networks for target tracking is an active area of research. Imaging sensors that obtain video-rate images of a scene can have a significant impact in such networks, as they can measure vital information on the identity, position, and velocity of moving targets. Since wireless networks must operate under stringent energy constraints, it is important to identify the optimal set of imagers to be used in a tracking scenario such that the network lifetime is maximized. We formulate this problem as one of maximizing the information utility gained from a set of sensors subject to a constraint on the average energy consumption in the network. We use an unscented Kalman filter framework to solve the tracking and data fusion problem with multiple imaging sensors in a computationally efficient manner, and use a lookahead algorithm to optimize the sensor selection based on the predicted trajectory of the target. Simulation results show the effectiveness of this method of sensor selection.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116084290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"QIM watermarking games","authors":"A. Goteti, P. Moulin","doi":"10.1109/ICIP.2004.1419398","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1419398","url":null,"abstract":"Quantization Index Modulation (QIM) methods are widely used for blind data embedding and watermarking. Given a QIM watermarking code, we ask what is the attacker's noise distribution that maximizes probability of error of the detector. For memoryless attacks, the problem is reduced to a convex programming problem. Next, we derive QIM code parameters that are minmax optimal.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"574 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123167203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Soyak, Y. Eisenberg, F. Zhai, R. Berry, T. Pappas, A. Katsaggelos
{"title":"Channel modeling and its effect on the end-to-end distortion in wireless video communications","authors":"E. Soyak, Y. Eisenberg, F. Zhai, R. Berry, T. Pappas, A. Katsaggelos","doi":"10.1109/ICIP.2004.1421807","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421807","url":null,"abstract":"A major limitation faced by a mobile user is their dependence on a limited battery supply. For wireless video communications, joint source coding and transmission power management (JSCPM) has recently been considered as a means of efficiently allocating transmission energy. In order to reduce complexity, the design of many of these adaptive resource allocation algorithms utilizes simplified channel models that do not account for the burstiness of the channel. We analyze the effects of such channel model simplifications on the end-to-end distortion. We present a channel model that is based on information theoretic considerations, which captures the bursty nature of wireless channels and accounts for packet lengths when calculating the probability of loss. Given the source coding and transmission parameters derived using a simplified channel model, our goal is to analyze how the end-to-end distortion is affected when a more realistic complex channel model is used to simulate losses. Experimental results suggest that the performance gain predictions for JSCPM using a simpler channel model are also valid when more sophisticated channel simulations are used, provided that a number of additional steps are taken after the optimization to account for the complex characteristics of wireless channels.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126766341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Haohong Wang, F. Zhai, Y. Eisenberg, A. Katsaggelos
{"title":"Optimal object-based video communications over differentiated services networks","authors":"Haohong Wang, F. Zhai, Y. Eisenberg, A. Katsaggelos","doi":"10.1109/ICIP.2004.1421808","DOIUrl":"https://doi.org/10.1109/ICIP.2004.1421808","url":null,"abstract":"In this paper, we propose an optimal unequal error protection scheme for object-based video communications over differentiated services networks. Our goal is to achieve the best video quality (minimum total expected distortion) with constraints on transmission cost and delay. An end-to-end distortion estimation approach for object-based video is proposed, which can be used for different packetization schemes. The problem is solved using Lagrangian relaxation and dynamic programming. Experimental results indicate that the proposed unequal error protection schemes can significantly outperform equal error protection methods.","PeriodicalId":184798,"journal":{"name":"2004 International Conference on Image Processing, 2004. ICIP '04.","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122611119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}