{"title":"QoS based video delivery with foveation","authors":"I. Cheng, A. Basu","doi":"10.1109/ICIP.2001.959213","DOIUrl":"https://doi.org/10.1109/ICIP.2001.959213","url":null,"abstract":"Spatially varying sensing (foveation) was first used as a means for image compression in our past research. We extend previous work to address the advantages of foveation in improving the performance of MPEG compression over bandwidth limited channels, such as the Internet. Unlike other approaches to foveating MPEG which used multiresolution representations, we use continuously spatially varying resolution and demonstrate that this approach is indeed advantageous over others. Two parameters, scaling and distortion, are used to allow us to adapt MPEG video to various compression ratios. Experimental results are presented, and can be viewed on the Web, validating our approach.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124743873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Indoor vs outdoor classification of consumer photographs using low-level and semantic features","authors":"Jiebo Luo, A. Savakis","doi":"10.1109/ICIP.2001.958601","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958601","url":null,"abstract":"Scene categorization to indoor vs outdoor may be approached by using low-level features for inferring high-level information about the image. Low-level features such as color and texture have been used extensively in image understanding research, however, they cannot solve the problem completely. We propose the use of a Bayesian network for integrating knowledge from low-level and semantic features for indoor vs outdoor classification of images. Using ground truth data for sky and grass detection, we demonstrate that the classification performance can be significantly improved when semantic features are employed in the classification process.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130483840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Malassiotis, F. Tsalakanidou, N. Mavridis, V. Giagourta, N. Grammalidis, M. Strintzis
{"title":"A face and gesture recognition system based on an active stereo sensor","authors":"S. Malassiotis, F. Tsalakanidou, N. Mavridis, V. Giagourta, N. Grammalidis, M. Strintzis","doi":"10.1109/ICIP.2001.958283","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958283","url":null,"abstract":"The paper presents several novel 3D image analysis algorithms, applied towards the segmentation and modeling of faces and hands. These are subsequently used to build a face-based authentication system and a system for human-computer interaction based on static and dynamic gestures. The system relies on an active stereo sensor that uses a structured light approach to obtain 3D information. In this paper we demonstrate how the use of 3D information may significantly improve the efficiency of traditional face and gesture recognition techniques that use 2D images only.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126844262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mediated morphological filters","authors":"M. Sedaaghi, R. Daj, M. Khosravi","doi":"10.1109/ICIP.2001.958213","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958213","url":null,"abstract":"This paper presents new morphological operators based on a special combination of median filtering and classical gray-scale morphological operators. The newly introduced operators demonstrate a superb performance compared with classical ones and even weighted morphological operators (Sedaaghi and Wu (1998)) introduced by the author, for signals/images buried in speckle, salt and pepper, and Gaussian noise. Their efficiency is comparable with convolutional filtering for speckle and Gaussian noise removal, where classical morphological operators fail to be applicable. The proposed algorithms have been applied for off-line biomedical signal/image processing successfully.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129035868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Texture classification by means of HMM modeling of AM-FM features","authors":"E. Salles, L. Ling","doi":"10.1109/ICIP.2001.958081","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958081","url":null,"abstract":"This paper studies the classification problem of non-rotated and rotated textures digitized from the Phil Brodatz Album. The proposed texture analysis technique is based on AM-FM characterization followed by HMM modeling. The detection of AM-FM features was performed via a Gabor filter bank presented in a multiresolution way. To solve the problem of texture rotation, a technique was applied to correct the inherent orientation. In both cases, rotated and non-rotated textures, a low order feature vector was obtained from instantaneous AM-FM 2D maps. The proposed method was tested extensively and compared with some well-known approaches in the literature.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123855554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Out-lier removal algorithm for model-based coded video","authors":"J. Woods","doi":"10.1109/ICIP.2001.958063","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958063","url":null,"abstract":"Model-based coding extracts texture from images of objects and projects them onto computer models of the same objects. Low bit rate transmission is achieved by approximating the shape and movement of the objects and relaying the parameters to the decoder. To estimate the motion of the models a number of features must be accurately tracked inter-frame. The proposed algorithm employs a large displacement vector set, and then proceeds to iteratively remove points exhibiting large disagreement between the 2D translational field and a projection of the estimated 3D rotation. Subjective improvement in motion estimation and better quantitative agreement between motion fields are reported.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123975477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiple description video using rate-distortion splitting","authors":"A. Reibman, H. Jafarkhani, Yao Wang, M. Orchard","doi":"10.1109/ICIP.2001.959211","DOIUrl":"https://doi.org/10.1109/ICIP.2001.959211","url":null,"abstract":"We consider a simple multiple description (MD) video coder, that uses redundancy-rate-distortion criteria to split a one-layer stream generated by a standard video coder into two correlated streams. Our simulation results demonstrate that this MD coder has much better performance for large redundancies than our previous MDTC video coder, although it cannot perform as well at low redundancies. This MD video coder is very simple to implement and is compatible with H.263 to the extent that each description can be decoded by a standard H.263 decoder. This MD coder was used in a previous study on the transport of MD and layered video over an EGPRS wireless network, where the fact that it creates two streams with very balanced rates was a strong advantage.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123234840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the security of the SARI image authentication system","authors":"R. Radhakrishnan, N. Memon","doi":"10.1109/ICIP.2001.958287","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958287","url":null,"abstract":"We investigate the image authentication system, SARI, proposed by C.Y. Lin and S.F. Chang (see SPIE Storage and Retrieval of Image/Video Databases, 1998), that distinguishes JPEG compression from malicious manipulations. In particular, we look at the image digest component of this system. We show that if multiple images have been authenticated with the same secret key and the digests of these images are known to an attacker, Oscar, then he can cause arbitrary images to be authenticated with this same but unknown key. We show that the number of such images needed by Oscar to launch a successful attack is quite small, making the attack very practical. We then suggest possible solutions to enhance the security of this authentication system.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"28 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120907531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Missile tracking using knowledge-based adaptive thresholding","authors":"S. Haker, G. Sapiro, A. Tannenbaum, D. Washburn","doi":"10.1109/ICIP.2001.959163","DOIUrl":"https://doi.org/10.1109/ICIP.2001.959163","url":null,"abstract":"We apply a knowledge-based segmentation method developed for still and video images to the problem of tracking missiles and high speed projectiles. Since we are only interested in segmenting a portion of the missile (namely, the nose cone), we use our segmentation procedure as a method of adapting thresholding. The key idea is to utilize a priori knowledge about the objects present in the image, e.g. missile and background, introduced via Bayes' rule. Posterior probabilities obtained in this way are anisotropically smoothed, and the image segmentation is obtained via MAP classifications of the smoothed data. When segmenting sequences of images, the smoothed posterior probabilities of past frames are used as prior distributions in succeeding frames.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121166648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Color image-based angular map-driven snakes","authors":"A. Dumitras, A. Venetsanopoulos","doi":"10.1109/ICIP.2001.958970","DOIUrl":"https://doi.org/10.1109/ICIP.2001.958970","url":null,"abstract":"We propose a method for shape description of objects in color images. Our method employs angular maps to identify significant changes of color within the image, which are then used to drive snake models. Experimental results show that our angular map-driven snake method not only yields an accurate description of an object shape, but also it is computationally efficient.","PeriodicalId":291827,"journal":{"name":"Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205)","volume":"693 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116185589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}