{"title":"Scene change detection in MPEG domain","authors":"N. Gamaz, Xiaolei Huang, S. Panchanathan","doi":"10.1109/IAI.1998.666852","DOIUrl":"https://doi.org/10.1109/IAI.1998.666852","url":null,"abstract":"Video is an important and challenging media and requires sophisticated indexing schemes for efficient retrieval from visual databases. Video segmentation is a fundamental step in video indexing and involves detection of scene changes. In this paper, we propose a fast and robust algorithm for detecting video shot boundaries in the MPEG-2 compressed bitstream with minimal decoding.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123746025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-resolution vegetation classification along lower Rio Grande","authors":"L. Zhang, M. Desai, R. Lonard, F. Judd","doi":"10.1109/IAI.1998.666863","DOIUrl":"https://doi.org/10.1109/IAI.1998.666863","url":null,"abstract":"We present a quasi-wavelet based approach for multiresolution unsupervised and supervised vegetation classification of videography imagery. Due to certain limitations of traditional wavelet decomposition, we investigate a new quasi-wavelet decomposition to generate an alternative feature space for vegetation classification. Most existing wavelet-based classification approaches only consider the information on low-low frequency subspaces at each decomposed stage. However, both low and high frequency features are applied in our algorithms and are controlled by weight setting on the vector feature space. The thresholding method is used for generating the original class center to achieve a convergent result for unsupervised classification. The results on lower Rio Grande Valley videography imagery are presented.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125099522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Texture classification using combined feature sets","authors":"L. S. Ng, M. Nixon, J. Carter","doi":"10.1109/IAI.1998.666868","DOIUrl":"https://doi.org/10.1109/IAI.1998.666868","url":null,"abstract":"We consider two methods to combine texture descriptions for classification: a composite feature vector which combines data additively, and an extended k-nearest-neighbour (KNN) rule which returns a decision based on the highest confidence in features, both aimed to improve classification capability. These have been used to combine a wide range of relatively simple texture features, and have been shown to have significant advantage. Although nearly all previous approaches have used a limited subset of the Brodatz database, the new techniques have been applied to the whole Brodatz database with evaluation independent of the number of test classes used by measuring the number of perfect classes. The results of these new methods of combination show that an overall classification rate exceeding 90% can be achieved with 71 perfect classes, improving capabilities above using the measures individually.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124139835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bidimensional retinal blood vessel reconstruction by a new color edge tracking procedure","authors":"V. Rakotomalala, L. Macaire, M. Valette, P. Labalette, Y. Mouton, J. Postaire","doi":"10.1109/IAI.1998.666891","DOIUrl":"https://doi.org/10.1109/IAI.1998.666891","url":null,"abstract":"The authors present a new color edge tracking procedure in order to achieve a bidimensional reconstruction of the retinal blood vessels. The major branches of the reconstructed vessels will serve as landmarks in order to locate the retinal lesions called CytoMegaloVirus (CMV) retinitis, on the color fundus images of patients with acquired immune deficiency syndrome (AIDS). The reconstruction is based on a recursive tracking of the two vessel edges extracted by means of color edge detection. After an interactive selection of the starting edge pixels, the tracking process automatically searches for the side-branches of the major vessel being tracked, providing a tree representation of the vessel vasculature. During the tracking, contour and region information (color of the vessel body) are associated in order to get more accurate extraction.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115068081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Segmentation strategies with multiple analysis for an SMD object recognition system","authors":"A. E. Brito, E. Whittenberger, S. Cabrera","doi":"10.1109/IAI.1998.666860","DOIUrl":"https://doi.org/10.1109/IAI.1998.666860","url":null,"abstract":"We present two segmentation strategies that use multiple analysis for the preprocessing stage of an object recognition system to detect the presence of surface mounted devices (SMD) on printed circuit boards. This work concentrates only on the preprocessing stage and simple segmentation algorithms for fast real-time implementation. The system uses two images of the same scene, with top and side illuminations. One approach uses both the top- and the side-illuminated images while the other one uses only the top-illuminated image. Experiments are performed using the two model-based segmentation approaches, which produce a gray level region of interest (ROI) that has the SMD isolated as a target when it is present or a smaller area when the SMD is absent. The suppression of the copper-pads is a key step in the processing. For the first strategy, the comparison criteria used to evaluate its performance are the binary area and the energy of the ROI. For the second strategy, our evaluation is a comparison of the results with a fixed size reference ROI mask located at a centroid chosen by visual analysis of the image. Using a database of 1500 images, the distributions of the two comparison criteria are shown for three possible scenes: the SMD is present in the images, the SMD is absent but a speck of glue is present, or the SMD and the glue are both absent. Examples of the processing results are shown.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129976206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visual perception tools for natural interaction - a gaze capture and tracking system","authors":"C. Collet, R. Gherbi","doi":"10.1109/IAI.1998.666866","DOIUrl":"https://doi.org/10.1109/IAI.1998.666866","url":null,"abstract":"In this paper we present a vision system with the aim of building a perceptual tool in order to make machines aware of humans allowing natural interaction in human-machine communication. The system must satisfy interaction constraints and be non-intrusive. Such a real-time camera-based system is designed for gaze tracking from an image sequence of a human in front of a machine. Therefore we use a CCD camera, placed between the keyboard and the screen. The system detects the user's presence, locates and then tracks the face, nose and both eyes. These operations are performed by combining image processing techniques and pattern recognition methods. The tracking process is based on a prediction-verification strategy using dynamic information of detection. The system continuously adapts its recognition parameters to take into account environment variations. Some results of recognition scores are also presented.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127777694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Convex projections restoration of Hadamard naturalness-preserving transform coded images","authors":"C. Fore, R. Yarlagadda","doi":"10.1109/IAI.1998.666853","DOIUrl":"https://doi.org/10.1109/IAI.1998.666853","url":null,"abstract":"This paper investigates an image coding system using the Hadamard naturalness-preserving transform (HNPT). It focuses both on the encoding and the decoding processes. Symmetric Lloyd-Max quantizers are designed to encode the HNPT coefficients. The theory of projections onto convex sets (POCS) is used as a basis for the design of an iterative reconstruction algorithm. A constraint set is defined that imposes consistency with known values. Experimental results to date demonstrate the convergence of the restoration algorithm.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116341254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modified mean curvature motion for multispectral anisotropic diffusion","authors":"K. Pope, S. Acton","doi":"10.1109/IAI.1998.666877","DOIUrl":"https://doi.org/10.1109/IAI.1998.666877","url":null,"abstract":"This paper introduces a new anisotropic diffusion algorithm for enhancing and segmenting multispectral image data. The algorithm is based upon mean curvature motion. Using a modified image gradient computation, the diffusion method is further improved by allowing the control of feature scale, and the sensitivity to heavy-tailed noise is eliminated. For comparison, a vector distance dissimilarity method is introduced and extended for multi-scale processing. The experiments on remotely sensed imagery and color imagery demonstrate the performance of the algorithms in terms of image entropy reduction and impulse elimination as well as visual quality.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122579070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semi-automatic methods and segmentation of hyphal images","authors":"I. Inglis, A. J. Gray, C. Glasbey","doi":"10.1109/IAI.1998.666859","DOIUrl":"https://doi.org/10.1109/IAI.1998.666859","url":null,"abstract":"The underlying aim of this research is to characterise the growth and morphology of arbuscular mycorrhizal hyphal distributions (i.e. spatial distributions of hyphae from a type of fungus), in order to enable assessment of efficiency of hyphal colonisation of a growth medium relative to localised variations in nutrient availability. Patterns of distribution are likely to have a profound influence on nutrient acquisition and relocation within a medium. Although software exists to measure the hyphae (once extracted manually from an image) segmenting each hypha is inaccurate, tedious and time consuming. Hyphal segmentation is successfully tackled using three different computer-assisted approaches with varying degrees of user-input. Results are reported from an evaluation study comparing the methods both quantitatively and qualitatively; this involved several test subjects segmenting simulated images. This work forms part of a wider study to investigate a number of semi-automatic techniques for performance, flexibility and practicality of use.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132676145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wavelet-based PCA for human face recognition","authors":"P. C. Yuen, D. Dai, Guo-Can Feng","doi":"10.1109/IAI.1998.666889","DOIUrl":"https://doi.org/10.1109/IAI.1998.666889","url":null,"abstract":"This paper addresses the speed and accuracy problems in human face recognition. The wavelet transform is adopted to decompose an image and a particular frequency band is selected for principal component analysis (PCA) for face recognition. Experimental results show that the proposed method is better than the original PCA method in terms of both speed and accuracy.","PeriodicalId":373701,"journal":{"name":"1998 IEEE Southwest Symposium on Image Analysis and Interpretation (Cat. No.98EX165)","volume":"9 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121001768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}