{"title":"Robust detection of people in thermal imagery","authors":"James W. Davis, V. Sharma","doi":"10.1109/ICPR.2004.1333872","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1333872","url":null,"abstract":"We present a new contour analysis technique to detect people in thermal imagery. Background-subtraction is first used to identify local regions-of-interest. Gradient information within each region is then combined into a contour saliency map. To extract contour fragments, a watershed-based selection algorithm is used. A path-constrained A* search is employed to complete any broken contours, from which silhouettes are formed. Results using thermal video sequences demonstrate the capability of the approach to robustly detect people across a wider range of environmental conditions than is possible with standard approaches.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115233835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classification of machine-printed and handwritten addresses on Korean mail piece images using geometric features","authors":"Seung-Ick Jang, Seon-Hwa Jeong, Yun-Seok Nam","doi":"10.1109/ICPR.2004.235","DOIUrl":"https://doi.org/10.1109/ICPR.2004.235","url":null,"abstract":"We propose an effective method for classifying machine-printed and handwritten addresses on Korean mail piece images. It is of vital importance to know if an input image is machine-printed or handwritten in such applications as address reading, form processing, FAX routing, and etc., since approaches for handwritten images are developed quite differently from those for machine-printed images. Our method consists of three blocks: valid connected component grouping, feature extraction and classification. A set of features related to width and position of groups of valid connected components is used for the classification based on a multi-layer perceptrons network. The experiment done with address images extracted from Korean live mail piece images has demonstrated the superiority of the proposed method. The correct classification rate for 3,147 testing images was about 98.9%.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116770782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A system identification approach for video-based face recognition","authors":"G. Aggarwal, A. Roy-Chowdhury, R. Chellappa","doi":"10.1109/ICPR.2004.1333732","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1333732","url":null,"abstract":"The paper poses video-to-video face recognition as a dynamical system identification and classification problem. We model a moving face as a linear dynamical system whose appearance changes with pose. An autoregressive and moving average (ARMA) model is used to represent such a system. The choice of ARMA model is based on its ability to take care of the change in appearance while modeling the dynamics of pose, expression etc. Recognition is performed using the concept of sub space angles to compute distances between probe and gallery video sequences. The results obtained are very promising given the extent of pose, expression and illumination variation in the video data used for experiments.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116775448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-view face detection under complex scene based on combined SVMs","authors":"Peng Wang, Q. Ji","doi":"10.1109/ICPR.2004.1333733","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1333733","url":null,"abstract":"A single face classifier has difficulty in detecting multiview faces under real and complex scenes due to various poses, cluttering environment and small size of faces. In this paper, we propose a novel combination of SVMs to detect multi-view faces, using both cascading and bagging methods. In our method, the faces are divided into seven views. Each of them models a typical pose under complex scenes. By the modified bootstrap method applied in our method, a cascade of SVMs are constructed to quickly select face candidates from image with expected accuracy. Bagging of different SVMs can further eliminate the false detections that are difficult to handle by single SVM. Such combination of SVMs can effectively detect multi-view faces even with large rotation angles and heavy shadow. The experiment results show better accuracy and generalization performance over single classifier.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120937653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A survey of approaches to three-dimensional face recognition","authors":"K. Bowyer, K. Chang, P. Flynn","doi":"10.1109/ICPR.2004.1334126","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1334126","url":null,"abstract":"The vast majority of face recognition research has focused on the use of two-dimensional intensity images, and is covered in existing survey papers. This survey focuses on face recognition using three-dimensional data, either alone or in combination with two-dimensional intensity images. Challenges involved in developing more accurate three-dimensional face recognition are identified.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127103232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Affine propagation for surface reconstruction in wide baseline stereo","authors":"Z. Megyesi, D. Chetverikov","doi":"10.1109/ICPR.2004.1333709","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1333709","url":null,"abstract":"The problem of dense matching in wide baseline stereo is considered. It is assumed that the scene is formed by piecewise-smooth, Lambertian, textured surfaces and that rectified images of the scene are available. We present a novel dense matching algorithm that accounts for local affine distortion and propagates the best matching affine parameters on each surface until a surface discontinuity is reached. Disparity maps and projective reconstructions for real-world wide baseline stereo images are shown.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127107743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Spoken document classification with SVMs using linguistic unit weighting and probabilistic couplers","authors":"U. Iurgel, G. Rigoll","doi":"10.1109/ICPR.2004.1334347","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1334347","url":null,"abstract":"The task addressed by this paper is spoken document classification (SDC) of German TV news with support vector machines (SVMs). It shows the benefits of weighting different linguistic units when combined into one feature vector. Further experiments show that probabilistic SVMs (pSVMs) with couplers perform well on a SDC task. New couplers for multi-category classification, both for pSVMs and non-pSVMs, are discussed. They are easy to implement and show good and promising results. It turns out that using the distance instead of the decision value can be favorable. Theoretical justification is given for our approaches, and some results are explained theoretically.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127524503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An algorithm for rule generation in fuzzy expert systems","authors":"K. Dmitry, V. Dmitry","doi":"10.1109/ICPR.2004.149","DOIUrl":"https://doi.org/10.1109/ICPR.2004.149","url":null,"abstract":"Although using fuzzy logic in control systems has become widely established as an appropriate approach, its application in area of pattern recognition and data mining seems to be restricted. These systems have several bottlenecks mainly concerning fuzzy rules generation and fuzzy sets forming. The state-of-the-art technique here is neuro-fuzzy approach which has some disadvantages. In this article an algorithm is considered for rules generation based on alternative principles.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124815472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nearest intra-class space classifier for face recognition","authors":"Wei Liu, Yunhong Wang, S. Li, T. Tan","doi":"10.1109/ICPR.2004.1333819","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1333819","url":null,"abstract":"We propose a novel classification method, called nearest intra-class space (NICS), for face recognition. In our method, the distribution of face patterns of each person is represented by the intra-class space to capture all intra-class variations. Then, a regular principal subspace is derived from each intra-class space using principal component analysis. The classification is based on the nearest weighted distance, combining distance-from-subspace and distance-in-subspace, between the query face and each intra-class subspace. Experimental results show that the NICS classifier outperforms other classifiers in terms of recognition performance.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124890505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tracking of point targets in IR image sequence using multiple model based particle filtering and MRF based data association","authors":"M. Zaveri, S. Merchant, U. Desai","doi":"10.1109/ICPR.2004.1333876","DOIUrl":"https://doi.org/10.1109/ICPR.2004.1333876","url":null,"abstract":"Particle filtering is being investigated extensively due to its important feature of target tracking based on nonlinear and non-Gaussian model. It tracks a trajectory with a known model at a given time. It means that particle filter tracks an arbitrary trajectory only if the time instant when trajectory switches from one model to another model is known a priori. Because of this reason particle filter is not able to track any arbitrary trajectory where transition from one model to another model is not known. For real world application, trajectory is always random in nature and may follow more than one model. In this paper we propose a novel method, which overcomes the above problem. In the proposed method a multiple model based approach is used along with particle filtering, which automates the model selection process for tracking an arbitrary trajectory. In the proposed approach, there is no need to have a priori information about the exact model that a target may follow. For data association, Markov random field (MRF) based method has been utilized. It allows us to exploit the neighborhood concept for data association, i.e. the association of a measurement influences an association of its neighbor measurement.","PeriodicalId":335842,"journal":{"name":"Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124950303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}