{"title":"一种用于人脸检测和定位的局部特征评价","authors":"M. Asbach, Peter Hosten, M. Unger","doi":"10.1109/WIAMIS.2008.58","DOIUrl":null,"url":null,"abstract":"Local features have the ability to overcome the major drawback of traditional, holistic object detection approaches, because they are inherently invariant to geometric deformation and pose; in addition scale and rotation invariance can be easily achieved as well. However, the selection of discriminative feature locations and local descriptions is a complex task that has not been generally solved. In case of face detection, features must possess the discriminative power to differentiate between facial parts and cluttered backgrounds while they have to remain person agnostic. A multitude of suggestions for selecting facial features for tracking or identification / recognition can be found in literature, most of which rely on semi-automatic or manual definition of the feature locations. In contrast, fully automatic feature selection and generic description approaches like SIFT and SURF have been shown to provide excellent performance for rigid as well as non-rigid registration and even for object class recognition. While quantitative evaluations exist that give a hint on the registration performance of the competing designs, these scenarios are not directly transferable to object detection. In this paper we provide qualitative and quantitative analysis of existing interest point detectors as well as local descriptions in the context of face detection and localization.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"An Evaluation of Local Features for Face Detection and Localization\",\"authors\":\"M. Asbach, Peter Hosten, M. Unger\",\"doi\":\"10.1109/WIAMIS.2008.58\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Local features have the ability to overcome the major drawback of traditional, holistic object detection approaches, because they are inherently invariant to geometric deformation and pose; in addition scale and rotation invariance can be easily achieved as well. However, the selection of discriminative feature locations and local descriptions is a complex task that has not been generally solved. In case of face detection, features must possess the discriminative power to differentiate between facial parts and cluttered backgrounds while they have to remain person agnostic. A multitude of suggestions for selecting facial features for tracking or identification / recognition can be found in literature, most of which rely on semi-automatic or manual definition of the feature locations. In contrast, fully automatic feature selection and generic description approaches like SIFT and SURF have been shown to provide excellent performance for rigid as well as non-rigid registration and even for object class recognition. While quantitative evaluations exist that give a hint on the registration performance of the competing designs, these scenarios are not directly transferable to object detection. In this paper we provide qualitative and quantitative analysis of existing interest point detectors as well as local descriptions in the context of face detection and localization.\",\"PeriodicalId\":325635,\"journal\":{\"name\":\"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WIAMIS.2008.58\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WIAMIS.2008.58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Evaluation of Local Features for Face Detection and Localization
Local features have the ability to overcome the major drawback of traditional, holistic object detection approaches, because they are inherently invariant to geometric deformation and pose; in addition scale and rotation invariance can be easily achieved as well. However, the selection of discriminative feature locations and local descriptions is a complex task that has not been generally solved. In case of face detection, features must possess the discriminative power to differentiate between facial parts and cluttered backgrounds while they have to remain person agnostic. A multitude of suggestions for selecting facial features for tracking or identification / recognition can be found in literature, most of which rely on semi-automatic or manual definition of the feature locations. In contrast, fully automatic feature selection and generic description approaches like SIFT and SURF have been shown to provide excellent performance for rigid as well as non-rigid registration and even for object class recognition. While quantitative evaluations exist that give a hint on the registration performance of the competing designs, these scenarios are not directly transferable to object detection. In this paper we provide qualitative and quantitative analysis of existing interest point detectors as well as local descriptions in the context of face detection and localization.