Nastaran Ghadarghadar, E. Cansizoglu, Peng Zhang, Deniz Erdoğmuş
{"title":"基于sift点分布的头部姿态估计方法","authors":"Nastaran Ghadarghadar, E. Cansizoglu, Peng Zhang, Deniz Erdoğmuş","doi":"10.1109/MLSP.2012.6349751","DOIUrl":null,"url":null,"abstract":"Estimating the head pose of a person in a video or image sequence is a challenging problem in computer vision. In this paper, we present a new technique on how to estimate the human face pose from a video sequence, by creating a probabilistic model based on the scale invariant features of the face. This method consists of four major steps: (1) the face is detected using the basic CAMSHIFT algorithm, (2) a training dataset is created for each face pose, (3) the distinctive invariant features of the training and test face image sets are extracted using the scale-invariant feature transform (SIFT) algorithm, (4) a kernel density estimate (KDE) of SIFT points on each image is generated. Pose classification is achieved by nearest-neighbor search using a KDE overlap measure. Results indicate that the proposed method is robust, accurate, not computationally expensive, and can successfully be used for pose estimation.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A SIFT-point distribution-based method for head pose estimation\",\"authors\":\"Nastaran Ghadarghadar, E. Cansizoglu, Peng Zhang, Deniz Erdoğmuş\",\"doi\":\"10.1109/MLSP.2012.6349751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Estimating the head pose of a person in a video or image sequence is a challenging problem in computer vision. In this paper, we present a new technique on how to estimate the human face pose from a video sequence, by creating a probabilistic model based on the scale invariant features of the face. This method consists of four major steps: (1) the face is detected using the basic CAMSHIFT algorithm, (2) a training dataset is created for each face pose, (3) the distinctive invariant features of the training and test face image sets are extracted using the scale-invariant feature transform (SIFT) algorithm, (4) a kernel density estimate (KDE) of SIFT points on each image is generated. Pose classification is achieved by nearest-neighbor search using a KDE overlap measure. Results indicate that the proposed method is robust, accurate, not computationally expensive, and can successfully be used for pose estimation.\",\"PeriodicalId\":262601,\"journal\":{\"name\":\"2012 IEEE International Workshop on Machine Learning for Signal Processing\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Workshop on Machine Learning for Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MLSP.2012.6349751\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Workshop on Machine Learning for Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLSP.2012.6349751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A SIFT-point distribution-based method for head pose estimation
Estimating the head pose of a person in a video or image sequence is a challenging problem in computer vision. In this paper, we present a new technique on how to estimate the human face pose from a video sequence, by creating a probabilistic model based on the scale invariant features of the face. This method consists of four major steps: (1) the face is detected using the basic CAMSHIFT algorithm, (2) a training dataset is created for each face pose, (3) the distinctive invariant features of the training and test face image sets are extracted using the scale-invariant feature transform (SIFT) algorithm, (4) a kernel density estimate (KDE) of SIFT points on each image is generated. Pose classification is achieved by nearest-neighbor search using a KDE overlap measure. Results indicate that the proposed method is robust, accurate, not computationally expensive, and can successfully be used for pose estimation.