基于sift点分布的头部姿态估计方法

2012 IEEE International Workshop on Machine Learning for Signal Processing Pub Date : 2012-11-12 DOI:10.1109/MLSP.2012.6349751

Nastaran Ghadarghadar, E. Cansizoglu, Peng Zhang, Deniz Erdoğmuş

{"title":"基于sift点分布的头部姿态估计方法","authors":"Nastaran Ghadarghadar, E. Cansizoglu, Peng Zhang, Deniz Erdoğmuş","doi":"10.1109/MLSP.2012.6349751","DOIUrl":null,"url":null,"abstract":"Estimating the head pose of a person in a video or image sequence is a challenging problem in computer vision. In this paper, we present a new technique on how to estimate the human face pose from a video sequence, by creating a probabilistic model based on the scale invariant features of the face. This method consists of four major steps: (1) the face is detected using the basic CAMSHIFT algorithm, (2) a training dataset is created for each face pose, (3) the distinctive invariant features of the training and test face image sets are extracted using the scale-invariant feature transform (SIFT) algorithm, (4) a kernel density estimate (KDE) of SIFT points on each image is generated. Pose classification is achieved by nearest-neighbor search using a KDE overlap measure. Results indicate that the proposed method is robust, accurate, not computationally expensive, and can successfully be used for pose estimation.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A SIFT-point distribution-based method for head pose estimation\",\"authors\":\"Nastaran Ghadarghadar, E. Cansizoglu, Peng Zhang, Deniz Erdoğmuş\",\"doi\":\"10.1109/MLSP.2012.6349751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Estimating the head pose of a person in a video or image sequence is a challenging problem in computer vision. In this paper, we present a new technique on how to estimate the human face pose from a video sequence, by creating a probabilistic model based on the scale invariant features of the face. This method consists of four major steps: (1) the face is detected using the basic CAMSHIFT algorithm, (2) a training dataset is created for each face pose, (3) the distinctive invariant features of the training and test face image sets are extracted using the scale-invariant feature transform (SIFT) algorithm, (4) a kernel density estimate (KDE) of SIFT points on each image is generated. Pose classification is achieved by nearest-neighbor search using a KDE overlap measure. Results indicate that the proposed method is robust, accurate, not computationally expensive, and can successfully be used for pose estimation.\",\"PeriodicalId\":262601,\"journal\":{\"name\":\"2012 IEEE International Workshop on Machine Learning for Signal Processing\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Workshop on Machine Learning for Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MLSP.2012.6349751\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Workshop on Machine Learning for Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLSP.2012.6349751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在视频或图像序列中估计人的头部姿势是计算机视觉中的一个具有挑战性的问题。在本文中，我们提出了一种新的技术，如何从视频序列中估计人脸姿态，通过创建基于人脸的比例不变特征的概率模型。该方法包括四个主要步骤:(1)使用基本CAMSHIFT算法检测人脸;(2)为每个人脸姿态创建训练数据集;(3)使用尺度不变特征变换(SIFT)算法提取训练和测试人脸图像集的不同不变特征;(4)生成每张图像上SIFT点的核密度估计(KDE)。姿态分类是通过使用KDE重叠度量的最近邻搜索来实现的。结果表明，该方法鲁棒性好，精度高，计算量小，可成功用于姿态估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A SIFT-point distribution-based method for head pose estimation

Estimating the head pose of a person in a video or image sequence is a challenging problem in computer vision. In this paper, we present a new technique on how to estimate the human face pose from a video sequence, by creating a probabilistic model based on the scale invariant features of the face. This method consists of four major steps: (1) the face is detected using the basic CAMSHIFT algorithm, (2) a training dataset is created for each face pose, (3) the distinctive invariant features of the training and test face image sets are extracted using the scale-invariant feature transform (SIFT) algorithm, (4) a kernel density estimate (KDE) of SIFT points on each image is generated. Pose classification is achieved by nearest-neighbor search using a KDE overlap measure. Results indicate that the proposed method is robust, accurate, not computationally expensive, and can successfully be used for pose estimation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 IEEE International Workshop on Machine Learning for Signal Processing

自引率

0.00%

发文量