多阶段实时人体头部姿态估计

Xiangwei Zhang, Dongping Zhang, Jun Ge, Kui Hu, Li Yang, Ping Chen
{"title":"多阶段实时人体头部姿态估计","authors":"Xiangwei Zhang, Dongping Zhang, Jun Ge, Kui Hu, Li Yang, Ping Chen","doi":"10.1109/ICSAI48974.2019.9010492","DOIUrl":null,"url":null,"abstract":"The paper explores a human head pose estimation method combining face detection and face dichotomy optimization algorithm. The estimation of head pose is mainly to obtain the angle information of face orientation. The human head pose estimation algorithm in this paper mainly estimates the 3D Euler Angle of the input face patch. That is to say, a regressor is trained in a data-driven way, which can directly predict the input face blocks. In this paper, we use three models to make the final prediction of human head posture. First, the first model is used to detect faces. So as to prevent the loss of face detection in realtime face detection, we set a small threshold on the face detection model. Second, on the basis of face detection, a face dichotomy model is added to optimize the detected face to determine whether it is a face. Third, if the result of the second step is a face, the head pose estimation of the detected face is carried out. This paper mainly aims at the change of head pose angle to judge people's attention, and puts forward multi-stage head pose estimation to meet the demand of real-time engineering calculation. In this paper, the first novelty is that the average speed on Windows with CPU(i5-7500 3.40GHz) is 35ms/frame, and the second novelty is that we have collected 64101 face images under the surveillance camera, and the third novelty is that we have mixed depthwise separable convolution for head pose recognition. The method is tested on the UMDFaces Dataset, and the consequences show that compared with extra approaches, the proposed method can obtain improvements in efficiency.","PeriodicalId":270809,"journal":{"name":"2019 6th International Conference on Systems and Informatics (ICSAI)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-stage Real-time Human Head Pose Estimation\",\"authors\":\"Xiangwei Zhang, Dongping Zhang, Jun Ge, Kui Hu, Li Yang, Ping Chen\",\"doi\":\"10.1109/ICSAI48974.2019.9010492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper explores a human head pose estimation method combining face detection and face dichotomy optimization algorithm. The estimation of head pose is mainly to obtain the angle information of face orientation. The human head pose estimation algorithm in this paper mainly estimates the 3D Euler Angle of the input face patch. That is to say, a regressor is trained in a data-driven way, which can directly predict the input face blocks. In this paper, we use three models to make the final prediction of human head posture. First, the first model is used to detect faces. So as to prevent the loss of face detection in realtime face detection, we set a small threshold on the face detection model. Second, on the basis of face detection, a face dichotomy model is added to optimize the detected face to determine whether it is a face. Third, if the result of the second step is a face, the head pose estimation of the detected face is carried out. This paper mainly aims at the change of head pose angle to judge people's attention, and puts forward multi-stage head pose estimation to meet the demand of real-time engineering calculation. In this paper, the first novelty is that the average speed on Windows with CPU(i5-7500 3.40GHz) is 35ms/frame, and the second novelty is that we have collected 64101 face images under the surveillance camera, and the third novelty is that we have mixed depthwise separable convolution for head pose recognition. The method is tested on the UMDFaces Dataset, and the consequences show that compared with extra approaches, the proposed method can obtain improvements in efficiency.\",\"PeriodicalId\":270809,\"journal\":{\"name\":\"2019 6th International Conference on Systems and Informatics (ICSAI)\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 6th International Conference on Systems and Informatics (ICSAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSAI48974.2019.9010492\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 6th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSAI48974.2019.9010492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

研究了一种结合人脸检测和人脸二分优化算法的人脸姿态估计方法。头部姿态的估计主要是获取人脸方向的角度信息。本文的人脸姿态估计算法主要是对输入人脸贴片的三维欧拉角进行估计。也就是说,以数据驱动的方式训练回归器,它可以直接预测输入面块。在本文中,我们使用三种模型对人类头部姿势进行最终预测。首先,使用第一个模型来检测人脸。为了防止在实时人脸检测中丢失人脸检测,我们在人脸检测模型上设置了较小的阈值。其次,在人脸检测的基础上,加入人脸二分模型,对检测到的人脸进行优化,判断其是否为人脸;第三步,如果第二步的结果是人脸,则对检测到的人脸进行头姿估计。本文主要针对头部姿态角度的变化来判断人的注意力,提出了多级头部姿态估计,以满足实时工程计算的需求。本文的第一个新颖之处是在CPU(i5-7500 3.40GHz)的Windows上平均速度为35ms/帧,第二个新颖之处是我们在监控摄像头下采集了64101张人脸图像,第三个新颖之处是我们将混合深度可分离卷积用于头部姿态识别。在UMDFaces数据集上对该方法进行了测试,结果表明,与其他方法相比,该方法的效率有所提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-stage Real-time Human Head Pose Estimation
The paper explores a human head pose estimation method combining face detection and face dichotomy optimization algorithm. The estimation of head pose is mainly to obtain the angle information of face orientation. The human head pose estimation algorithm in this paper mainly estimates the 3D Euler Angle of the input face patch. That is to say, a regressor is trained in a data-driven way, which can directly predict the input face blocks. In this paper, we use three models to make the final prediction of human head posture. First, the first model is used to detect faces. So as to prevent the loss of face detection in realtime face detection, we set a small threshold on the face detection model. Second, on the basis of face detection, a face dichotomy model is added to optimize the detected face to determine whether it is a face. Third, if the result of the second step is a face, the head pose estimation of the detected face is carried out. This paper mainly aims at the change of head pose angle to judge people's attention, and puts forward multi-stage head pose estimation to meet the demand of real-time engineering calculation. In this paper, the first novelty is that the average speed on Windows with CPU(i5-7500 3.40GHz) is 35ms/frame, and the second novelty is that we have collected 64101 face images under the surveillance camera, and the third novelty is that we have mixed depthwise separable convolution for head pose recognition. The method is tested on the UMDFaces Dataset, and the consequences show that compared with extra approaches, the proposed method can obtain improvements in efficiency.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信