Simple Real-Time Multi-face Tracking Based on Convolutional Neural Networks

Xile Li, J. Lang
{"title":"Simple Real-Time Multi-face Tracking Based on Convolutional Neural Networks","authors":"Xile Li, J. Lang","doi":"10.1109/CRV.2018.00054","DOIUrl":null,"url":null,"abstract":"We present a simple real-time system that is able to track multiple faces for live videos, broadcast, real-time conference recording, etc. Our proposed tracking system is comprised of three parts: face detection, feature extraction and tracking. We employ a previously proposed cascaded Multi-Task Convolutional Neural Network (MTCNN) to detect a face, a simple CNN to extract the features of detected faces and show that a shallow network for face tracking based on the extracted feature maps of the face is sufficient. Our multi-face tracker runs in real-time without any on-line training. We do not adjust any parameters according to different input videos, and the tracker's run-time will not significantly increase with an increase in the number of faces being tracked, i.e., it is easy to deploy in new real-time applications. We evaluate our tracker based on two commonly used metrics in comparison to five recent face trackers. Our proposed simple tracker can perform competitively in comparison to these trackers despite occlusions in the videos and false positives or false negatives during face detection.","PeriodicalId":281779,"journal":{"name":"2018 15th Conference on Computer and Robot Vision (CRV)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 15th Conference on Computer and Robot Vision (CRV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CRV.2018.00054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

We present a simple real-time system that is able to track multiple faces for live videos, broadcast, real-time conference recording, etc. Our proposed tracking system is comprised of three parts: face detection, feature extraction and tracking. We employ a previously proposed cascaded Multi-Task Convolutional Neural Network (MTCNN) to detect a face, a simple CNN to extract the features of detected faces and show that a shallow network for face tracking based on the extracted feature maps of the face is sufficient. Our multi-face tracker runs in real-time without any on-line training. We do not adjust any parameters according to different input videos, and the tracker's run-time will not significantly increase with an increase in the number of faces being tracked, i.e., it is easy to deploy in new real-time applications. We evaluate our tracker based on two commonly used metrics in comparison to five recent face trackers. Our proposed simple tracker can perform competitively in comparison to these trackers despite occlusions in the videos and false positives or false negatives during face detection.
基于卷积神经网络的简单实时多人脸跟踪
我们提出了一个简单的实时系统,能够跟踪多个面孔的直播视频,广播,实时会议录制等。我们提出的跟踪系统由三个部分组成:人脸检测、特征提取和跟踪。我们使用先前提出的级联多任务卷积神经网络(MTCNN)来检测人脸,一个简单的CNN来提取被检测人脸的特征,并表明基于提取的人脸特征映射的浅网络用于人脸跟踪是足够的。我们的多面跟踪器实时运行,无需任何在线培训。我们不根据不同的输入视频调整任何参数,跟踪器的运行时间不会随着被跟踪人脸数量的增加而显著增加,即易于部署在新的实时应用中。我们根据两个常用的指标来评估我们的跟踪器,并与最近的五个面部跟踪器进行比较。我们提出的简单跟踪器可以与这些跟踪器相比具有竞争力,尽管视频中存在遮挡,并且在人脸检测过程中存在假阳性或假阴性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信