最新人脸标记检测器在人脸遮挡受试者眼状态分类中的比较研究

K. S. Shanmugam, N. Badruddin, V. Asirvadam
{"title":"最新人脸标记检测器在人脸遮挡受试者眼状态分类中的比较研究","authors":"K. S. Shanmugam, N. Badruddin, V. Asirvadam","doi":"10.1109/ECBIOS54627.2022.9945018","DOIUrl":null,"url":null,"abstract":"A study on drowsiness detection is being conducted based on the video dataset of 22 subjects who are wearing EEG sensors while driving on a simulator. The video dataset is not labelled, and it is in the process of being labelled by using Percentage of Eye Closure (PERCLOS) which is an indicator of drowsiness level. One of the preliminary steps in obtaining PERCLOS values, is to determine the eye state (open or closed) of subjects. A common method used by researchers to determine the eye state of the subject is using Eye Aspect Ratio (EAR) values that are obtained from face landmark detectors such as DLib's 68 face landmarks predictor. Based on the evaluation of the DLib's solution on 151,537 frames (approximately 84 minutes) of one subject, it was found that 98.66% of eyes state were classified correctly which resulted in 378 blinks to be detected (expected 212 blinks). The 468 3D face landmarks detector from MediaPipe (Google) was proposed as an alternative and it managed to classify the same subject with a classification accuracy of 99.87% with 264 blinks (only 52 extra blinks). Plenty of researchers are not aware of this solution and are uncertain of how it compares to DLib's solution. Thus, this paper compares the performance of these two solutions in terms of processing time and eye state classification metrics based on 10 diverse subjects.","PeriodicalId":330175,"journal":{"name":"2022 IEEE 4th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Study of State-of-the-art Face Landmark Detectors for Eye State Classification in Subjects with Face Occlusion\",\"authors\":\"K. S. Shanmugam, N. Badruddin, V. Asirvadam\",\"doi\":\"10.1109/ECBIOS54627.2022.9945018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A study on drowsiness detection is being conducted based on the video dataset of 22 subjects who are wearing EEG sensors while driving on a simulator. The video dataset is not labelled, and it is in the process of being labelled by using Percentage of Eye Closure (PERCLOS) which is an indicator of drowsiness level. One of the preliminary steps in obtaining PERCLOS values, is to determine the eye state (open or closed) of subjects. A common method used by researchers to determine the eye state of the subject is using Eye Aspect Ratio (EAR) values that are obtained from face landmark detectors such as DLib's 68 face landmarks predictor. Based on the evaluation of the DLib's solution on 151,537 frames (approximately 84 minutes) of one subject, it was found that 98.66% of eyes state were classified correctly which resulted in 378 blinks to be detected (expected 212 blinks). The 468 3D face landmarks detector from MediaPipe (Google) was proposed as an alternative and it managed to classify the same subject with a classification accuracy of 99.87% with 264 blinks (only 52 extra blinks). Plenty of researchers are not aware of this solution and are uncertain of how it compares to DLib's solution. Thus, this paper compares the performance of these two solutions in terms of processing time and eye state classification metrics based on 10 diverse subjects.\",\"PeriodicalId\":330175,\"journal\":{\"name\":\"2022 IEEE 4th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 4th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ECBIOS54627.2022.9945018\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 4th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECBIOS54627.2022.9945018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

正在对22名佩戴脑电图传感器的人在模拟器上驾驶时的视频数据进行检测睡意的研究。视频数据集没有标记,它正在使用眼睛关闭百分比(PERCLOS)进行标记,这是一个困倦程度的指标。获得PERCLOS值的初步步骤之一是确定受试者的眼睛状态(睁眼或闭眼)。研究人员常用的一种确定受试者眼睛状态的方法是使用眼睛宽高比(EAR)值,该值是从面部地标检测器(如DLib的68个面部地标预测器)中获得的。通过对一名受试者151537帧(约84分钟)的DLib解进行评估,发现98.66%的眼睛状态被正确分类,从而检测到378次眨眼(预期为212次)。MediaPipe (Google)的468 3D人脸地标检测器被提出作为一种替代方案,它在264次眨眼(只额外眨眼52次)的情况下对同一主题进行了分类,分类准确率达到99.87%。许多研究人员不知道这个解决方案,也不确定它与DLib的解决方案相比如何。因此,本文比较了这两种解决方案在处理时间和基于10个不同主题的眼状态分类指标方面的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparative Study of State-of-the-art Face Landmark Detectors for Eye State Classification in Subjects with Face Occlusion
A study on drowsiness detection is being conducted based on the video dataset of 22 subjects who are wearing EEG sensors while driving on a simulator. The video dataset is not labelled, and it is in the process of being labelled by using Percentage of Eye Closure (PERCLOS) which is an indicator of drowsiness level. One of the preliminary steps in obtaining PERCLOS values, is to determine the eye state (open or closed) of subjects. A common method used by researchers to determine the eye state of the subject is using Eye Aspect Ratio (EAR) values that are obtained from face landmark detectors such as DLib's 68 face landmarks predictor. Based on the evaluation of the DLib's solution on 151,537 frames (approximately 84 minutes) of one subject, it was found that 98.66% of eyes state were classified correctly which resulted in 378 blinks to be detected (expected 212 blinks). The 468 3D face landmarks detector from MediaPipe (Google) was proposed as an alternative and it managed to classify the same subject with a classification accuracy of 99.87% with 264 blinks (only 52 extra blinks). Plenty of researchers are not aware of this solution and are uncertain of how it compares to DLib's solution. Thus, this paper compares the performance of these two solutions in terms of processing time and eye state classification metrics based on 10 diverse subjects.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信