基于PERCLOS算法的闭眼判断改进

Muhammad Ammar Zulkarnanie, K. S. Shanmugam, N. Badruddin, M. N. Saad
{"title":"基于PERCLOS算法的闭眼判断改进","authors":"Muhammad Ammar Zulkarnanie, K. S. Shanmugam, N. Badruddin, M. N. Saad","doi":"10.1109/ICFTSC57269.2022.10039811","DOIUrl":null,"url":null,"abstract":"This study presents an algorithm that can detect people’s facial features being studied and then applied mainly on daily basis activities, as an example in driving which is detection of driver drowsiness. In this study, the algorithm named ‘PERCLOS’ which stands for ‘percentage of eye closure’ was tested to detect face by using two face landmark detectors, that are pre-trained model and library Dlib’s 68-points facial landmark and 468 3D face landmarks detector from MediaPipe by Google as an alternative and detects the condition of a person’s eye based on Eye Aspect Ratio (EAR). Initial assessment of the Dlib’s solution on 151,537 frames (about 84 minutes) of one of tested subjects revealed that 98.66% of eye states were properly identified, resulting in 378 blinks to be recorded. Despite having rather good accuracy, the algorithm produced 166 more blinks than the 212 blinks that were expected. As for MediaPipe, with 264 blinks and only 52 additional blinks, the MediaPipe Face Mesh solution was able to categorize the identical subject with a classification accuracy of 99.87%. Additionally, adaptive thresholds for different subjects were applied in order to investigate a way to improve the studied algorithm. Surprisingly, the adaptive threshold method being studied resulted in decreasing accuracy and precision for some of the subjects. For one of tested subject, the resulted precision of studied algorithm somehow drops from 100% to 98.60%.","PeriodicalId":386462,"journal":{"name":"2022 International Conference on Future Trends in Smart Communities (ICFTSC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Enhancements to PERCLOS Algorithm for Determining Eye Closures\",\"authors\":\"Muhammad Ammar Zulkarnanie, K. S. Shanmugam, N. Badruddin, M. N. Saad\",\"doi\":\"10.1109/ICFTSC57269.2022.10039811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study presents an algorithm that can detect people’s facial features being studied and then applied mainly on daily basis activities, as an example in driving which is detection of driver drowsiness. In this study, the algorithm named ‘PERCLOS’ which stands for ‘percentage of eye closure’ was tested to detect face by using two face landmark detectors, that are pre-trained model and library Dlib’s 68-points facial landmark and 468 3D face landmarks detector from MediaPipe by Google as an alternative and detects the condition of a person’s eye based on Eye Aspect Ratio (EAR). Initial assessment of the Dlib’s solution on 151,537 frames (about 84 minutes) of one of tested subjects revealed that 98.66% of eye states were properly identified, resulting in 378 blinks to be recorded. Despite having rather good accuracy, the algorithm produced 166 more blinks than the 212 blinks that were expected. As for MediaPipe, with 264 blinks and only 52 additional blinks, the MediaPipe Face Mesh solution was able to categorize the identical subject with a classification accuracy of 99.87%. Additionally, adaptive thresholds for different subjects were applied in order to investigate a way to improve the studied algorithm. Surprisingly, the adaptive threshold method being studied resulted in decreasing accuracy and precision for some of the subjects. For one of tested subject, the resulted precision of studied algorithm somehow drops from 100% to 98.60%.\",\"PeriodicalId\":386462,\"journal\":{\"name\":\"2022 International Conference on Future Trends in Smart Communities (ICFTSC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Future Trends in Smart Communities (ICFTSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFTSC57269.2022.10039811\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Future Trends in Smart Communities (ICFTSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFTSC57269.2022.10039811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本研究提出了一种算法,可以检测被研究的人的面部特征,并将其主要应用于日常活动中,以驾驶为例,即检测驾驶员的睡意。在本研究中,我们测试了名为“PERCLOS”的算法,即“闭眼百分比”,通过使用两个面部地标检测器来检测人脸,这两个人脸地标检测器是预训练模型和库Dlib的68点面部地标和谷歌MediaPipe的468个3D面部地标检测器作为替代,并基于眼睛纵横比(EAR)检测人的眼睛状况。对其中一名测试对象的151537帧(约84分钟)的Dlib解决方案的初步评估显示,98.66%的眼睛状态被正确识别,从而记录了378次眨眼。尽管准确率相当高,但该算法产生的眨眼次数比预期的212次多出166次。对于MediaPipe,使用264次眨眼和52次额外眨眼,MediaPipe Face Mesh解决方案能够对相同的主题进行分类,分类准确率为99.87%。此外,采用不同对象的自适应阈值,探索改进算法的方法。令人惊讶的是,所研究的自适应阈值法导致某些对象的准确性和精密度下降。对于其中一个被测试对象,所研究算法的结果精度不知何故从100%下降到98.60%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancements to PERCLOS Algorithm for Determining Eye Closures
This study presents an algorithm that can detect people’s facial features being studied and then applied mainly on daily basis activities, as an example in driving which is detection of driver drowsiness. In this study, the algorithm named ‘PERCLOS’ which stands for ‘percentage of eye closure’ was tested to detect face by using two face landmark detectors, that are pre-trained model and library Dlib’s 68-points facial landmark and 468 3D face landmarks detector from MediaPipe by Google as an alternative and detects the condition of a person’s eye based on Eye Aspect Ratio (EAR). Initial assessment of the Dlib’s solution on 151,537 frames (about 84 minutes) of one of tested subjects revealed that 98.66% of eye states were properly identified, resulting in 378 blinks to be recorded. Despite having rather good accuracy, the algorithm produced 166 more blinks than the 212 blinks that were expected. As for MediaPipe, with 264 blinks and only 52 additional blinks, the MediaPipe Face Mesh solution was able to categorize the identical subject with a classification accuracy of 99.87%. Additionally, adaptive thresholds for different subjects were applied in order to investigate a way to improve the studied algorithm. Surprisingly, the adaptive threshold method being studied resulted in decreasing accuracy and precision for some of the subjects. For one of tested subject, the resulted precision of studied algorithm somehow drops from 100% to 98.60%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信