Muhammad Ammar Zulkarnanie, K. S. Shanmugam, N. Badruddin, M. N. Saad
{"title":"基于PERCLOS算法的闭眼判断改进","authors":"Muhammad Ammar Zulkarnanie, K. S. Shanmugam, N. Badruddin, M. N. Saad","doi":"10.1109/ICFTSC57269.2022.10039811","DOIUrl":null,"url":null,"abstract":"This study presents an algorithm that can detect people’s facial features being studied and then applied mainly on daily basis activities, as an example in driving which is detection of driver drowsiness. In this study, the algorithm named ‘PERCLOS’ which stands for ‘percentage of eye closure’ was tested to detect face by using two face landmark detectors, that are pre-trained model and library Dlib’s 68-points facial landmark and 468 3D face landmarks detector from MediaPipe by Google as an alternative and detects the condition of a person’s eye based on Eye Aspect Ratio (EAR). Initial assessment of the Dlib’s solution on 151,537 frames (about 84 minutes) of one of tested subjects revealed that 98.66% of eye states were properly identified, resulting in 378 blinks to be recorded. Despite having rather good accuracy, the algorithm produced 166 more blinks than the 212 blinks that were expected. As for MediaPipe, with 264 blinks and only 52 additional blinks, the MediaPipe Face Mesh solution was able to categorize the identical subject with a classification accuracy of 99.87%. Additionally, adaptive thresholds for different subjects were applied in order to investigate a way to improve the studied algorithm. Surprisingly, the adaptive threshold method being studied resulted in decreasing accuracy and precision for some of the subjects. For one of tested subject, the resulted precision of studied algorithm somehow drops from 100% to 98.60%.","PeriodicalId":386462,"journal":{"name":"2022 International Conference on Future Trends in Smart Communities (ICFTSC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Enhancements to PERCLOS Algorithm for Determining Eye Closures\",\"authors\":\"Muhammad Ammar Zulkarnanie, K. S. Shanmugam, N. Badruddin, M. N. Saad\",\"doi\":\"10.1109/ICFTSC57269.2022.10039811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study presents an algorithm that can detect people’s facial features being studied and then applied mainly on daily basis activities, as an example in driving which is detection of driver drowsiness. In this study, the algorithm named ‘PERCLOS’ which stands for ‘percentage of eye closure’ was tested to detect face by using two face landmark detectors, that are pre-trained model and library Dlib’s 68-points facial landmark and 468 3D face landmarks detector from MediaPipe by Google as an alternative and detects the condition of a person’s eye based on Eye Aspect Ratio (EAR). Initial assessment of the Dlib’s solution on 151,537 frames (about 84 minutes) of one of tested subjects revealed that 98.66% of eye states were properly identified, resulting in 378 blinks to be recorded. Despite having rather good accuracy, the algorithm produced 166 more blinks than the 212 blinks that were expected. As for MediaPipe, with 264 blinks and only 52 additional blinks, the MediaPipe Face Mesh solution was able to categorize the identical subject with a classification accuracy of 99.87%. Additionally, adaptive thresholds for different subjects were applied in order to investigate a way to improve the studied algorithm. Surprisingly, the adaptive threshold method being studied resulted in decreasing accuracy and precision for some of the subjects. For one of tested subject, the resulted precision of studied algorithm somehow drops from 100% to 98.60%.\",\"PeriodicalId\":386462,\"journal\":{\"name\":\"2022 International Conference on Future Trends in Smart Communities (ICFTSC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Future Trends in Smart Communities (ICFTSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFTSC57269.2022.10039811\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Future Trends in Smart Communities (ICFTSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFTSC57269.2022.10039811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
本研究提出了一种算法,可以检测被研究的人的面部特征,并将其主要应用于日常活动中,以驾驶为例,即检测驾驶员的睡意。在本研究中,我们测试了名为“PERCLOS”的算法,即“闭眼百分比”,通过使用两个面部地标检测器来检测人脸,这两个人脸地标检测器是预训练模型和库Dlib的68点面部地标和谷歌MediaPipe的468个3D面部地标检测器作为替代,并基于眼睛纵横比(EAR)检测人的眼睛状况。对其中一名测试对象的151537帧(约84分钟)的Dlib解决方案的初步评估显示,98.66%的眼睛状态被正确识别,从而记录了378次眨眼。尽管准确率相当高,但该算法产生的眨眼次数比预期的212次多出166次。对于MediaPipe,使用264次眨眼和52次额外眨眼,MediaPipe Face Mesh解决方案能够对相同的主题进行分类,分类准确率为99.87%。此外,采用不同对象的自适应阈值,探索改进算法的方法。令人惊讶的是,所研究的自适应阈值法导致某些对象的准确性和精密度下降。对于其中一个被测试对象,所研究算法的结果精度不知何故从100%下降到98.60%。
Enhancements to PERCLOS Algorithm for Determining Eye Closures
This study presents an algorithm that can detect people’s facial features being studied and then applied mainly on daily basis activities, as an example in driving which is detection of driver drowsiness. In this study, the algorithm named ‘PERCLOS’ which stands for ‘percentage of eye closure’ was tested to detect face by using two face landmark detectors, that are pre-trained model and library Dlib’s 68-points facial landmark and 468 3D face landmarks detector from MediaPipe by Google as an alternative and detects the condition of a person’s eye based on Eye Aspect Ratio (EAR). Initial assessment of the Dlib’s solution on 151,537 frames (about 84 minutes) of one of tested subjects revealed that 98.66% of eye states were properly identified, resulting in 378 blinks to be recorded. Despite having rather good accuracy, the algorithm produced 166 more blinks than the 212 blinks that were expected. As for MediaPipe, with 264 blinks and only 52 additional blinks, the MediaPipe Face Mesh solution was able to categorize the identical subject with a classification accuracy of 99.87%. Additionally, adaptive thresholds for different subjects were applied in order to investigate a way to improve the studied algorithm. Surprisingly, the adaptive threshold method being studied resulted in decreasing accuracy and precision for some of the subjects. For one of tested subject, the resulted precision of studied algorithm somehow drops from 100% to 98.60%.