在多粒度深度框架中整合记忆机制，用于驾驶员昏昏欲睡检测

Intelligence & Robotics Pub Date : 2023-11-27 DOI:10.20517/ir.2023.34

Handan Zhang, Tie Liu, Jie Lyu, Dapeng Chen, Zejian Yuan

{"title":"在多粒度深度框架中整合记忆机制，用于驾驶员昏昏欲睡检测","authors":"Handan Zhang, Tie Liu, Jie Lyu, Dapeng Chen, Zejian Yuan","doi":"10.20517/ir.2023.34","DOIUrl":null,"url":null,"abstract":"Driver drowsiness detection is a critical task for early warning of safe driving, while existing spatial feature-based methods face the challenges of large variations of head pose. This paper proposes a novel approach to integrate the memory mechanism in a multi-granularity deep framework to detect driver drowsiness, and the temporal dependencies over sequential frames are well integrated with the spatial deep learning framework on the frontal faces. The proposed approach includes two steps. First, the spatial Multi-granularity Convolutional Neural Network is designed to utilize a group of parallel Convolutional Neural Network extractors on well-aligned facial patches of different granularities and extract facial representations effectively for large variations of head pose. Furthermore, it can flexibly fuse detailed appearance clues of the main parts and local-to-global spatial constraints. Second, the memory mechanism is set up using a deep long short-term memory network of facial representations to explore long-term relationships with variable length over sequential frames, which is capable of distinguishing the states with temporal dependencies, such as blinking and closing eyes. The proposed approach achieves 90.05% accuracy and about 37 frames per second (FPS) speed on the evaluation set of the National Tsing Hua University Driver Drowsiness Detection dataset, which is applied to the intelligent vehicle for driver drowsiness detection. A dataset named Forward Instant Driver Drowsiness Detection is also built and will be publicly accessible to speed up the study of driver drowsiness detection.","PeriodicalId":426514,"journal":{"name":"Intelligence & Robotics","volume":"77 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrate memory mechanism in multi-granularity deep framework for driver drowsiness detection\",\"authors\":\"Handan Zhang, Tie Liu, Jie Lyu, Dapeng Chen, Zejian Yuan\",\"doi\":\"10.20517/ir.2023.34\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Driver drowsiness detection is a critical task for early warning of safe driving, while existing spatial feature-based methods face the challenges of large variations of head pose. This paper proposes a novel approach to integrate the memory mechanism in a multi-granularity deep framework to detect driver drowsiness, and the temporal dependencies over sequential frames are well integrated with the spatial deep learning framework on the frontal faces. The proposed approach includes two steps. First, the spatial Multi-granularity Convolutional Neural Network is designed to utilize a group of parallel Convolutional Neural Network extractors on well-aligned facial patches of different granularities and extract facial representations effectively for large variations of head pose. Furthermore, it can flexibly fuse detailed appearance clues of the main parts and local-to-global spatial constraints. Second, the memory mechanism is set up using a deep long short-term memory network of facial representations to explore long-term relationships with variable length over sequential frames, which is capable of distinguishing the states with temporal dependencies, such as blinking and closing eyes. The proposed approach achieves 90.05% accuracy and about 37 frames per second (FPS) speed on the evaluation set of the National Tsing Hua University Driver Drowsiness Detection dataset, which is applied to the intelligent vehicle for driver drowsiness detection. A dataset named Forward Instant Driver Drowsiness Detection is also built and will be publicly accessible to speed up the study of driver drowsiness detection.\",\"PeriodicalId\":426514,\"journal\":{\"name\":\"Intelligence & Robotics\",\"volume\":\"77 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligence & Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20517/ir.2023.34\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence & Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20517/ir.2023.34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

驾驶员瞌睡检测是安全驾驶预警的一项关键任务，而现有的基于空间特征的方法面临着头部姿态变化大的挑战。本文提出了一种在多粒度深度框架中集成记忆机制的新方法来检测驾驶员的嗜睡程度，并将连续帧上的时间依赖性与正面人脸的空间深度学习框架很好地集成在一起。所提出的方法包括两个步骤。首先，设计了空间多粒度卷积神经网络，利用一组并行的卷积神经网络提取器对不同粒度的对齐面部斑块进行提取，并在头部姿态发生较大变化时有效地提取面部表征。此外，它还能灵活融合主要部位的详细外观线索和局部到全局的空间约束。其次，利用面部表征的深度长短期记忆网络建立记忆机制，探索连续帧上长度可变的长期关系，能够区分眨眼和闭眼等具有时间依赖性的状态。所提出的方法在清华大学驾驶员昏昏欲睡检测数据集的评估集上达到了 90.05% 的准确率和约 37 帧/秒（FPS）的速度，该数据集被应用于智能汽车的驾驶员昏昏欲睡检测。此外，还建立了一个名为 "前向即时驾驶员嗜睡检测 "的数据集，并将向公众开放，以加快驾驶员嗜睡检测的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Integrate memory mechanism in multi-granularity deep framework for driver drowsiness detection

Driver drowsiness detection is a critical task for early warning of safe driving, while existing spatial feature-based methods face the challenges of large variations of head pose. This paper proposes a novel approach to integrate the memory mechanism in a multi-granularity deep framework to detect driver drowsiness, and the temporal dependencies over sequential frames are well integrated with the spatial deep learning framework on the frontal faces. The proposed approach includes two steps. First, the spatial Multi-granularity Convolutional Neural Network is designed to utilize a group of parallel Convolutional Neural Network extractors on well-aligned facial patches of different granularities and extract facial representations effectively for large variations of head pose. Furthermore, it can flexibly fuse detailed appearance clues of the main parts and local-to-global spatial constraints. Second, the memory mechanism is set up using a deep long short-term memory network of facial representations to explore long-term relationships with variable length over sequential frames, which is capable of distinguishing the states with temporal dependencies, such as blinking and closing eyes. The proposed approach achieves 90.05% accuracy and about 37 frames per second (FPS) speed on the evaluation set of the National Tsing Hua University Driver Drowsiness Detection dataset, which is applied to the intelligent vehicle for driver drowsiness detection. A dataset named Forward Instant Driver Drowsiness Detection is also built and will be publicly accessible to speed up the study of driver drowsiness detection.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Intelligence & Robotics

自引率

0.00%

发文量