使用深度学习和图像处理方法对面部和眼球运动进行计算机控制

Muhammet Fatih Çapşek, Abdulkadir Karacı
{"title":"使用深度学习和图像处理方法对面部和眼球运动进行计算机控制","authors":"Muhammet Fatih Çapşek, Abdulkadir Karacı","doi":"10.31202/ecjse.1131377","DOIUrl":null,"url":null,"abstract":"Today's computing is one of the basic needs of every human being. Many actions are done with the mouse in the use of the computer. Individuals with physical disabilities, paralysis from the neck down, or ALS patients who have difficulty in making physical contact with the computer are having difficulty using computers. In this study, an artificial intelligence-assisted system has been developed for these individuals, where they can control the mouse with head and eye movements. In this system, facial movements and eyes are detected in real-time through the library of Haar Cascade, Dlib, and Open CV from the images acquired through the camera. When Haar Cascade is used to detect the face region, the Dlib library is used to acquire right and left eye region images from this detected face image. These eye areas are provided as an introduction to the CNN model, which is trained with 2874 eye data (https://github.com/iparaskev/simple-blink-detector), and it is determined that the eye is closed or open. The CNN model 1500 is trained on a public eye image dataset representing open and 1374 closed-eye conditions. The left eye closed and opened state allows the mouse to click left and the right eye to close and open, and the right mouse to click. In addition, the location of the face detected with Haar Cascade is used to model mouse motion. The developed system is a real-time hybrid system with a combination of different methods and has been tested on different users. According to the test results, it was observed that the system correctly identified the eyes and the closed state of these eyes, classifying the blink event with CNN in both eyes correctly. However, it has been determined that there has been a slowness in modeling mouse movement or a poor fit to facial movement. The next study will focus on this issue and improve it by fine-tuning the system with data from many people.","PeriodicalId":11622,"journal":{"name":"El-Cezeri Fen ve Mühendislik Dergisi","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Computer Control with Face and Eye Movements Using Deep Learning and Image Processing Methods\",\"authors\":\"Muhammet Fatih Çapşek, Abdulkadir Karacı\",\"doi\":\"10.31202/ecjse.1131377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Today's computing is one of the basic needs of every human being. Many actions are done with the mouse in the use of the computer. Individuals with physical disabilities, paralysis from the neck down, or ALS patients who have difficulty in making physical contact with the computer are having difficulty using computers. In this study, an artificial intelligence-assisted system has been developed for these individuals, where they can control the mouse with head and eye movements. In this system, facial movements and eyes are detected in real-time through the library of Haar Cascade, Dlib, and Open CV from the images acquired through the camera. When Haar Cascade is used to detect the face region, the Dlib library is used to acquire right and left eye region images from this detected face image. These eye areas are provided as an introduction to the CNN model, which is trained with 2874 eye data (https://github.com/iparaskev/simple-blink-detector), and it is determined that the eye is closed or open. The CNN model 1500 is trained on a public eye image dataset representing open and 1374 closed-eye conditions. The left eye closed and opened state allows the mouse to click left and the right eye to close and open, and the right mouse to click. In addition, the location of the face detected with Haar Cascade is used to model mouse motion. The developed system is a real-time hybrid system with a combination of different methods and has been tested on different users. According to the test results, it was observed that the system correctly identified the eyes and the closed state of these eyes, classifying the blink event with CNN in both eyes correctly. However, it has been determined that there has been a slowness in modeling mouse movement or a poor fit to facial movement. The next study will focus on this issue and improve it by fine-tuning the system with data from many people.\",\"PeriodicalId\":11622,\"journal\":{\"name\":\"El-Cezeri Fen ve Mühendislik Dergisi\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"El-Cezeri Fen ve Mühendislik Dergisi\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31202/ecjse.1131377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"El-Cezeri Fen ve Mühendislik Dergisi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31202/ecjse.1131377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

今天的计算是每个人的基本需求之一。在使用计算机时,许多操作都是用鼠标完成的。身体残疾的人、颈部以下瘫痪的人或与电脑有身体接触困难的肌萎缩性侧索硬化症患者都有使用电脑的困难。在这项研究中,为这些人开发了一种人工智能辅助系统,他们可以通过头部和眼睛的运动来控制鼠标。在该系统中,通过Haar Cascade, Dlib和Open CV库从相机获取的图像中实时检测面部运动和眼睛。当使用Haar Cascade对人脸区域进行检测时,使用Dlib库从检测到的人脸图像中获取左右眼区域图像。这些眼睛区域是作为CNN模型的介绍,CNN模型使用2874个眼睛数据(https://github.com/iparaskev/simple-blink-detector)进行训练,并确定眼睛是闭着的还是打开的。CNN模型1500是在代表睁眼和闭眼条件的公共眼睛图像数据集上训练的。左眼闭眼和睁眼状态允许鼠标点击左眼和右眼闭眼和睁眼,鼠标右键点击。此外,利用Haar Cascade检测到的人脸位置来模拟鼠标运动。所开发的系统是一个实时混合系统,结合了不同的方法,并在不同的用户上进行了测试。从测试结果可以看出,系统正确地识别了眼睛和眼睛的闭合状态,用CNN对双眼的眨眼事件进行了正确的分类。然而,已经确定的是,在模拟鼠标运动或面部运动不适合缓慢。接下来的研究将集中在这个问题上,并通过使用来自许多人的数据对系统进行微调来改进它。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Computer Control with Face and Eye Movements Using Deep Learning and Image Processing Methods
Today's computing is one of the basic needs of every human being. Many actions are done with the mouse in the use of the computer. Individuals with physical disabilities, paralysis from the neck down, or ALS patients who have difficulty in making physical contact with the computer are having difficulty using computers. In this study, an artificial intelligence-assisted system has been developed for these individuals, where they can control the mouse with head and eye movements. In this system, facial movements and eyes are detected in real-time through the library of Haar Cascade, Dlib, and Open CV from the images acquired through the camera. When Haar Cascade is used to detect the face region, the Dlib library is used to acquire right and left eye region images from this detected face image. These eye areas are provided as an introduction to the CNN model, which is trained with 2874 eye data (https://github.com/iparaskev/simple-blink-detector), and it is determined that the eye is closed or open. The CNN model 1500 is trained on a public eye image dataset representing open and 1374 closed-eye conditions. The left eye closed and opened state allows the mouse to click left and the right eye to close and open, and the right mouse to click. In addition, the location of the face detected with Haar Cascade is used to model mouse motion. The developed system is a real-time hybrid system with a combination of different methods and has been tested on different users. According to the test results, it was observed that the system correctly identified the eyes and the closed state of these eyes, classifying the blink event with CNN in both eyes correctly. However, it has been determined that there has been a slowness in modeling mouse movement or a poor fit to facial movement. The next study will focus on this issue and improve it by fine-tuning the system with data from many people.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信