Improvement of Eye Tracking Based on Deep Learning Model for General Purpose Applications

Al-Nahrain Journal for Engineering Sciences Pub Date : 2022-04-03 DOI:10.29194/njes.25010012

Ahmed Aamer Almindelawy, Mohammed H. Ali

{"title":"Improvement of Eye Tracking Based on Deep Learning Model for General Purpose Applications","authors":"Ahmed Aamer Almindelawy, Mohammed H. Ali","doi":"10.29194/njes.25010012","DOIUrl":null,"url":null,"abstract":"The interest in the Eye-tracking technology field dramatically grew up in the last two decades for different purposes and applications like keeping the focus of where the person is looking, how his pupils and irises are reacting for a variety of actions, etc. The resulted data can deliver an extraordinary amount of information about the user when it's interlocked through advanced data analysis systems, it may show information concerned with the user’s age, gender, biometric identity, interests, etc. This paper is concerned about eye motion tracking as an unadulterated tool for different applications in any field required. The improvements in this area of artificial intelligence (AI), machine learning (ML), and deep learning (DL) with eye-tracking techniques allow large opportunities to develop algorithms and applications. In this paper number of models were proposed based on Convolutional neural network (CNN) have been designed, and then the most powerful and accurate model was chosen. The dataset used for the training process (for 16 screen points) consists of 2800 training images and 800 test images (with an average of 175 training images and 50 test images for each spot on the screen of the 16 spots), and it can be collected by the user of any application based on this model. The highest accuracy achieved by the best model was (91.25%) and the minimum loss was (0.23%). The best model consists of (11) layers (4 convolutions, 4 Max pooling, and 3 Dense). Python 3.7 was used to implement the algorithms, KERAS framework for the deep learning algorithms, Visual studio code as an Integrated Development Environment (IDE), and Anaconda navigator for downloading the different libraries. The model was trained with data that can be gathered using cameras of laptops or PCs and without the necessity of special and expensive equipment, also It can be trained for any single eye, depending on application requirements.","PeriodicalId":7470,"journal":{"name":"Al-Nahrain Journal for Engineering Sciences","volume":"7 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Al-Nahrain Journal for Engineering Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29194/njes.25010012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The interest in the Eye-tracking technology field dramatically grew up in the last two decades for different purposes and applications like keeping the focus of where the person is looking, how his pupils and irises are reacting for a variety of actions, etc. The resulted data can deliver an extraordinary amount of information about the user when it's interlocked through advanced data analysis systems, it may show information concerned with the user’s age, gender, biometric identity, interests, etc. This paper is concerned about eye motion tracking as an unadulterated tool for different applications in any field required. The improvements in this area of artificial intelligence (AI), machine learning (ML), and deep learning (DL) with eye-tracking techniques allow large opportunities to develop algorithms and applications. In this paper number of models were proposed based on Convolutional neural network (CNN) have been designed, and then the most powerful and accurate model was chosen. The dataset used for the training process (for 16 screen points) consists of 2800 training images and 800 test images (with an average of 175 training images and 50 test images for each spot on the screen of the 16 spots), and it can be collected by the user of any application based on this model. The highest accuracy achieved by the best model was (91.25%) and the minimum loss was (0.23%). The best model consists of (11) layers (4 convolutions, 4 Max pooling, and 3 Dense). Python 3.7 was used to implement the algorithms, KERAS framework for the deep learning algorithms, Visual studio code as an Integrated Development Environment (IDE), and Anaconda navigator for downloading the different libraries. The model was trained with data that can be gathered using cameras of laptops or PCs and without the necessity of special and expensive equipment, also It can be trained for any single eye, depending on application requirements.

查看原文本刊更多论文

基于深度学习模型的通用眼动追踪改进

在过去的二十年里，人们对眼球追踪技术领域的兴趣急剧增长，用于不同的目的和应用，比如保持人们注视的焦点，他的瞳孔和虹膜对各种动作的反应等等。通过先进的数据分析系统，生成的数据可以提供大量关于用户的信息，它可能显示与用户的年龄、性别、生物识别身份、兴趣等有关的信息。本文关注眼动追踪作为一个纯粹的工具，在任何领域的不同应用需要。人工智能(AI)、机器学习(ML)和深度学习(DL)与眼动追踪技术在这一领域的进步为开发算法和应用程序提供了巨大的机会。本文在卷积神经网络(CNN)的基础上设计了多个模型，并选择了最强大、最准确的模型。用于训练过程的数据集(16个屏幕点)由2800张训练图像和800张测试图像组成(16个点的屏幕上每个点平均175张训练图像和50张测试图像)，基于该模型的任何应用程序的用户都可以收集到该数据集。最佳模型的准确率最高(91.25%)，损失最小(0.23%)。最好的模型由(11)层组成(4个卷积，4个Max池和3个Dense)。Python 3.7用于实现算法，KERAS框架用于深度学习算法，Visual studio代码作为集成开发环境(IDE)， Anaconda导航器用于下载不同的库。该模型的训练数据可以使用笔记本电脑或个人电脑的摄像头收集，不需要特殊和昂贵的设备，也可以根据应用需求训练任何一只眼睛。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Al-Nahrain Journal for Engineering Sciences

自引率

0.00%

发文量