Lightweight Convolutional Neural Network for Image Processing Method for Gaze Estimation and Eye Movement Event Detection

Q3 Biochemistry, Genetics and Molecular Biology

IPSJ Transactions on Bioinformatics Pub Date : 2020-01-01 DOI:10.2197/ipsjtbio.13.7

Joshua Emoto, Y. Hirata

{"title":"Lightweight Convolutional Neural Network for Image Processing Method for Gaze Estimation and Eye Movement Event Detection","authors":"Joshua Emoto, Y. Hirata","doi":"10.2197/ipsjtbio.13.7","DOIUrl":null,"url":null,"abstract":": Advancements in technology have recently made it possible to obtain various types of biometric informa- tion from humans, enabling studies on estimation of human conditions in medicine, automobile safety, marketing, and other areas. These studies have particularly pointed to eye movement as an e ﬀ ective indicator of human conditions, and research on its applications is actively being pursued. The devices now widely used for measuring eye movements are based on the video-oculography (VOG) method, wherein the direction of gaze is estimated by processing eye images obtained through a camera. Applying convolutional neural networks (ConvNet) to the processing of eye images has been shown to enable accurate and robust gaze estimation. Conventional image processing, however, is premised on execution using a personal computer, making it di ﬃ cult to carry out real-time gaze estimation using ConvNet, which involves the use of a large number of parameters, in a small arithmetic unit. Also, detecting eye movement events, such as blinking and saccadic movements, from the inferred gaze direction sequence for particular purposes requires the use of a separate algorithm. We therefore propose a new eye image processing method that batch-processes gaze estimation and event detection from end to end using an independently designed lightweight ConvNet. This paper discusses the structure of the proposed lightweight ConvNet, the methods for learning and evaluation used, and the proposed method’s ability to simultaneously detect gaze direction and event occurrence using a smaller memory and at lower computational complexity than conventional methods.","PeriodicalId":38959,"journal":{"name":"IPSJ Transactions on Bioinformatics","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2197/ipsjtbio.13.7","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPSJ Transactions on Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2197/ipsjtbio.13.7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}

引用次数: 2

Abstract

: Advancements in technology have recently made it possible to obtain various types of biometric informa- tion from humans, enabling studies on estimation of human conditions in medicine, automobile safety, marketing, and other areas. These studies have particularly pointed to eye movement as an e ﬀ ective indicator of human conditions, and research on its applications is actively being pursued. The devices now widely used for measuring eye movements are based on the video-oculography (VOG) method, wherein the direction of gaze is estimated by processing eye images obtained through a camera. Applying convolutional neural networks (ConvNet) to the processing of eye images has been shown to enable accurate and robust gaze estimation. Conventional image processing, however, is premised on execution using a personal computer, making it di ﬃ cult to carry out real-time gaze estimation using ConvNet, which involves the use of a large number of parameters, in a small arithmetic unit. Also, detecting eye movement events, such as blinking and saccadic movements, from the inferred gaze direction sequence for particular purposes requires the use of a separate algorithm. We therefore propose a new eye image processing method that batch-processes gaze estimation and event detection from end to end using an independently designed lightweight ConvNet. This paper discusses the structure of the proposed lightweight ConvNet, the methods for learning and evaluation used, and the proposed method’s ability to simultaneously detect gaze direction and event occurrence using a smaller memory and at lower computational complexity than conventional methods.

查看原文本刊更多论文

用于注视估计和眼动事件检测的轻量级卷积神经网络图像处理方法

最近，技术的进步使得从人类身上获取各种生物特征信息成为可能，从而可以在医学、汽车安全、营销和其他领域对人类状况进行估计研究。这些研究特别指出，眼球运动是人类状况的有效指标，有关其应用的研究正在积极进行。目前广泛用于测量眼球运动的设备是基于视频视觉(VOG)方法，其中通过处理通过相机获得的眼睛图像来估计凝视方向。将卷积神经网络(ConvNet)应用于眼睛图像的处理已被证明可以实现准确和鲁棒的凝视估计。然而，传统的图像处理是以在个人计算机上执行为前提的，这使得使用卷积神经网络进行实时凝视估计很难，这涉及到在一个小的算术单元中使用大量参数。此外，为了特定目的，从推断的凝视方向序列中检测眼球运动事件，如眨眼和扫视运动，需要使用单独的算法。因此，我们提出了一种新的眼睛图像处理方法，该方法使用独立设计的轻量级卷积神经网络从端到端批量处理凝视估计和事件检测。本文讨论了所提出的轻量级ConvNet的结构，所使用的学习和评估方法，以及所提出的方法使用更小的内存和比传统方法更低的计算复杂度同时检测凝视方向和事件发生的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IPSJ Transactions on Bioinformatics Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (miscellaneous)

CiteScore

1.90

自引率

0.00%

发文量