IEEE Sensors Letters, vol. 9, no. 6, pp. 1-4 (Journal Article, published 2025-03-22)
Authors: Agastasya Dahiya; Rohan Katti; Luigi G. Occhipinti
DOI: 10.1109/LSENS.2025.3572736
URL: https://ieeexplore.ieee.org/document/11009135/
Journal impact factor: 2.2; JCR: Q3 (Engineering, Electrical & Electronic)
Real-Time Hand Gesture Classification Using Infrared Sensor Arrays-Based Wearable Bracelet and Efficient 1-D Convolutional Neural Network
Hand gesture recognition is pivotal for intuitive human–machine interaction, particularly in healthcare and assistive technologies, where traditional interfaces (e.g., keyboards) are impractical. Existing modalities, such as electromyography and inertial sensors (inertial measurement units), struggle with noise sensitivity, motion dependence, or limited resolution for fine gestures. This work proposes infrared sensing as a robust alternative, leveraging reflected light patterns to capture both macrogestures and microgestures without relying on muscle activity or pronounced arm movements. We conducted experiments and compared different architectures to ensure correct classification of hand gestures with the lowest possible latency for real-time processing. Experimental results demonstrate that shallow two-layer 1-D convolutional neural networks (CNNs) achieve rapid inference (3 ms) and a minimal memory footprint (56 kB) but suffer from low accuracy (81.45%), while deeper 12-layer CNNs attain 98.29% accuracy at prohibitive cost (176 ms latency, 17.4 MB memory). A six-layer 1-D CNN strikes an optimal balance, delivering 95.97% accuracy with moderate resources (56 ms latency, 640 kB memory), outperforming similarly accurate long short-term memory (94.88%, 136 ms) and recurrent neural network (96.59%, 102 ms) models. Confusion matrix analysis confirms consistent performance across seven gestures, including nuanced distinctions such as thumb–index versus thumb–pinky pinches. By optimizing architectural depth and sensor integration, this work enables real-time operation on microcontrollers, such as the STM32F7, advancing applications in touchless medical interfaces and assistive devices for users with motor impairments.
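To make the resource trade-off concrete, the following is a minimal NumPy sketch of the kind of six-layer 1-D CNN the abstract describes: stacked 1-D convolutions over a short window of IR sensor channels, ending in a 7-way gesture classifier. The channel widths, kernel size, window length (8 channels × 256 samples), and pooling scheme are illustrative assumptions, not the authors' published architecture, and the weights are random rather than trained.

```python
import numpy as np

def conv1d(x, w, b):
    """Valid-mode 1-D convolution: x (C_in, T), w (C_out, C_in, K), b (C_out,)."""
    c_out, c_in, k = w.shape
    t_out = x.shape[1] - k + 1
    # Sliding windows over time, then contract over input channels and kernel taps.
    windows = np.stack([x[:, i:i + t_out] for i in range(k)], axis=-1)  # (C_in, T_out, K)
    return np.einsum('oik,itk->ot', w, windows) + b[:, None]

def relu(x):
    return np.maximum(x, 0.0)

def max_pool1d(x, p=2):
    """Non-overlapping temporal max pooling; trims a trailing odd sample."""
    t = x.shape[1] - x.shape[1] % p
    return x[:, :t].reshape(x.shape[0], -1, p).max(axis=-1)

rng = np.random.default_rng(0)

def make_layer(c_in, c_out, k=3):
    return 0.1 * rng.standard_normal((c_out, c_in, k)), np.zeros(c_out)

# Six conv layers over an assumed window of 8 IR channels x 256 samples.
channels = [8, 16, 16, 32, 32, 64, 64]
layers = [make_layer(channels[i], channels[i + 1]) for i in range(6)]
w_fc = 0.1 * rng.standard_normal((7, 64))  # 7 gesture classes, 64 pooled features

def forward(x):
    for w, b in layers:
        x = max_pool1d(relu(conv1d(x, w, b)))
    feats = x.mean(axis=1)          # global average pooling over time -> (64,)
    return w_fc @ feats             # class logits -> (7,)

x = rng.standard_normal((8, 256))   # one window of IR sensor readings
logits = forward(x)
print(logits.shape)
```

A fixed-size forward pass like this maps naturally onto a microcontroller deployment (e.g., via STM32Cube.AI on the STM32F7 mentioned in the abstract), since every buffer size is known at compile time; depth and channel width then directly set the latency/memory budget the paper tabulates.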