Sparsity-Driven Gesture Recognition Using Lightweight TCNFormer Networks in Millimeter-Wave Radar

Hao Wu; Biao Jin; Zhenkai Zhang; Zhuxian Lian; Baoxiong Xu; Jin Liang; Xiangqun Zhang; Genyuan Du

IEEE Transactions on Instrumentation and Measurement, vol. 74, pp. 1-12. Published 2025-06-02. DOI: 10.1109/TIM.2025.3576011
Gesture recognition with millimeter-wave radar has broad application prospects in human-computer interaction. However, the traditional recognition methods generate overly redundant features and construct large-scale networks, rendering them unsuitable for embedded devices with limited memory. To address this challenge, we propose a sparse-driven dynamic gesture recognition network in millimeter-wave radar, named time convolutional network and transFormer (TCNFormer). First, we employ a 2-D fast Fourier transform (2D-FFT) to obtain the range-Doppler maps (RDMs). These maps are then processed through incoherent integration of multiple frames to produce Doppler-time maps (DTMs). We subsequently use the orthogonal matching pursuit (OMP) algorithm to achieve a sparse representation of the Doppler-time trajectories and integrate the RDM to extract the range features of gestures, obtaining the multidimensional sparse sequences encompassing the range-Doppler-time feature. We then design a TCNFormer network tailored to these multidimensional sparse sequences. This network leverages a shallow TCN to learn local features, a Transformer network to capture global features and an adaptive weighting method to fuse these local and global features effectively. Experimental results demonstrate that our network fully exploits the sparse multidimensional sequences, achieving a recognition accuracy of 99.17% on a self-built dataset. The parameter size of the network is only 0.13 M, significantly outperforming the existing state-of-the-art models in relevant metrics, thereby proving its suitability for embedded applications in human-computer interaction.
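The preprocessing chain described in the abstract (2D-FFT per frame to form range-Doppler maps, then incoherent integration across frames to form Doppler-time maps) can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the cube dimensions, the use of a sum over the range axis as the incoherent integration, and the simulated input are all assumptions.

```python
import numpy as np

# Hypothetical sketch of the preprocessing chain: a 2D-FFT over fast time
# (range) and slow time (Doppler) gives a range-Doppler map (RDM) per frame;
# summing each RDM's power over range collapses it into a Doppler profile,
# and stacking profiles across frames yields a Doppler-time map (DTM).
# Frame counts and sizes below are illustrative only.
N_FAST, N_SLOW, N_FRAMES = 64, 32, 16

rng = np.random.default_rng(0)
# Simulated raw radar cube: frames x slow-time chirps x fast-time samples.
cube = rng.standard_normal((N_FRAMES, N_SLOW, N_FAST)) \
     + 1j * rng.standard_normal((N_FRAMES, N_SLOW, N_FAST))

# 2D-FFT: range FFT along fast time, Doppler FFT along slow time,
# with the Doppler axis shifted so zero velocity sits in the center.
rdms = np.fft.fftshift(np.fft.fft2(cube, axes=(1, 2)), axes=1)
power = np.abs(rdms) ** 2            # (frames, Doppler bins, range bins)

# Incoherent integration: sum power over range per frame, then stack frames
# column-wise so rows are Doppler bins and columns are time (frames).
dtm = power.sum(axis=2).T            # shape: (Doppler bins, frames)
print(dtm.shape)                     # (32, 16)
```

In a real pipeline the range features the paper mentions would be read off the per-frame RDMs before the range axis is collapsed.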
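The abstract's sparse representation of the Doppler-time trajectories relies on orthogonal matching pursuit (OMP). A minimal generic OMP, shown here on a synthetic dictionary rather than the paper's radar data, greedily selects the dictionary atom most correlated with the residual and re-solves a least-squares problem over the selected support at each step; the dictionary shape and sparsity level are assumptions for illustration.

```python
import numpy as np

def omp(A, y, k):
    """Greedy orthogonal matching pursuit: select up to k atoms of the
    dictionary A by largest residual correlation, re-solving the least-squares
    fit over the current support after every selection."""
    residual = y.copy()
    support = []
    x = np.zeros(A.shape[1])
    for _ in range(k):
        j = int(np.argmax(np.abs(A.T @ residual)))
        if j not in support:
            support.append(j)
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    x[support] = coef
    return x

rng = np.random.default_rng(1)
A = rng.standard_normal((40, 100))
A /= np.linalg.norm(A, axis=0)              # unit-norm dictionary atoms
x_true = np.zeros(100)
x_true[[5, 37, 80]] = [2.0, -1.5, 1.0]      # 3-sparse ground truth
y = A @ x_true
x_hat = omp(A, y, k=3)
print(sorted(np.nonzero(x_hat)[0]))         # recovered support
```

For a 3-sparse signal against an incoherent random dictionary, OMP typically recovers the exact support, so the reconstruction residual is near zero.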
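The abstract states that local (shallow-TCN) and global (Transformer) features are fused by an adaptive weighting method but does not give the rule. One common realization, used here purely as an assumption, is a learned sigmoid gate that forms a per-feature convex combination of the two branches; the gate parameters `w` and `b` and all shapes are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def adaptive_fuse(local_feat, global_feat, w, b):
    """Compute a per-feature gate alpha from the concatenated branch outputs,
    then return the convex combination alpha*local + (1-alpha)*global."""
    gate_in = np.concatenate([local_feat, global_feat], axis=-1)
    alpha = sigmoid(gate_in @ w + b)        # (batch, d), each entry in (0, 1)
    return alpha * local_feat + (1.0 - alpha) * global_feat

rng = np.random.default_rng(2)
d = 8
local_feat = rng.standard_normal((4, d))    # stand-in for shallow-TCN output
global_feat = rng.standard_normal((4, d))   # stand-in for Transformer output
w = rng.standard_normal((2 * d, d)) * 0.1   # hypothetical gate weights
b = np.zeros(d)
fused = adaptive_fuse(local_feat, global_feat, w, b)
print(fused.shape)                          # (4, 8)
```

Because the gate output lies strictly in (0, 1), every fused value stays between the corresponding local and global feature values, which keeps the fusion stable regardless of the gate parameters.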
Journal Introduction:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.