基于多域双注意转换器融合网络和毫米波雷达的端到端人体运动识别

IF 10.9 2区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Consumer Electronics Pub Date : 2025-04-02 DOI:10.1109/TCE.2025.3557084

Chao Fang;Yong Wang;Mu Zhou;Wei He;Qian Zhang;Yu Pang;Bao Peng

{"title":"基于多域双注意转换器融合网络和毫米波雷达的端到端人体运动识别","authors":"Chao Fang;Yong Wang;Mu Zhou;Wei He;Qian Zhang;Yu Pang;Bao Peng","doi":"10.1109/TCE.2025.3557084","DOIUrl":null,"url":null,"abstract":"Human-computer interaction technology, by improving user experience and ease of use, drives innovation and growth in consumer electronics. As a noninvasive and noncontact sensing device, millimeter-wave radar has attracted great attention in human motion recognition for human-computer interaction. However, previous motion recognition models are generally based on radar echo data, i.e., images and point cloud, and single domain radar information, resulting in the loss of raw radar data information and a limited ability to capture the complementary global features. In this paper, a novel end-to-end joint global-local dual attention transformer model for human motion recognition using mmWave radar is proposed to address the above problem. First, we introduce a learnable complex transformation module to process raw radar signals for different inputs. Then, we design two important feature extraction modules, named dual residual attention module (DRAM) and dual coupled filter module (DCFM), to accurately extract the valuable motion information of the radar signal. Furthermore, a position encoding is utilized to obtain the time information of inputs feature and a transformer module is designed to get long-range context global information. Finally, the experimental results show that our proposed method achieves an average accuracy of 99.17% on the human gesture dataset and an average accuracy of 99.72% on the arm motion dataset, which demonstrates that our model has both high recognition accuracy and strong robustness.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"71 2","pages":"3252-3265"},"PeriodicalIF":10.9000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"End-to-End Human Motion Recognition With Multidomain Dual Attention Transformer Fusion Network and Millimeter-Wave Radar\",\"authors\":\"Chao Fang;Yong Wang;Mu Zhou;Wei He;Qian Zhang;Yu Pang;Bao Peng\",\"doi\":\"10.1109/TCE.2025.3557084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human-computer interaction technology, by improving user experience and ease of use, drives innovation and growth in consumer electronics. As a noninvasive and noncontact sensing device, millimeter-wave radar has attracted great attention in human motion recognition for human-computer interaction. However, previous motion recognition models are generally based on radar echo data, i.e., images and point cloud, and single domain radar information, resulting in the loss of raw radar data information and a limited ability to capture the complementary global features. In this paper, a novel end-to-end joint global-local dual attention transformer model for human motion recognition using mmWave radar is proposed to address the above problem. First, we introduce a learnable complex transformation module to process raw radar signals for different inputs. Then, we design two important feature extraction modules, named dual residual attention module (DRAM) and dual coupled filter module (DCFM), to accurately extract the valuable motion information of the radar signal. Furthermore, a position encoding is utilized to obtain the time information of inputs feature and a transformer module is designed to get long-range context global information. Finally, the experimental results show that our proposed method achieves an average accuracy of 99.17% on the human gesture dataset and an average accuracy of 99.72% on the arm motion dataset, which demonstrates that our model has both high recognition accuracy and strong robustness.\",\"PeriodicalId\":13208,\"journal\":{\"name\":\"IEEE Transactions on Consumer Electronics\",\"volume\":\"71 2\",\"pages\":\"3252-3265\"},\"PeriodicalIF\":10.9000,\"publicationDate\":\"2025-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Consumer Electronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10947482/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10947482/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

人机交互技术通过改善用户体验和易用性，推动了消费电子产品的创新和增长。毫米波雷达作为一种无创、非接触的传感设备，在人机交互的人体运动识别中受到了广泛的关注。然而，以往的运动识别模型通常基于雷达回波数据，即图像和点云，以及单域雷达信息，导致原始雷达数据信息的丢失，并且捕获互补全局特征的能力有限。针对上述问题，本文提出了一种基于毫米波雷达的端到端联合全局-局部双注意转换器人体运动识别模型。首先，我们引入了一个可学习的复变换模块来处理不同输入的原始雷达信号。然后，设计了双残余注意模块（DRAM）和双耦合滤波模块（DCFM）两个重要的特征提取模块，以准确提取雷达信号中有价值的运动信息。利用位置编码获取输入特征的时间信息，设计变压器模块获取远程上下文全局信息。最后，实验结果表明，该方法在人体手势数据集上的平均准确率为99.17%，在手臂运动数据集上的平均准确率为99.72%，表明该模型具有较高的识别精度和较强的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

End-to-End Human Motion Recognition With Multidomain Dual Attention Transformer Fusion Network and Millimeter-Wave Radar

Human-computer interaction technology, by improving user experience and ease of use, drives innovation and growth in consumer electronics. As a noninvasive and noncontact sensing device, millimeter-wave radar has attracted great attention in human motion recognition for human-computer interaction. However, previous motion recognition models are generally based on radar echo data, i.e., images and point cloud, and single domain radar information, resulting in the loss of raw radar data information and a limited ability to capture the complementary global features. In this paper, a novel end-to-end joint global-local dual attention transformer model for human motion recognition using mmWave radar is proposed to address the above problem. First, we introduce a learnable complex transformation module to process raw radar signals for different inputs. Then, we design two important feature extraction modules, named dual residual attention module (DRAM) and dual coupled filter module (DCFM), to accurately extract the valuable motion information of the radar signal. Furthermore, a position encoding is utilized to obtain the time information of inputs feature and a transformer module is designed to get long-range context global information. Finally, the experimental results show that our proposed method achieves an average accuracy of 99.17% on the human gesture dataset and an average accuracy of 99.72% on the arm motion dataset, which demonstrates that our model has both high recognition accuracy and strong robustness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Consumer Electronics 工程技术-电信学

CiteScore

7.70

自引率

9.30%

发文量

审稿时长

3.3 months

期刊介绍： The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.