基于多域双注意转换器融合网络和毫米波雷达的端到端人体运动识别

IF 10.9 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Chao Fang;Yong Wang;Mu Zhou;Wei He;Qian Zhang;Yu Pang;Bao Peng
{"title":"基于多域双注意转换器融合网络和毫米波雷达的端到端人体运动识别","authors":"Chao Fang;Yong Wang;Mu Zhou;Wei He;Qian Zhang;Yu Pang;Bao Peng","doi":"10.1109/TCE.2025.3557084","DOIUrl":null,"url":null,"abstract":"Human-computer interaction technology, by improving user experience and ease of use, drives innovation and growth in consumer electronics. As a noninvasive and noncontact sensing device, millimeter-wave radar has attracted great attention in human motion recognition for human-computer interaction. However, previous motion recognition models are generally based on radar echo data, i.e., images and point cloud, and single domain radar information, resulting in the loss of raw radar data information and a limited ability to capture the complementary global features. In this paper, a novel end-to-end joint global-local dual attention transformer model for human motion recognition using mmWave radar is proposed to address the above problem. First, we introduce a learnable complex transformation module to process raw radar signals for different inputs. Then, we design two important feature extraction modules, named dual residual attention module (DRAM) and dual coupled filter module (DCFM), to accurately extract the valuable motion information of the radar signal. Furthermore, a position encoding is utilized to obtain the time information of inputs feature and a transformer module is designed to get long-range context global information. Finally, the experimental results show that our proposed method achieves an average accuracy of 99.17% on the human gesture dataset and an average accuracy of 99.72% on the arm motion dataset, which demonstrates that our model has both high recognition accuracy and strong robustness.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"71 2","pages":"3252-3265"},"PeriodicalIF":10.9000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"End-to-End Human Motion Recognition With Multidomain Dual Attention Transformer Fusion Network and Millimeter-Wave Radar\",\"authors\":\"Chao Fang;Yong Wang;Mu Zhou;Wei He;Qian Zhang;Yu Pang;Bao Peng\",\"doi\":\"10.1109/TCE.2025.3557084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human-computer interaction technology, by improving user experience and ease of use, drives innovation and growth in consumer electronics. As a noninvasive and noncontact sensing device, millimeter-wave radar has attracted great attention in human motion recognition for human-computer interaction. However, previous motion recognition models are generally based on radar echo data, i.e., images and point cloud, and single domain radar information, resulting in the loss of raw radar data information and a limited ability to capture the complementary global features. In this paper, a novel end-to-end joint global-local dual attention transformer model for human motion recognition using mmWave radar is proposed to address the above problem. First, we introduce a learnable complex transformation module to process raw radar signals for different inputs. Then, we design two important feature extraction modules, named dual residual attention module (DRAM) and dual coupled filter module (DCFM), to accurately extract the valuable motion information of the radar signal. Furthermore, a position encoding is utilized to obtain the time information of inputs feature and a transformer module is designed to get long-range context global information. Finally, the experimental results show that our proposed method achieves an average accuracy of 99.17% on the human gesture dataset and an average accuracy of 99.72% on the arm motion dataset, which demonstrates that our model has both high recognition accuracy and strong robustness.\",\"PeriodicalId\":13208,\"journal\":{\"name\":\"IEEE Transactions on Consumer Electronics\",\"volume\":\"71 2\",\"pages\":\"3252-3265\"},\"PeriodicalIF\":10.9000,\"publicationDate\":\"2025-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Consumer Electronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10947482/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10947482/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

人机交互技术通过改善用户体验和易用性,推动了消费电子产品的创新和增长。毫米波雷达作为一种无创、非接触的传感设备,在人机交互的人体运动识别中受到了广泛的关注。然而,以往的运动识别模型通常基于雷达回波数据,即图像和点云,以及单域雷达信息,导致原始雷达数据信息的丢失,并且捕获互补全局特征的能力有限。针对上述问题,本文提出了一种基于毫米波雷达的端到端联合全局-局部双注意转换器人体运动识别模型。首先,我们引入了一个可学习的复变换模块来处理不同输入的原始雷达信号。然后,设计了双残余注意模块(DRAM)和双耦合滤波模块(DCFM)两个重要的特征提取模块,以准确提取雷达信号中有价值的运动信息。利用位置编码获取输入特征的时间信息,设计变压器模块获取远程上下文全局信息。最后,实验结果表明,该方法在人体手势数据集上的平均准确率为99.17%,在手臂运动数据集上的平均准确率为99.72%,表明该模型具有较高的识别精度和较强的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
End-to-End Human Motion Recognition With Multidomain Dual Attention Transformer Fusion Network and Millimeter-Wave Radar
Human-computer interaction technology, by improving user experience and ease of use, drives innovation and growth in consumer electronics. As a noninvasive and noncontact sensing device, millimeter-wave radar has attracted great attention in human motion recognition for human-computer interaction. However, previous motion recognition models are generally based on radar echo data, i.e., images and point cloud, and single domain radar information, resulting in the loss of raw radar data information and a limited ability to capture the complementary global features. In this paper, a novel end-to-end joint global-local dual attention transformer model for human motion recognition using mmWave radar is proposed to address the above problem. First, we introduce a learnable complex transformation module to process raw radar signals for different inputs. Then, we design two important feature extraction modules, named dual residual attention module (DRAM) and dual coupled filter module (DCFM), to accurately extract the valuable motion information of the radar signal. Furthermore, a position encoding is utilized to obtain the time information of inputs feature and a transformer module is designed to get long-range context global information. Finally, the experimental results show that our proposed method achieves an average accuracy of 99.17% on the human gesture dataset and an average accuracy of 99.72% on the arm motion dataset, which demonstrates that our model has both high recognition accuracy and strong robustness.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.70
自引率
9.30%
发文量
59
审稿时长
3.3 months
期刊介绍: The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信