Robust Skeletal-Graph Reconstruction Using mmWave Radar and its Application for Human-Activity Recognition

IEEE Journal of Selected Areas in Sensors Pub Date : 2025-06-19 DOI:10.1109/JSAS.2025.3581498

Ta-Wei Wu;Shih-Hau Fang;Hsiao-Chun Wu;Guannan Liu;Kun Yan

{"title":"Robust Skeletal-Graph Reconstruction Using mmWave Radar and its Application for Human-Activity Recognition","authors":"Ta-Wei Wu;Shih-Hau Fang;Hsiao-Chun Wu;Guannan Liu;Kun Yan","doi":"10.1109/JSAS.2025.3581498","DOIUrl":null,"url":null,"abstract":"Skeletal graphs can represent concise and reliable features for human-activity recognition in recent years. However, they have to be acquired by Kinect sensors or regular cameras, which rely on sufficient lighting. Meanwhile, skeletal graphs can only be created from the front views of sensors and cameras in the absence of any obstacle. The above stated restrictions limit the practical applicability of skeletal graphs. Therefore, in this work, we would like to investigate robust skeletal-graph reconstruction using milimeter-wave (mmWave) radar. The mmWave radar, which does not require light-of-sight propagation for data acquisition, can be equipped anywhere in room and operates in darkness so that it can overcome the aforementioned drawbacks. In this work, we propose to utilize the double-view cumulative numbers of radar-cloud points, temporal differentials in cumulative numbers of radar-cloud points, and Doppler velocities as the input features and adopt the deep-learning network integrating convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM). To fully investigate the effectiveness of our proposed new deep-learning network for robust skeletal-graph reconstruction, we evaluate the reconstruction accuracies in terms of mean absolute errorssubject to the human-location and human-orientation mismatches between the training and testing stages. Furthermore, we also investigate the advantage of our proposed novel robust skeletal-graph reconstruction approach in human-activity recognition since human-activity recognition turns out to be a primary application of skeletal graphs. We also compare the performances of our proposed new approach and two prevalent methods, namely, mmPose-natural language processing and BiLSTM in conjunction with CNN using the 3-D coordinates, signal-to-noise ratios, and Doppler velocites as the input features. Our experiments show that our proposed new approach outperforms the aforementioned two existing methods in both skeletal-graph reconstruction and human-activity recognition.","PeriodicalId":100622,"journal":{"name":"IEEE Journal of Selected Areas in Sensors","volume":"2 ","pages":"199-211"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11045161","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Areas in Sensors","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11045161/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Skeletal graphs can represent concise and reliable features for human-activity recognition in recent years. However, they have to be acquired by Kinect sensors or regular cameras, which rely on sufficient lighting. Meanwhile, skeletal graphs can only be created from the front views of sensors and cameras in the absence of any obstacle. The above stated restrictions limit the practical applicability of skeletal graphs. Therefore, in this work, we would like to investigate robust skeletal-graph reconstruction using milimeter-wave (mmWave) radar. The mmWave radar, which does not require light-of-sight propagation for data acquisition, can be equipped anywhere in room and operates in darkness so that it can overcome the aforementioned drawbacks. In this work, we propose to utilize the double-view cumulative numbers of radar-cloud points, temporal differentials in cumulative numbers of radar-cloud points, and Doppler velocities as the input features and adopt the deep-learning network integrating convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM). To fully investigate the effectiveness of our proposed new deep-learning network for robust skeletal-graph reconstruction, we evaluate the reconstruction accuracies in terms of mean absolute errorssubject to the human-location and human-orientation mismatches between the training and testing stages. Furthermore, we also investigate the advantage of our proposed novel robust skeletal-graph reconstruction approach in human-activity recognition since human-activity recognition turns out to be a primary application of skeletal graphs. We also compare the performances of our proposed new approach and two prevalent methods, namely, mmPose-natural language processing and BiLSTM in conjunction with CNN using the 3-D coordinates, signal-to-noise ratios, and Doppler velocites as the input features. Our experiments show that our proposed new approach outperforms the aforementioned two existing methods in both skeletal-graph reconstruction and human-activity recognition.

查看原文本刊更多论文

毫米波雷达鲁棒骨架图重建及其在人体活动识别中的应用

近年来，骨胳图以其简洁、可靠的特征作为人体活动识别的基础。然而，它们必须通过Kinect传感器或普通摄像头来获取，这依赖于充足的照明。同时，骨架图只能在没有任何障碍物的情况下从传感器和摄像头的前视图创建。上述限制限制了骨架图的实际适用性。因此，在这项工作中，我们想研究使用毫米波（mmWave）雷达的鲁棒骨骼图重建。毫米波雷达不需要光传播来获取数据，可以安装在房间的任何地方，并且可以在黑暗中工作，因此可以克服上述缺点。在这项工作中，我们提出利用雷达云点的双视点累积数、雷达云点累积数的时间差和多普勒速度作为输入特征，并采用卷积神经网络（CNN）和双向长短期记忆（BiLSTM）相结合的深度学习网络。为了充分研究我们提出的新的深度学习网络在鲁棒骨骼图重建方面的有效性，我们根据训练和测试阶段之间人类位置和人类方向不匹配的平均绝对误差来评估重建精度。此外，我们还研究了我们提出的新的鲁棒骨骼图重建方法在人类活动识别中的优势，因为人类活动识别是骨骼图的主要应用。我们还比较了我们提出的新方法和两种流行的方法的性能，即mmpose -自然语言处理和BiLSTM结合CNN，使用三维坐标、信噪比和多普勒速度作为输入特征。我们的实验表明，我们提出的新方法在骨骼图重建和人体活动识别方面都优于上述两种现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Journal of Selected Areas in Sensors

自引率

0.00%

发文量