基于wi - fi的人体跌倒和活动识别，使用基于变压器的编码器-解码器和图神经网络

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2025-08-01 DOI:10.1109/JSEN.2025.3593126

Younggeol Cho;Elisa Motta;Olivia Nocentini;Marta Lagomarsino;Andrea Merello;Marco Crepaldi;Arash Ajoudani

{"title":"基于wi - fi的人体跌倒和活动识别，使用基于变压器的编码器-解码器和图神经网络","authors":"Younggeol Cho;Elisa Motta;Olivia Nocentini;Marta Lagomarsino;Andrea Merello;Marco Crepaldi;Arash Ajoudani","doi":"10.1109/JSEN.2025.3593126","DOIUrl":null,"url":null,"abstract":"Human pose estimation and action recognition have received attention due to their critical roles in healthcare monitoring, rehabilitation, and assistive technologies. In this study, we proposed a novel architecture named Transformer-based encoder–decoder network (TED-Net) designed for estimating human skeleton poses from Wi-Fi channel state information (CSI). The TED-Net integrates convolutional encoders with Transformer-based attention mechanisms to capture spatiotemporal features from CSI signals. The estimated skeleton poses were used as input to a customized directed graph neural network (DGNN) for action recognition. We validated our model on two datasets: a publicly available multimodal dataset for assessing general pose estimation, and a newly collected dataset focused on fall-related scenarios involving 20 participants. Experimental results demonstrated that the TED-Net outperformed existing approaches in pose estimation and that the DGNN achieves reliable action classification using CSI-based skeletons, with performance comparable to RGB-based systems. Notably, the TED-Net maintains robust performance across both fall and nonfall cases. These findings highlight the potential of CSI-driven human skeleton estimation for effective action recognition, particularly in home environments such as elderly fall detection. In such settings, Wi-Fi signals are often readily available, offering a privacy-preserving alternative to vision-based methods, which may raise concerns about continuous camera monitoring.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 18","pages":"34939-34947"},"PeriodicalIF":4.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Wi-Fi-Based Human Fall and Activity Recognition Using Transformer-Based Encoder–Decoder and Graph Neural Networks\",\"authors\":\"Younggeol Cho;Elisa Motta;Olivia Nocentini;Marta Lagomarsino;Andrea Merello;Marco Crepaldi;Arash Ajoudani\",\"doi\":\"10.1109/JSEN.2025.3593126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human pose estimation and action recognition have received attention due to their critical roles in healthcare monitoring, rehabilitation, and assistive technologies. In this study, we proposed a novel architecture named Transformer-based encoder–decoder network (TED-Net) designed for estimating human skeleton poses from Wi-Fi channel state information (CSI). The TED-Net integrates convolutional encoders with Transformer-based attention mechanisms to capture spatiotemporal features from CSI signals. The estimated skeleton poses were used as input to a customized directed graph neural network (DGNN) for action recognition. We validated our model on two datasets: a publicly available multimodal dataset for assessing general pose estimation, and a newly collected dataset focused on fall-related scenarios involving 20 participants. Experimental results demonstrated that the TED-Net outperformed existing approaches in pose estimation and that the DGNN achieves reliable action classification using CSI-based skeletons, with performance comparable to RGB-based systems. Notably, the TED-Net maintains robust performance across both fall and nonfall cases. These findings highlight the potential of CSI-driven human skeleton estimation for effective action recognition, particularly in home environments such as elderly fall detection. In such settings, Wi-Fi signals are often readily available, offering a privacy-preserving alternative to vision-based methods, which may raise concerns about continuous camera monitoring.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 18\",\"pages\":\"34939-34947\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11107310/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11107310/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

人体姿态估计和动作识别因其在医疗监测、康复和辅助技术中的关键作用而受到关注。在这项研究中，我们提出了一种基于变压器的编码器-解码器网络（TED-Net）的新架构，用于从Wi-Fi信道状态信息（CSI）中估计人体骨骼姿势。TED-Net将卷积编码器与基于变压器的注意力机制集成在一起，从CSI信号中捕获时空特征。将估计的骨骼姿态作为自定义有向图神经网络（DGNN）的输入，用于动作识别。我们在两个数据集上验证了我们的模型：一个是用于评估一般姿势估计的公开多模态数据集，另一个是新收集的数据集，主要关注与跌倒相关的场景，涉及20名参与者。实验结果表明，TED-Net在姿态估计方面优于现有方法，DGNN使用基于csi的骨架实现了可靠的动作分类，其性能与基于rgb的系统相当。值得注意的是，TED-Net在跌倒和非跌倒的情况下都保持了良好的性能。这些发现突出了csi驱动的人体骨骼估计在有效动作识别方面的潜力，特别是在家庭环境中，如老年人跌倒检测。在这种情况下，Wi-Fi信号通常很容易获得，这为基于视觉的方法提供了一种保护隐私的选择，这种方法可能会引起对连续摄像头监控的担忧。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Wi-Fi-Based Human Fall and Activity Recognition Using Transformer-Based Encoder–Decoder and Graph Neural Networks

Human pose estimation and action recognition have received attention due to their critical roles in healthcare monitoring, rehabilitation, and assistive technologies. In this study, we proposed a novel architecture named Transformer-based encoder–decoder network (TED-Net) designed for estimating human skeleton poses from Wi-Fi channel state information (CSI). The TED-Net integrates convolutional encoders with Transformer-based attention mechanisms to capture spatiotemporal features from CSI signals. The estimated skeleton poses were used as input to a customized directed graph neural network (DGNN) for action recognition. We validated our model on two datasets: a publicly available multimodal dataset for assessing general pose estimation, and a newly collected dataset focused on fall-related scenarios involving 20 participants. Experimental results demonstrated that the TED-Net outperformed existing approaches in pose estimation and that the DGNN achieves reliable action classification using CSI-based skeletons, with performance comparable to RGB-based systems. Notably, the TED-Net maintains robust performance across both fall and nonfall cases. These findings highlight the potential of CSI-driven human skeleton estimation for effective action recognition, particularly in home environments such as elderly fall detection. In such settings, Wi-Fi signals are often readily available, offering a privacy-preserving alternative to vision-based methods, which may raise concerns about continuous camera monitoring.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice