{"title":"三维人体姿态增强和完成的生成式ai授权RFID传感","authors":"Ziqi Wang;Shiwen Mao","doi":"10.1109/OJCOMS.2025.3539705","DOIUrl":null,"url":null,"abstract":"Collecting paired Radio Frequency Identification (RFID) data and corresponding 3D human pose data is challenging due to practical limitations, such as the discomfort of wearing numerous RFID tags and the inconvenience of timestamp synchronization between RFID and camera data. We propose a novel framework that leverages latent diffusion transformers to generate high-quality, diverse RFID sensing data across multiple classes. This synthetic data augments limited datasets by training a transformer-based kinematics predictor to estimate 3D poses with temporal smoothness from RFID data. Most importantly, we introduce a latent diffusion transformer training stage with cross-attention conditioning and an inference design of two-stage velocity alignment to accurately infer missing joints in skeletal poses, completing full 25-joint configurations from partial 12-joint inputs. This is the first method to detect >20 distinct skeletal joints using Generative-AI technologies for any wireless sensing-based continuous 3D human pose estimation (HPE) task. The application is particularly important for RFID-based systems, which typically capture limited joint information due to RFID sensing constraints. Our approach can extend the applicability of wireless-based pose estimation in scenarios where collecting extensive paired datasets is impractical and achieving more fine-grained joint information is infeasible, such as pedestrian and health monitoring in occluded environments.","PeriodicalId":33803,"journal":{"name":"IEEE Open Journal of the Communications Society","volume":"6 ","pages":"1-1"},"PeriodicalIF":6.3000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10877927","citationCount":"0","resultStr":"{\"title\":\"Generative AI-Empowered RFID Sensing for 3D Human Pose Augmentation and Completion\",\"authors\":\"Ziqi Wang;Shiwen Mao\",\"doi\":\"10.1109/OJCOMS.2025.3539705\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Collecting paired Radio Frequency Identification (RFID) data and corresponding 3D human pose data is challenging due to practical limitations, such as the discomfort of wearing numerous RFID tags and the inconvenience of timestamp synchronization between RFID and camera data. We propose a novel framework that leverages latent diffusion transformers to generate high-quality, diverse RFID sensing data across multiple classes. This synthetic data augments limited datasets by training a transformer-based kinematics predictor to estimate 3D poses with temporal smoothness from RFID data. Most importantly, we introduce a latent diffusion transformer training stage with cross-attention conditioning and an inference design of two-stage velocity alignment to accurately infer missing joints in skeletal poses, completing full 25-joint configurations from partial 12-joint inputs. This is the first method to detect >20 distinct skeletal joints using Generative-AI technologies for any wireless sensing-based continuous 3D human pose estimation (HPE) task. The application is particularly important for RFID-based systems, which typically capture limited joint information due to RFID sensing constraints. Our approach can extend the applicability of wireless-based pose estimation in scenarios where collecting extensive paired datasets is impractical and achieving more fine-grained joint information is infeasible, such as pedestrian and health monitoring in occluded environments.\",\"PeriodicalId\":33803,\"journal\":{\"name\":\"IEEE Open Journal of the Communications Society\",\"volume\":\"6 \",\"pages\":\"1-1\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10877927\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of the Communications Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10877927/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Communications Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10877927/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Generative AI-Empowered RFID Sensing for 3D Human Pose Augmentation and Completion
Collecting paired Radio Frequency Identification (RFID) data and corresponding 3D human pose data is challenging due to practical limitations, such as the discomfort of wearing numerous RFID tags and the inconvenience of timestamp synchronization between RFID and camera data. We propose a novel framework that leverages latent diffusion transformers to generate high-quality, diverse RFID sensing data across multiple classes. This synthetic data augments limited datasets by training a transformer-based kinematics predictor to estimate 3D poses with temporal smoothness from RFID data. Most importantly, we introduce a latent diffusion transformer training stage with cross-attention conditioning and an inference design of two-stage velocity alignment to accurately infer missing joints in skeletal poses, completing full 25-joint configurations from partial 12-joint inputs. This is the first method to detect >20 distinct skeletal joints using Generative-AI technologies for any wireless sensing-based continuous 3D human pose estimation (HPE) task. The application is particularly important for RFID-based systems, which typically capture limited joint information due to RFID sensing constraints. Our approach can extend the applicability of wireless-based pose estimation in scenarios where collecting extensive paired datasets is impractical and achieving more fine-grained joint information is infeasible, such as pedestrian and health monitoring in occluded environments.
期刊介绍:
The IEEE Open Journal of the Communications Society (OJ-COMS) is an open access, all-electronic journal that publishes original high-quality manuscripts on advances in the state of the art of telecommunications systems and networks. The papers in IEEE OJ-COMS are included in Scopus. Submissions reporting new theoretical findings (including novel methods, concepts, and studies) and practical contributions (including experiments and development of prototypes) are welcome. Additionally, survey and tutorial articles are considered. The IEEE OJCOMS received its debut impact factor of 7.9 according to the Journal Citation Reports (JCR) 2023.
The IEEE Open Journal of the Communications Society covers science, technology, applications and standards for information organization, collection and transfer using electronic, optical and wireless channels and networks. Some specific areas covered include:
Systems and network architecture, control and management
Protocols, software, and middleware
Quality of service, reliability, and security
Modulation, detection, coding, and signaling
Switching and routing
Mobile and portable communications
Terminals and other end-user devices
Networks for content distribution and distributed computing
Communications-based distributed resources control.