AFERSM-Net: Joint Network for Gesture Recognition and Location Classification
Xu Lu; Zexiao Cai; Xiongwei Huang; Cheng Zhou; Jun Liu
IEEE Latin America Transactions, vol. 23, no. 6, pp. 462-471. Published 2025-03-19. DOI: 10.1109/TLA.2025.11007187
Impact factor: 1.3 (JCR Q3, Computer Science, Information Systems)
https://ieeexplore.ieee.org/document/11007187/
Citation count: 0
Abstract
With the widespread deployment of wireless communication systems and smart devices, WiFi-based gesture recognition and indoor location classification are increasingly used. The underlying principle is to identify human activities and locations by extracting gesture and location features from WiFi channel state information (CSI). However, during CSI acquisition the signal is susceptible to environmental interference, which produces multipath noise, and amplitude changes caused by changes in location often hinder the extraction and recognition of gesture features. To address these problems, the Auxiliary Feature Extraction based Residual Shrinkage Multi-tasking Network (AFERSM-Net) is proposed for gesture recognition and location classification of one-dimensional multivariate time series. AFERSM-Net is a hybrid architecture that combines CNNs for spatial feature extraction with LSTM networks for capturing temporal dependencies. First, the shrinkage module adaptively sets a threshold to dynamically identify and eliminate the transformed environmental noise. Second, the feature extraction module focuses on and extracts location-independent gesture features, reducing the influence of location on gesture recognition. Finally, the gesture features extracted by the feature extraction module are fused, as an auxiliary input, with the shared features of the residual shrinkage multi-tasking network. This fusion is mainly intended to improve gesture recognition accuracy and to address insufficient model generalization. We evaluated the network on a dual-labeled gesture and location dataset, where it achieved a gesture recognition accuracy of 97.84% and a location classification accuracy of 98.92%, outperforming the other advanced network frameworks compared.
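The abstract does not say how the shrinkage module chooses its threshold. As a rough illustration only, the sketch below assumes the rule commonly used in residual shrinkage networks: soft thresholding with a per-channel threshold equal to a sigmoid-gated fraction of the channel's mean absolute value. The function names and the `gate_logit` parameter are hypothetical and stand in for the output of a small learned sub-network; this is not the authors' implementation.

```python
import math


def soft_threshold(x, tau):
    """Shrink x toward zero by tau; any value with |x| <= tau becomes 0."""
    magnitude = abs(x) - tau
    if magnitude <= 0:
        return 0.0
    return magnitude if x > 0 else -magnitude


def adaptive_shrink(channel, gate_logit):
    """Soft-threshold one non-empty feature channel with a data-dependent threshold.

    Assumed rule (not taken from the paper):
        tau = sigmoid(gate_logit) * mean(|channel|)
    so tau always lies between 0 and the channel's mean absolute value.
    """
    scale = 1.0 / (1.0 + math.exp(-gate_logit))  # sigmoid, in (0, 1)
    tau = scale * sum(abs(v) for v in channel) / len(channel)
    return [soft_threshold(v, tau) for v in channel], tau


if __name__ == "__main__":
    # Small fluctuations around zero play the role of noise; 3.0 is the signal.
    shrunk, tau = adaptive_shrink([0.1, -0.2, 3.0, -0.1, 0.2], gate_logit=0.0)
    print(tau, shrunk)
```

With this rule, small-magnitude activations are zeroed while large ones are kept, reduced by the threshold. That is the sense in which a threshold "set adaptively" can suppress noise without a hand-tuned cutoff.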
About the journal:
IEEE Latin America Transactions (IEEE LATAM) is an interdisciplinary journal focused on the dissemination of original, high-quality research papers and review articles in Spanish and Portuguese on emerging topics in three main areas: Computing, Electric Energy and Electronics. Sub-areas of the journal include, but are not limited to: automatic control, communications, instrumentation, artificial intelligence, power and industrial electronics, fault diagnosis and detection, transportation electrification, internet of things, electrical machines, circuits and systems, biomedicine and biomedical / haptic applications, secure communications, robotics, sensors and actuators, computer networks, and smart grids.