Zhenzhen Huang , Jiukai Deng , Shengzhi Wang , Chaogang Tang , Shuo Xiao
{"title":"TFC: Time–frequency contrasting network for wearable-based human activity recognition","authors":"Zhenzhen Huang , Jiukai Deng , Shengzhi Wang , Chaogang Tang , Shuo Xiao","doi":"10.1016/j.knosys.2025.113373","DOIUrl":null,"url":null,"abstract":"<div><div>Human Activity Recognition (HAR) using sensor data has significantly progressed with various supervised learning architectures, including both traditional CNN/LSTM models and the more recent Transformer-based models. A primary challenge in supervised learning is the requirement for extensive, accurately labeled training data. Self-supervised methods, particularly those employing contrastive learning, offer an innovative solution to this challenge by leveraging unlabeled data. In this study, we introduce a novel self-supervised learning method named Time–Frequency Contrasting (TFC) for limited labeled data in HAR, rooted in the principles of contrastive learning and Bayesian structural time series. This approach interprets sensor data as a combination of time-domain trends, frequency-domain cycles, and error noise. Our objective is to learn a universal representation so as to enhance the performance of human activity recognition in downstream tasks. This is achieved by minimizing the impact of redundant noise and leveraging time-domain prior knowledge to learn time-domain trend features and utilizing frequency-domain prior knowledge to acquire frequency-domain cycle features, respectively. After fine-tuning, TFC achieved Macro F1-scores of 86.39, 95.44, and 80.27 on three publicly available datasets, namely MotionSense, USC-HAD, and UCI-HAR. Additionally, it obtained a F1-score of 97.64 on a custom-built dataset called BARD. Our extensive experiment demonstrate that TFC markedly improves self-supervised activity recognition tasks, especially in scenarios with limited labeled data.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"319 ","pages":"Article 113373"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125004204","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Human Activity Recognition (HAR) using sensor data has significantly progressed with various supervised learning architectures, including both traditional CNN/LSTM models and the more recent Transformer-based models. A primary challenge in supervised learning is the requirement for extensive, accurately labeled training data. Self-supervised methods, particularly those employing contrastive learning, offer an innovative solution to this challenge by leveraging unlabeled data. In this study, we introduce a novel self-supervised learning method named Time–Frequency Contrasting (TFC) for limited labeled data in HAR, rooted in the principles of contrastive learning and Bayesian structural time series. This approach interprets sensor data as a combination of time-domain trends, frequency-domain cycles, and error noise. Our objective is to learn a universal representation so as to enhance the performance of human activity recognition in downstream tasks. This is achieved by minimizing the impact of redundant noise and leveraging time-domain prior knowledge to learn time-domain trend features and utilizing frequency-domain prior knowledge to acquire frequency-domain cycle features, respectively. After fine-tuning, TFC achieved Macro F1-scores of 86.39, 95.44, and 80.27 on three publicly available datasets, namely MotionSense, USC-HAD, and UCI-HAR. Additionally, it obtained a F1-score of 97.64 on a custom-built dataset called BARD. Our extensive experiment demonstrate that TFC markedly improves self-supervised activity recognition tasks, especially in scenarios with limited labeled data.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.