Transforming label-efficient decoding of healthcare wearables with self-supervised learning and "embedded" medical domain expertise.

Xiao Gu, Zhangdaihong Liu, Jinpei Han, Jianing Qiu, Wenfei Fang, Lei Lu, Lei Clifton, Yuan-Ting Zhang, David A Clifton
{"title":"Transforming label-efficient decoding of healthcare wearables with self-supervised learning and \"embedded\" medical domain expertise.","authors":"Xiao Gu, Zhangdaihong Liu, Jinpei Han, Jianing Qiu, Wenfei Fang, Lei Lu, Lei Clifton, Yuan-Ting Zhang, David A Clifton","doi":"10.1038/s44172-025-00467-6","DOIUrl":null,"url":null,"abstract":"<p><p>Healthcare wearables are transforming health monitoring, generating vast and complex data in everyday free-living environments. While supervised deep learning has enabled tremendous advances in interpreting such data, it remains heavily dependent on large labeled datasets, which are often difficult and expensive to obtain in clinical practice. Self-supervised contrastive learning (SSCL) provides a promising alternative by learning from unlabeled data, but conventional SSCL frequently overlooks important physiological similarities by treating all non-identical instances as unrelated, which can result in suboptimal representations. In this study, we revisit the enduring value of domain knowledge \"embedded\" in traditional domain feature engineering pipelines and demonstrate how it can be used to guide SSCL. We introduce a framework that integrates clinically meaningful features-such as heart rate variability from electrocardiograms (ECGs)-into the contrastive learning process. These features guide the formation of more relevant positive pairs through nearest-neighbor matching and promote global structure through clustering-based prototype representations. Evaluated across diverse wearable technologies, our method achieves comparable performance with only 10% labeled data, compared to conventional SSCL approaches with full annotations for fine-tuning. This work highlights the indispensable and sustainable role of domain expertise in advancing machine learning for real-world healthcare, especially for healthcare wearables.</p>","PeriodicalId":72644,"journal":{"name":"Communications engineering","volume":"4 1","pages":"135"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12297484/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s44172-025-00467-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Healthcare wearables are transforming health monitoring, generating vast and complex data in everyday free-living environments. While supervised deep learning has enabled tremendous advances in interpreting such data, it remains heavily dependent on large labeled datasets, which are often difficult and expensive to obtain in clinical practice. Self-supervised contrastive learning (SSCL) provides a promising alternative by learning from unlabeled data, but conventional SSCL frequently overlooks important physiological similarities by treating all non-identical instances as unrelated, which can result in suboptimal representations. In this study, we revisit the enduring value of domain knowledge "embedded" in traditional domain feature engineering pipelines and demonstrate how it can be used to guide SSCL. We introduce a framework that integrates clinically meaningful features-such as heart rate variability from electrocardiograms (ECGs)-into the contrastive learning process. These features guide the formation of more relevant positive pairs through nearest-neighbor matching and promote global structure through clustering-based prototype representations. Evaluated across diverse wearable technologies, our method achieves comparable performance with only 10% labeled data, compared to conventional SSCL approaches with full annotations for fine-tuning. This work highlights the indispensable and sustainable role of domain expertise in advancing machine learning for real-world healthcare, especially for healthcare wearables.

通过自我监督学习和“嵌入式”医疗领域专业知识,转变医疗可穿戴设备的标签高效解码。
医疗可穿戴设备正在改变健康监测,在日常自由生活环境中产生大量复杂的数据。虽然监督式深度学习在解释此类数据方面取得了巨大进步,但它仍然严重依赖于大型标记数据集,而这些数据集在临床实践中通常难以获得且昂贵。自我监督对比学习(Self-supervised contrastive learning, SSCL)通过学习未标记的数据提供了一个很有前途的选择,但是传统的SSCL经常忽略重要的生理相似性,将所有不相同的实例视为不相关的,这可能导致次优表征。在本研究中,我们重新审视了“嵌入”在传统领域特征工程管道中的领域知识的持久价值,并展示了如何使用它来指导SSCL。我们引入了一个框架,将临床有意义的特征(如心电图(ECGs)的心率变异性)整合到对比学习过程中。这些特征通过最近邻匹配引导形成更相关的正对,并通过基于聚类的原型表示促进全局结构。通过对各种可穿戴技术的评估,与传统的SSCL方法相比,我们的方法在只有10%的标记数据的情况下取得了相当的性能,而传统的SSCL方法具有完整的注释进行微调。这项工作强调了领域专业知识在推动现实世界医疗保健,特别是医疗可穿戴设备的机器学习方面不可或缺和可持续的作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信