Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid-Self-Supervised Learning Framework.

IF 10.5 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Cybernetics Pub Date : 2025-09-09 DOI:10.1109/TCYB.2025.3603608

Cheol-Hui Lee, Hakseung Kim, Byung Chul Yoon, Dong-Joo Kim

{"title":"Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid-Self-Supervised Learning Framework.","authors":"Cheol-Hui Lee, Hakseung Kim, Byung Chul Yoon, Dong-Joo Kim","doi":"10.1109/TCYB.2025.3603608","DOIUrl":null,"url":null,"abstract":"<p><p>Sleep is essential for maintaining human health and quality of life. Analyzing physiological signals during sleep is critical in assessing sleep quality and diagnosing sleep disorders. However, manual diagnoses by clinicians are time-intensive and subjective. Despite advances in deep learning that have enhanced automation, these approaches remain heavily dependent on large-scale labeled datasets. This study introduces SynthSleepNet, a multimodal hybrid-self-supervised learning (SSL) framework designed for analyzing polysomnography (PSG) data. SynthSleepNet effectively integrates masked prediction and contrastive learning to leverage complementary features across multiple modalities, including electroencephalogram (EEG), electrooculography (EOG), electromyography (EMG), and electrocardiogram (ECG). This approach enables the model to learn highly expressive representations of PSG data. Furthermore, a TCM based on Mamba was developed to efficiently capture contextual information across signals. SynthSleepNet achieved superior performance compared to state-of-the-art methods across three downstream tasks: sleep-stage classification, apnea detection, and hypopnea detection, with accuracies of 89.89%, 99.75%, and 89.60%, respectively. The model demonstrated robust performance in a semi-SSL environment with limited labels, achieving accuracies of 87.98%, 99.37%, and 77.52% in the same tasks. These results underscore the potential of the model as a foundational tool for the comprehensive analysis of PSG data. SynthSleepNet demonstrates comprehensively superior performance across multiple downstream tasks compared to other methodologies, making it expected to set a new standard for sleep disorder monitoring and diagnostic systems. The source code is available at https://github.com/dlcjfgmlnasa/SynthSleepNet.</p>","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"PP ","pages":""},"PeriodicalIF":10.5000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/TCYB.2025.3603608","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Sleep is essential for maintaining human health and quality of life. Analyzing physiological signals during sleep is critical in assessing sleep quality and diagnosing sleep disorders. However, manual diagnoses by clinicians are time-intensive and subjective. Despite advances in deep learning that have enhanced automation, these approaches remain heavily dependent on large-scale labeled datasets. This study introduces SynthSleepNet, a multimodal hybrid-self-supervised learning (SSL) framework designed for analyzing polysomnography (PSG) data. SynthSleepNet effectively integrates masked prediction and contrastive learning to leverage complementary features across multiple modalities, including electroencephalogram (EEG), electrooculography (EOG), electromyography (EMG), and electrocardiogram (ECG). This approach enables the model to learn highly expressive representations of PSG data. Furthermore, a TCM based on Mamba was developed to efficiently capture contextual information across signals. SynthSleepNet achieved superior performance compared to state-of-the-art methods across three downstream tasks: sleep-stage classification, apnea detection, and hypopnea detection, with accuracies of 89.89%, 99.75%, and 89.60%, respectively. The model demonstrated robust performance in a semi-SSL environment with limited labels, achieving accuracies of 87.98%, 99.37%, and 77.52% in the same tasks. These results underscore the potential of the model as a foundational tool for the comprehensive analysis of PSG data. SynthSleepNet demonstrates comprehensively superior performance across multiple downstream tasks compared to other methodologies, making it expected to set a new standard for sleep disorder monitoring and diagnostic systems. The source code is available at https://github.com/dlcjfgmlnasa/SynthSleepNet.

查看原文本刊更多论文

基于多模态混合自监督学习框架的睡眠分析基础模型。

睡眠对维持人类健康和生活质量至关重要。分析睡眠中的生理信号对于评估睡眠质量和诊断睡眠障碍至关重要。然而，临床医生的手工诊断是费时且主观的。尽管深度学习的进步提高了自动化程度，但这些方法仍然严重依赖于大规模标记数据集。本研究介绍了SynthSleepNet，一个多模态混合自监督学习（SSL）框架，设计用于分析多导睡眠图（PSG）数据。SynthSleepNet有效地集成了掩模预测和对比学习，以利用多种模式的互补功能，包括脑电图（EEG）、眼电图（EOG）、肌电图（EMG）和心电图（ECG）。这种方法使模型能够学习PSG数据的高表达表示。此外，开发了基于曼巴的中医，以有效地捕获跨信号的上下文信息。与最先进的方法相比，SynthSleepNet在三个下游任务（睡眠阶段分类、呼吸暂停检测和呼吸不足检测）上取得了卓越的性能，准确率分别为89.89%、99.75%和89.60%。该模型在标签有限的半ssl环境中表现出稳健的性能，在相同的任务中实现了87.98%，99.37%和77.52%的准确率。这些结果强调了该模型作为PSG数据综合分析的基础工具的潜力。与其他方法相比，SynthSleepNet在多个下游任务中表现出全面的优越性能，有望为睡眠障碍监测和诊断系统树立新的标准。源代码可从https://github.com/dlcjfgmlnasa/SynthSleepNet获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Cybernetics COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, CYBERNETICS

CiteScore

25.40

自引率

11.00%

发文量

1869

期刊介绍： The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.