Benjamin Fox, Joy Jiang, Sajila Wickramaratne, Patricia Kovatch, Mayte Suarez-Farinas, Neomi A Shah, Ankit Parekh, Girish N Nadkarni
{"title":"A foundational transformer leveraging full night, multichannel sleep study data accurately classifies sleep stages.","authors":"Benjamin Fox, Joy Jiang, Sajila Wickramaratne, Patricia Kovatch, Mayte Suarez-Farinas, Neomi A Shah, Ankit Parekh, Girish N Nadkarni","doi":"10.1093/sleep/zsaf061","DOIUrl":null,"url":null,"abstract":"<p><strong>Study objectives: </strong>To evaluate whether a foundational transformer using 8-hour, multichannel polysomnogram (PSG) data can effectively encode signals and classify sleep stages with state-of-the-art performance.</p><p><strong>Methods: </strong>The Sleep Heart Health Study, Wisconsin Sleep Cohort, and Osteoporotic Fractures in Men (MrOS) Study Visit 1 were used for training, and the Multi-Ethnic Study of Atherosclerosis (MESA), Apnea Positive Pressure Long-term Efficacy Study (APPLES), and MrOS visit 2 served as independent test sets. We developed PFTSleep, a self-supervised foundational transformer that encodes full night sleep studies with brain, movement, cardiac, oxygen, and respiratory channels. These representations were used to train another model to classify sleep stages. We compared our results to existing methods, examined differences in performance by varying channel input data and training dataset size, and investigated an AI explainability tool to analyze decision processes.</p><p><strong>Results: </strong>PFTSleep was trained with 13,888 sleep studies and tested on 4,169 independent studies. Cohen's Kappa scores were 0.81 for our held-out set, 0.59 for APPLES, 0.60 for MESA, and 0.75 for MrOS Visit 2. Performance increases to 0.76 on a held-out MESA set when MESA is included in the training of the classifier head but not the transformer. Compared to other state-of-the-art AI models, our model shows high performance across diverse datasets while only using task agnostic PSG representations from a foundational transformer as input for sleep stage classification.</p><p><strong>Conclusions: </strong>Full night, multichannel PSG representations from a foundational transformer enable accurate sleep stage classification comparable to state-of-the-art AI methods across diverse datasets.</p>","PeriodicalId":22018,"journal":{"name":"Sleep","volume":" ","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sleep","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/sleep/zsaf061","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Study objectives: To evaluate whether a foundational transformer using 8-hour, multichannel polysomnogram (PSG) data can effectively encode signals and classify sleep stages with state-of-the-art performance.
Methods: The Sleep Heart Health Study, Wisconsin Sleep Cohort, and Osteoporotic Fractures in Men (MrOS) Study Visit 1 were used for training, and the Multi-Ethnic Study of Atherosclerosis (MESA), Apnea Positive Pressure Long-term Efficacy Study (APPLES), and MrOS visit 2 served as independent test sets. We developed PFTSleep, a self-supervised foundational transformer that encodes full night sleep studies with brain, movement, cardiac, oxygen, and respiratory channels. These representations were used to train another model to classify sleep stages. We compared our results to existing methods, examined differences in performance by varying channel input data and training dataset size, and investigated an AI explainability tool to analyze decision processes.
Results: PFTSleep was trained with 13,888 sleep studies and tested on 4,169 independent studies. Cohen's Kappa scores were 0.81 for our held-out set, 0.59 for APPLES, 0.60 for MESA, and 0.75 for MrOS Visit 2. Performance increases to 0.76 on a held-out MESA set when MESA is included in the training of the classifier head but not the transformer. Compared to other state-of-the-art AI models, our model shows high performance across diverse datasets while only using task agnostic PSG representations from a foundational transformer as input for sleep stage classification.
Conclusions: Full night, multichannel PSG representations from a foundational transformer enable accurate sleep stage classification comparable to state-of-the-art AI methods across diverse datasets.
研究目的:评估使用8小时多通道多导睡眠图(PSG)数据的基础变压器是否可以有效地编码信号并以最先进的性能对睡眠阶段进行分类。方法:以睡眠心脏健康研究(Sleep Heart Health Study)、Wisconsin睡眠队列(Sleep Cohort)和男性骨质疏松性骨折(osteoporosis Fractures in Men, mrs) Study Visit 1进行训练,以动脉粥样硬化多民族研究(MESA)、呼吸暂停正压长期疗效研究(APPLES)和mrs Study Visit 2作为独立测试集。我们开发了PFTSleep,这是一个自我监督的基础转换器,可以对大脑、运动、心脏、氧气和呼吸通道的全夜睡眠研究进行编码。这些表征被用来训练另一个模型来对睡眠阶段进行分类。我们将结果与现有方法进行了比较,通过不同的通道输入数据和训练数据集大小检查了性能差异,并研究了人工智能可解释性工具来分析决策过程。结果:PFTSleep接受了13888项睡眠研究的训练,并在4169项独立研究中进行了测试。Cohen的Kappa分数为0.81,苹果为0.59,MESA为0.60,mrs为0.75。当MESA包含在分类器头部的训练中而不包括变压器时,在伸出的MESA集上性能增加到0.76。与其他最先进的人工智能模型相比,我们的模型在不同的数据集上表现出高性能,同时只使用来自基本变压器的任务无关的PSG表示作为睡眠阶段分类的输入。结论:基于基础变压器的整晚多通道PSG表示可以在不同数据集上实现与最先进的人工智能方法相媲美的准确睡眠阶段分类。
期刊介绍:
SLEEP® publishes findings from studies conducted at any level of analysis, including:
Genes
Molecules
Cells
Physiology
Neural systems and circuits
Behavior and cognition
Self-report
SLEEP® publishes articles that use a wide variety of scientific approaches and address a broad range of topics. These may include, but are not limited to:
Basic and neuroscience studies of sleep and circadian mechanisms
In vitro and animal models of sleep, circadian rhythms, and human disorders
Pre-clinical human investigations, including the measurement and manipulation of sleep and circadian rhythms
Studies in clinical or population samples. These may address factors influencing sleep and circadian rhythms (e.g., development and aging, and social and environmental influences) and relationships between sleep, circadian rhythms, health, and disease
Clinical trials, epidemiology studies, implementation, and dissemination research.