Joonki Hong, Seung Koo Yang, Seunghun Kim, Sung-Woo Cho, Jayoung Oh, Eun Sung Cho, In-Young Yoon, Dongheon Lee, Jeong-Whun Kim
{"title":"Real-Time Snoring Detection Using Deep Learning: A Home-Based Smartphone Approach for Sleep Monitoring.","authors":"Joonki Hong, Seung Koo Yang, Seunghun Kim, Sung-Woo Cho, Jayoung Oh, Eun Sung Cho, In-Young Yoon, Dongheon Lee, Jeong-Whun Kim","doi":"10.2147/NSS.S514631","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Despite the prevalence of sleep-related disorders, few studies have developed deep learning models to predict snoring using home-recorded smartphone audio. This study proposes a real-time snoring detection method utilizing a Vision Transformer-based deep learning model and smartphone recordings.</p><p><strong>Methods: </strong>Participants' sleep-breathing sounds were recorded using smartphones, with concurrent Level I or II polysomnography (PSG) conducted in home or hospital settings. A total of 200 minutes of smartphone audio per participant, corresponding to 400 30-second sleep stage epochs on PSG, were sampled. Each epoch was annotated independently by two trained labelers, with snoring labeled only when both agreed. Model performance was evaluated by epoch-by-epoch prediction accuracy and correlation between observed and predicted snoring ratios.</p><p><strong>Results: </strong>The study included 214 participants (85,600 epochs). Hospital audio data from 105 participants (42,000 epochs) were used for training, while home audio data from 109 participants were split into 54 participants (21,600 epochs) for training and 55 participants (22,000 epochs) for testing. On the test dataset, the model demonstrated a sensitivity of 89.8% and a specificity of 91.3%. Correlation analysis showed strong agreement between observed and predicted snoring ratios (r = 0.97, 95% CI: 0.95-0.99).</p><p><strong>Conclusion: </strong>This study demonstrates the feasibility of using deep learning for real-time snoring detection from home-recorded smartphone audio. With high accuracy and scalability, the approach offers a practical and accessible tool for monitoring sleep-related disorders, paving the way for home-based sleep health management solutions.</p>","PeriodicalId":18896,"journal":{"name":"Nature and Science of Sleep","volume":"17 ","pages":"519-530"},"PeriodicalIF":3.0000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11970270/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature and Science of Sleep","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/NSS.S514631","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Despite the prevalence of sleep-related disorders, few studies have developed deep learning models to predict snoring using home-recorded smartphone audio. This study proposes a real-time snoring detection method utilizing a Vision Transformer-based deep learning model and smartphone recordings.
Methods: Participants' sleep-breathing sounds were recorded using smartphones, with concurrent Level I or II polysomnography (PSG) conducted in home or hospital settings. A total of 200 minutes of smartphone audio per participant, corresponding to 400 30-second sleep stage epochs on PSG, were sampled. Each epoch was annotated independently by two trained labelers, with snoring labeled only when both agreed. Model performance was evaluated by epoch-by-epoch prediction accuracy and correlation between observed and predicted snoring ratios.
Results: The study included 214 participants (85,600 epochs). Hospital audio data from 105 participants (42,000 epochs) were used for training, while home audio data from 109 participants were split into 54 participants (21,600 epochs) for training and 55 participants (22,000 epochs) for testing. On the test dataset, the model demonstrated a sensitivity of 89.8% and a specificity of 91.3%. Correlation analysis showed strong agreement between observed and predicted snoring ratios (r = 0.97, 95% CI: 0.95-0.99).
Conclusion: This study demonstrates the feasibility of using deep learning for real-time snoring detection from home-recorded smartphone audio. With high accuracy and scalability, the approach offers a practical and accessible tool for monitoring sleep-related disorders, paving the way for home-based sleep health management solutions.
期刊介绍:
Nature and Science of Sleep is an international, peer-reviewed, open access journal covering all aspects of sleep science and sleep medicine, including the neurophysiology and functions of sleep, the genetics of sleep, sleep and society, biological rhythms, dreaming, sleep disorders and therapy, and strategies to optimize healthy sleep.
Specific topics covered in the journal include:
The functions of sleep in humans and other animals
Physiological and neurophysiological changes with sleep
The genetics of sleep and sleep differences
The neurotransmitters, receptors and pathways involved in controlling both sleep and wakefulness
Behavioral and pharmacological interventions aimed at improving sleep, and improving wakefulness
Sleep changes with development and with age
Sleep and reproduction (e.g., changes across the menstrual cycle, with pregnancy and menopause)
The science and nature of dreams
Sleep disorders
Impact of sleep and sleep disorders on health, daytime function and quality of life
Sleep problems secondary to clinical disorders
Interaction of society with sleep (e.g., consequences of shift work, occupational health, public health)
The microbiome and sleep
Chronotherapy
Impact of circadian rhythms on sleep, physiology, cognition and health
Mechanisms controlling circadian rhythms, centrally and peripherally
Impact of circadian rhythm disruptions (including night shift work, jet lag and social jet lag) on sleep, physiology, cognition and health
Behavioral and pharmacological interventions aimed at reducing adverse effects of circadian-related sleep disruption
Assessment of technologies and biomarkers for measuring sleep and/or circadian rhythms
Epigenetic markers of sleep or circadian disruption.