{"title":"机器听力诊断阻塞性睡眠呼吸暂停:贝叶斯荟萃分析。","authors":"Benjamin Kye Jyn Tan,Esther Yanxin Gao,Nicole Kye Wen Tan,Brian Sheng Yep Yeo,Claire Jing-Wen Tan,Adele Chin Wei Ng,Zhou Hao Leong,Chu Qin Phua,Maythad Uataya,Liang Chye Goh,Thun How Ong,Leong Chai Leow,Guang-Bin Huang,Song Tar Toh","doi":"10.1016/j.chest.2025.04.006","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nAmong 1 billion patients worldwide with obstructive sleep apnea (OSA), 90% remain undiagnosed. Their main barrier is the overnight polysomnogram, which requires specialized equipment, skilled technicians and inpatient beds available only in tertiary sleep centers. Recent advances in artificial intelligence (AI) have enabled OSA detection using breath sound recordings.\r\n\r\nRESEARCH QUESTION\r\nWhat is the diagnostic accuracy and how can we optimize machine listening for OSA?\r\n\r\nSTUDY DESIGN AND METHODS\r\nPubMed, Embase, Scopus, Web of Science and IEEE Xplore were systematically searched. Two blinded reviewers selected studies comparing the patient-level diagnostic performance of AI approaches using overnight audio recordings, versus conventional diagnosis (apnea-hypopnea index [AHI]) using a train-test split or k-fold cross-validation. Bayesian bivariate meta-analysis and meta-regression were performed. Publication bias was assessed using a selection model. Risk of bias and evidence quality were assessed using QUADAS-2 and GRADE.\r\n\r\nRESULTS\r\nFrom 6,254 records, we included 16 studies (41 models) trained/tested on 4,864/2,370 participants. No study had a high risk of bias. Machine listening achieved a pooled sensitivity (95% credible interval) of 90.3% (86.9-93.1%), specificity of 86.7% (83.1-89.7%), diagnostic odds ratio of 60.8 (39.4-99.9), positive and negative likelihood ratios of 6.78 (5.34-8.85) and 0.113 (0.079-0.152). At AHI cut-offs of ≥5, ≥15, ≥30: sensitivities were 94.3% (90.3-96.8%), 86.3% (80.1-90.9%), 86.3% (79.2-91.1%); specificities were 78.5% (68.0-86.9%), 87.3% (81.8-91.3%), 89.5% (84.8-93.3%). Meta-regression identified higher sensitivity for: higher audio sampling frequencies; non-contact microphones; higher OSA prevalence; train-test split model evaluation. Accuracy was equal regardless of: home smartphone versus in-laboratory professional microphone recordings; deep learning versus traditional machine learning; varying age and sex. Publication bias was not evident. The evidence was of high quality.\r\n\r\nINTERPRETATION\r\nMachine listening achieved excellent diagnostic accuracy, superior to STOP-Bang and comparable to common home sleep tests. Digital medicine should be further explored and externally validated for accessible and equitable OSA diagnosis.","PeriodicalId":9782,"journal":{"name":"Chest","volume":"108 1","pages":""},"PeriodicalIF":9.5000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Listening for Obstructive Sleep Apnea Diagnosis: A Bayesian Meta-Analysis.\",\"authors\":\"Benjamin Kye Jyn Tan,Esther Yanxin Gao,Nicole Kye Wen Tan,Brian Sheng Yep Yeo,Claire Jing-Wen Tan,Adele Chin Wei Ng,Zhou Hao Leong,Chu Qin Phua,Maythad Uataya,Liang Chye Goh,Thun How Ong,Leong Chai Leow,Guang-Bin Huang,Song Tar Toh\",\"doi\":\"10.1016/j.chest.2025.04.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\r\\nAmong 1 billion patients worldwide with obstructive sleep apnea (OSA), 90% remain undiagnosed. Their main barrier is the overnight polysomnogram, which requires specialized equipment, skilled technicians and inpatient beds available only in tertiary sleep centers. Recent advances in artificial intelligence (AI) have enabled OSA detection using breath sound recordings.\\r\\n\\r\\nRESEARCH QUESTION\\r\\nWhat is the diagnostic accuracy and how can we optimize machine listening for OSA?\\r\\n\\r\\nSTUDY DESIGN AND METHODS\\r\\nPubMed, Embase, Scopus, Web of Science and IEEE Xplore were systematically searched. Two blinded reviewers selected studies comparing the patient-level diagnostic performance of AI approaches using overnight audio recordings, versus conventional diagnosis (apnea-hypopnea index [AHI]) using a train-test split or k-fold cross-validation. Bayesian bivariate meta-analysis and meta-regression were performed. Publication bias was assessed using a selection model. Risk of bias and evidence quality were assessed using QUADAS-2 and GRADE.\\r\\n\\r\\nRESULTS\\r\\nFrom 6,254 records, we included 16 studies (41 models) trained/tested on 4,864/2,370 participants. No study had a high risk of bias. Machine listening achieved a pooled sensitivity (95% credible interval) of 90.3% (86.9-93.1%), specificity of 86.7% (83.1-89.7%), diagnostic odds ratio of 60.8 (39.4-99.9), positive and negative likelihood ratios of 6.78 (5.34-8.85) and 0.113 (0.079-0.152). At AHI cut-offs of ≥5, ≥15, ≥30: sensitivities were 94.3% (90.3-96.8%), 86.3% (80.1-90.9%), 86.3% (79.2-91.1%); specificities were 78.5% (68.0-86.9%), 87.3% (81.8-91.3%), 89.5% (84.8-93.3%). Meta-regression identified higher sensitivity for: higher audio sampling frequencies; non-contact microphones; higher OSA prevalence; train-test split model evaluation. Accuracy was equal regardless of: home smartphone versus in-laboratory professional microphone recordings; deep learning versus traditional machine learning; varying age and sex. Publication bias was not evident. The evidence was of high quality.\\r\\n\\r\\nINTERPRETATION\\r\\nMachine listening achieved excellent diagnostic accuracy, superior to STOP-Bang and comparable to common home sleep tests. Digital medicine should be further explored and externally validated for accessible and equitable OSA diagnosis.\",\"PeriodicalId\":9782,\"journal\":{\"name\":\"Chest\",\"volume\":\"108 1\",\"pages\":\"\"},\"PeriodicalIF\":9.5000,\"publicationDate\":\"2025-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chest\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.chest.2025.04.006\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CRITICAL CARE MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chest","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.chest.2025.04.006","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CRITICAL CARE MEDICINE","Score":null,"Total":0}
Machine Listening for Obstructive Sleep Apnea Diagnosis: A Bayesian Meta-Analysis.
BACKGROUND
Among 1 billion patients worldwide with obstructive sleep apnea (OSA), 90% remain undiagnosed. Their main barrier is the overnight polysomnogram, which requires specialized equipment, skilled technicians and inpatient beds available only in tertiary sleep centers. Recent advances in artificial intelligence (AI) have enabled OSA detection using breath sound recordings.
RESEARCH QUESTION
What is the diagnostic accuracy and how can we optimize machine listening for OSA?
STUDY DESIGN AND METHODS
PubMed, Embase, Scopus, Web of Science and IEEE Xplore were systematically searched. Two blinded reviewers selected studies comparing the patient-level diagnostic performance of AI approaches using overnight audio recordings, versus conventional diagnosis (apnea-hypopnea index [AHI]) using a train-test split or k-fold cross-validation. Bayesian bivariate meta-analysis and meta-regression were performed. Publication bias was assessed using a selection model. Risk of bias and evidence quality were assessed using QUADAS-2 and GRADE.
RESULTS
From 6,254 records, we included 16 studies (41 models) trained/tested on 4,864/2,370 participants. No study had a high risk of bias. Machine listening achieved a pooled sensitivity (95% credible interval) of 90.3% (86.9-93.1%), specificity of 86.7% (83.1-89.7%), diagnostic odds ratio of 60.8 (39.4-99.9), positive and negative likelihood ratios of 6.78 (5.34-8.85) and 0.113 (0.079-0.152). At AHI cut-offs of ≥5, ≥15, ≥30: sensitivities were 94.3% (90.3-96.8%), 86.3% (80.1-90.9%), 86.3% (79.2-91.1%); specificities were 78.5% (68.0-86.9%), 87.3% (81.8-91.3%), 89.5% (84.8-93.3%). Meta-regression identified higher sensitivity for: higher audio sampling frequencies; non-contact microphones; higher OSA prevalence; train-test split model evaluation. Accuracy was equal regardless of: home smartphone versus in-laboratory professional microphone recordings; deep learning versus traditional machine learning; varying age and sex. Publication bias was not evident. The evidence was of high quality.
INTERPRETATION
Machine listening achieved excellent diagnostic accuracy, superior to STOP-Bang and comparable to common home sleep tests. Digital medicine should be further explored and externally validated for accessible and equitable OSA diagnosis.
期刊介绍:
At CHEST, our mission is to revolutionize patient care through the collaboration of multidisciplinary clinicians in the fields of pulmonary, critical care, and sleep medicine. We achieve this by publishing cutting-edge clinical research that addresses current challenges and brings forth future advancements. To enhance understanding in a rapidly evolving field, CHEST also features review articles, commentaries, and facilitates discussions on emerging controversies. We place great emphasis on scientific rigor, employing a rigorous peer review process, and ensuring all accepted content is published online within two weeks.