Noah Alter, Claiborne Lucas, Ricardo Torres-Guzman, Andrew James, Amy Stone, Maria E Powell, Scott Corlew, Weixin Liu, Bowen Qu, Zhijun Yin, Andrea Hiller, Michael Golinko, Matthew E Pontell
{"title":"从支持向量机到神经网络:推进腭裂患者腭咽功能障碍自动检测。","authors":"Noah Alter, Claiborne Lucas, Ricardo Torres-Guzman, Andrew James, Amy Stone, Maria E Powell, Scott Corlew, Weixin Liu, Bowen Qu, Zhijun Yin, Andrea Hiller, Michael Golinko, Matthew E Pontell","doi":"10.1097/SAP.0000000000004460","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The generation of intelligible speech is the single most important outcome after cleft palate repair. The development of velopharyngeal dysfunction (VPD) compromises the outcome, and the burden of VPD remains largely unknown in low- and middle-income countries (LMICs). To scale up VPD care in these areas, we continue to explore the use of artificial intelligence (AI) and machine learning (ML) for automatic detection of VPD from speech samples alone.</p><p><strong>Methods: </strong>An age-matched, single-institution cohort of 60 patients (30 control, 30 with VPD after cleft palate repair) generated approximately 8000 audio samples (4000 VPD and 4000 control). These samples were used to inform the development of a neural network-based, self-supervised deep learning ML model.</p><p><strong>Results: </strong>ML model testing with augmented and unaugmented data sets revealed accuracies of 1.0, macro precisions of 1.0, macro recalls of 1.0, and F1 scores of 1.0.</p><p><strong>Discussion: </strong>Although these results are promising and support the ability of ML models to detect VPD, the results likely indicate that the ML models are also picking up confounding data. Efforts are underway to address this problem while simultaneously employing disentanglement tactics to allow for multilingual speech analysis. The ability to clinically operationalize such a model could instantaneously enhance VPD care in LMICs for patients with cleft palate with little changes to existing healthcare infrastructure.</p>","PeriodicalId":8060,"journal":{"name":"Annals of Plastic Surgery","volume":"95 3S Suppl 1","pages":"S55-S59"},"PeriodicalIF":1.6000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Support Vector Machines to Neural Networks: Advancing Automated Velopharyngeal Dysfunction Detection in Patients With Cleft Palate.\",\"authors\":\"Noah Alter, Claiborne Lucas, Ricardo Torres-Guzman, Andrew James, Amy Stone, Maria E Powell, Scott Corlew, Weixin Liu, Bowen Qu, Zhijun Yin, Andrea Hiller, Michael Golinko, Matthew E Pontell\",\"doi\":\"10.1097/SAP.0000000000004460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The generation of intelligible speech is the single most important outcome after cleft palate repair. The development of velopharyngeal dysfunction (VPD) compromises the outcome, and the burden of VPD remains largely unknown in low- and middle-income countries (LMICs). To scale up VPD care in these areas, we continue to explore the use of artificial intelligence (AI) and machine learning (ML) for automatic detection of VPD from speech samples alone.</p><p><strong>Methods: </strong>An age-matched, single-institution cohort of 60 patients (30 control, 30 with VPD after cleft palate repair) generated approximately 8000 audio samples (4000 VPD and 4000 control). These samples were used to inform the development of a neural network-based, self-supervised deep learning ML model.</p><p><strong>Results: </strong>ML model testing with augmented and unaugmented data sets revealed accuracies of 1.0, macro precisions of 1.0, macro recalls of 1.0, and F1 scores of 1.0.</p><p><strong>Discussion: </strong>Although these results are promising and support the ability of ML models to detect VPD, the results likely indicate that the ML models are also picking up confounding data. Efforts are underway to address this problem while simultaneously employing disentanglement tactics to allow for multilingual speech analysis. The ability to clinically operationalize such a model could instantaneously enhance VPD care in LMICs for patients with cleft palate with little changes to existing healthcare infrastructure.</p>\",\"PeriodicalId\":8060,\"journal\":{\"name\":\"Annals of Plastic Surgery\",\"volume\":\"95 3S Suppl 1\",\"pages\":\"S55-S59\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Plastic Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1097/SAP.0000000000004460\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Plastic Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/SAP.0000000000004460","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
From Support Vector Machines to Neural Networks: Advancing Automated Velopharyngeal Dysfunction Detection in Patients With Cleft Palate.
Background: The generation of intelligible speech is the single most important outcome after cleft palate repair. The development of velopharyngeal dysfunction (VPD) compromises the outcome, and the burden of VPD remains largely unknown in low- and middle-income countries (LMICs). To scale up VPD care in these areas, we continue to explore the use of artificial intelligence (AI) and machine learning (ML) for automatic detection of VPD from speech samples alone.
Methods: An age-matched, single-institution cohort of 60 patients (30 control, 30 with VPD after cleft palate repair) generated approximately 8000 audio samples (4000 VPD and 4000 control). These samples were used to inform the development of a neural network-based, self-supervised deep learning ML model.
Results: ML model testing with augmented and unaugmented data sets revealed accuracies of 1.0, macro precisions of 1.0, macro recalls of 1.0, and F1 scores of 1.0.
Discussion: Although these results are promising and support the ability of ML models to detect VPD, the results likely indicate that the ML models are also picking up confounding data. Efforts are underway to address this problem while simultaneously employing disentanglement tactics to allow for multilingual speech analysis. The ability to clinically operationalize such a model could instantaneously enhance VPD care in LMICs for patients with cleft palate with little changes to existing healthcare infrastructure.
期刊介绍:
The only independent journal devoted to general plastic and reconstructive surgery, Annals of Plastic Surgery serves as a forum for current scientific and clinical advances in the field and a sounding board for ideas and perspectives on its future. The journal publishes peer-reviewed original articles, brief communications, case reports, and notes in all areas of interest to the practicing plastic surgeon. There are also historical and current reviews, descriptions of surgical technique, and lively editorials and letters to the editor.