通过机器学习实现声带病理的自动和可解释的移动健康歧视

Nabeel Seedat, V. Aharonson, Y. Hamzany
{"title":"通过机器学习实现声带病理的自动和可解释的移动健康歧视","authors":"Nabeel Seedat, V. Aharonson, Y. Hamzany","doi":"10.1109/CSDE50874.2020.9411529","DOIUrl":null,"url":null,"abstract":"Clinical methods that assess voice pathologies are typically based on laryngeal endoscopy or audio-perceptual assessment. Both methods have limited accessibility in low-resourced healthcare settings. M-health systems can provide a quantitative assessment and improve early detection in a patient centered care. Automated methods for voice pathologies assessment apply machine learning methods to acoustic, frequency and noise features extracted from sustained phonation recordings and aim to discriminate pathological voices from controls. The machine learning methods in this study are applied to a discriminating between two prevalent vocal pathologies: vocal cord polyp and vocal cord paralysis. The data was acquired by a low-cost recording device in an experiment at a tertiary medical center and the pathologies were clinically labeled. Acoustic and spectral features were extracted and multiple classifiers compared using batched cross validation. The best classifiers were tree-based classifiers, with the Extra Trees classifier providing the best performance with an accuracy of 0.9565 and F1-score of 0.9130. Explainable AI (XAI) and feature interpretability analysis was carried out to allow clinicians to use the features marked as important to clinical care and planning. The most important features were octave-based spectral contrast and MFCCs 0 to 3. The results indicate a feasibility of machine learning to accurately discriminate between different types of vocal cord pathologies.","PeriodicalId":445708,"journal":{"name":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Automated and interpretable m-health discrimination of vocal cord pathology enabled by machine learning\",\"authors\":\"Nabeel Seedat, V. Aharonson, Y. Hamzany\",\"doi\":\"10.1109/CSDE50874.2020.9411529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clinical methods that assess voice pathologies are typically based on laryngeal endoscopy or audio-perceptual assessment. Both methods have limited accessibility in low-resourced healthcare settings. M-health systems can provide a quantitative assessment and improve early detection in a patient centered care. Automated methods for voice pathologies assessment apply machine learning methods to acoustic, frequency and noise features extracted from sustained phonation recordings and aim to discriminate pathological voices from controls. The machine learning methods in this study are applied to a discriminating between two prevalent vocal pathologies: vocal cord polyp and vocal cord paralysis. The data was acquired by a low-cost recording device in an experiment at a tertiary medical center and the pathologies were clinically labeled. Acoustic and spectral features were extracted and multiple classifiers compared using batched cross validation. The best classifiers were tree-based classifiers, with the Extra Trees classifier providing the best performance with an accuracy of 0.9565 and F1-score of 0.9130. Explainable AI (XAI) and feature interpretability analysis was carried out to allow clinicians to use the features marked as important to clinical care and planning. The most important features were octave-based spectral contrast and MFCCs 0 to 3. The results indicate a feasibility of machine learning to accurately discriminate between different types of vocal cord pathologies.\",\"PeriodicalId\":445708,\"journal\":{\"name\":\"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSDE50874.2020.9411529\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE50874.2020.9411529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

评估声音病理的临床方法通常是基于喉内窥镜检查或听觉感知评估。在资源匮乏的医疗环境中,这两种方法的可及性有限。移动医疗系统可以提供定量评估,并在以患者为中心的护理中改善早期发现。语音病理评估的自动化方法将机器学习方法应用于从持续录音中提取的声学、频率和噪声特征,旨在区分病理声音和对照组。本研究将机器学习方法应用于区分两种常见的声带病变:声带息肉和声带麻痹。在某三级医疗中心的实验中,通过低成本的记录设备获取数据,并对病理进行临床标记。声学和光谱特征提取和多分类器比较使用批量交叉验证。最好的分类器是基于树的分类器,其中Extra Trees分类器的准确率为0.9565,f1分数为0.9130,表现最好。可解释AI (XAI)和特征可解释性分析允许临床医生使用标记为临床护理和计划重要的特征。最重要的特征是基于八度的光谱对比度和mfccc为0 ~ 3。结果表明,机器学习可以准确区分不同类型的声带病变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automated and interpretable m-health discrimination of vocal cord pathology enabled by machine learning
Clinical methods that assess voice pathologies are typically based on laryngeal endoscopy or audio-perceptual assessment. Both methods have limited accessibility in low-resourced healthcare settings. M-health systems can provide a quantitative assessment and improve early detection in a patient centered care. Automated methods for voice pathologies assessment apply machine learning methods to acoustic, frequency and noise features extracted from sustained phonation recordings and aim to discriminate pathological voices from controls. The machine learning methods in this study are applied to a discriminating between two prevalent vocal pathologies: vocal cord polyp and vocal cord paralysis. The data was acquired by a low-cost recording device in an experiment at a tertiary medical center and the pathologies were clinically labeled. Acoustic and spectral features were extracted and multiple classifiers compared using batched cross validation. The best classifiers were tree-based classifiers, with the Extra Trees classifier providing the best performance with an accuracy of 0.9565 and F1-score of 0.9130. Explainable AI (XAI) and feature interpretability analysis was carried out to allow clinicians to use the features marked as important to clinical care and planning. The most important features were octave-based spectral contrast and MFCCs 0 to 3. The results indicate a feasibility of machine learning to accurately discriminate between different types of vocal cord pathologies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信