Automated and interpretable m-health discrimination of vocal cord pathology enabled by machine learning

2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE) Pub Date : 2020-12-16 DOI:10.1109/CSDE50874.2020.9411529

Nabeel Seedat, V. Aharonson, Y. Hamzany

{"title":"Automated and interpretable m-health discrimination of vocal cord pathology enabled by machine learning","authors":"Nabeel Seedat, V. Aharonson, Y. Hamzany","doi":"10.1109/CSDE50874.2020.9411529","DOIUrl":null,"url":null,"abstract":"Clinical methods that assess voice pathologies are typically based on laryngeal endoscopy or audio-perceptual assessment. Both methods have limited accessibility in low-resourced healthcare settings. M-health systems can provide a quantitative assessment and improve early detection in a patient centered care. Automated methods for voice pathologies assessment apply machine learning methods to acoustic, frequency and noise features extracted from sustained phonation recordings and aim to discriminate pathological voices from controls. The machine learning methods in this study are applied to a discriminating between two prevalent vocal pathologies: vocal cord polyp and vocal cord paralysis. The data was acquired by a low-cost recording device in an experiment at a tertiary medical center and the pathologies were clinically labeled. Acoustic and spectral features were extracted and multiple classifiers compared using batched cross validation. The best classifiers were tree-based classifiers, with the Extra Trees classifier providing the best performance with an accuracy of 0.9565 and F1-score of 0.9130. Explainable AI (XAI) and feature interpretability analysis was carried out to allow clinicians to use the features marked as important to clinical care and planning. The most important features were octave-based spectral contrast and MFCCs 0 to 3. The results indicate a feasibility of machine learning to accurately discriminate between different types of vocal cord pathologies.","PeriodicalId":445708,"journal":{"name":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE50874.2020.9411529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Clinical methods that assess voice pathologies are typically based on laryngeal endoscopy or audio-perceptual assessment. Both methods have limited accessibility in low-resourced healthcare settings. M-health systems can provide a quantitative assessment and improve early detection in a patient centered care. Automated methods for voice pathologies assessment apply machine learning methods to acoustic, frequency and noise features extracted from sustained phonation recordings and aim to discriminate pathological voices from controls. The machine learning methods in this study are applied to a discriminating between two prevalent vocal pathologies: vocal cord polyp and vocal cord paralysis. The data was acquired by a low-cost recording device in an experiment at a tertiary medical center and the pathologies were clinically labeled. Acoustic and spectral features were extracted and multiple classifiers compared using batched cross validation. The best classifiers were tree-based classifiers, with the Extra Trees classifier providing the best performance with an accuracy of 0.9565 and F1-score of 0.9130. Explainable AI (XAI) and feature interpretability analysis was carried out to allow clinicians to use the features marked as important to clinical care and planning. The most important features were octave-based spectral contrast and MFCCs 0 to 3. The results indicate a feasibility of machine learning to accurately discriminate between different types of vocal cord pathologies.

查看原文本刊更多论文

通过机器学习实现声带病理的自动和可解释的移动健康歧视

评估声音病理的临床方法通常是基于喉内窥镜检查或听觉感知评估。在资源匮乏的医疗环境中，这两种方法的可及性有限。移动医疗系统可以提供定量评估，并在以患者为中心的护理中改善早期发现。语音病理评估的自动化方法将机器学习方法应用于从持续录音中提取的声学、频率和噪声特征，旨在区分病理声音和对照组。本研究将机器学习方法应用于区分两种常见的声带病变:声带息肉和声带麻痹。在某三级医疗中心的实验中，通过低成本的记录设备获取数据，并对病理进行临床标记。声学和光谱特征提取和多分类器比较使用批量交叉验证。最好的分类器是基于树的分类器，其中Extra Trees分类器的准确率为0.9565,f1分数为0.9130，表现最好。可解释AI (XAI)和特征可解释性分析允许临床医生使用标记为临床护理和计划重要的特征。最重要的特征是基于八度的光谱对比度和mfccc为0 ~ 3。结果表明，机器学习可以准确区分不同类型的声带病变。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)

自引率

0.00%

发文量