M. C. Robustillo, M. I. Parra, Y. Campos-Roca, Carlos J. Pérez Sánchez
{"title":"Reinke’s edema diagnosis support system based on voice recordings and machine learning","authors":"M. C. Robustillo, M. I. Parra, Y. Campos-Roca, Carlos J. Pérez Sánchez","doi":"10.1109/ICCAD55197.2022.9853994","DOIUrl":null,"url":null,"abstract":"Voice pathologies have a direct impact on human communication. One of the most common voice disorders is Reinke’s edema. Speech analysis algorithms applied to voice recordings in combination with machine learning techniques are explored to develop non-invasive low-cost tools to help diagnose this pathology. Different approaches have been compared to discriminate subjects affected by Reinke’s edema from healthy ones. Several classification methods have been used, such as decision trees, k-nearest neighbours, neural networks, support vector machines, Bayesian classification, regression analysis and linear discriminant. The experiments are based on two different databases. One of them is the commercial database Massachusetts Eye and Ear Infirmary (MEEI), recorded under highly controlled acoustical conditions, while the other one is an in-house database, recorded in a more realistic environment. The best results have been obtained by using the model based on neural networks, that achieved an overall accuracy of 100% on MEEI database and 95.49% on the in-house one. These are competitive results in comparison with those presented in the scientific literature and show the potential of these techniques to be employed within a support system for the diagnosis of Reinke’s edema.","PeriodicalId":436377,"journal":{"name":"2022 International Conference on Control, Automation and Diagnosis (ICCAD)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Control, Automation and Diagnosis (ICCAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAD55197.2022.9853994","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Voice pathologies have a direct impact on human communication. One of the most common voice disorders is Reinke’s edema. Speech analysis algorithms applied to voice recordings in combination with machine learning techniques are explored to develop non-invasive low-cost tools to help diagnose this pathology. Different approaches have been compared to discriminate subjects affected by Reinke’s edema from healthy ones. Several classification methods have been used, such as decision trees, k-nearest neighbours, neural networks, support vector machines, Bayesian classification, regression analysis and linear discriminant. The experiments are based on two different databases. One of them is the commercial database Massachusetts Eye and Ear Infirmary (MEEI), recorded under highly controlled acoustical conditions, while the other one is an in-house database, recorded in a more realistic environment. The best results have been obtained by using the model based on neural networks, that achieved an overall accuracy of 100% on MEEI database and 95.49% on the in-house one. These are competitive results in comparison with those presented in the scientific literature and show the potential of these techniques to be employed within a support system for the diagnosis of Reinke’s edema.