The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database.

IF 2.6 3区医学 Q1 OTORHINOLARYNGOLOGY

Journal of Otolaryngology - Head & Neck Surgery Pub Date : 2023-09-20 DOI:10.1186/s40463-023-00661-6

Nur Ain Nabila Za'im, Fahad Taha Al-Dhief, Mawaddah Azman, Majid Razaq Mohamed Alsemawi, Nurul Mu Azzah Abdul Latiff, Marina Mat Baki

{"title":"The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database.","authors":"Nur Ain Nabila Za'im, Fahad Taha Al-Dhief, Mawaddah Azman, Majid Razaq Mohamed Alsemawi, Nurul Mu Azzah Abdul Latiff, Marina Mat Baki","doi":"10.1186/s40463-023-00661-6","DOIUrl":null,"url":null,"abstract":"Background: A multidimensional voice quality assessment is recommended for all patients with dysphonia, which requires a patient visit to the otolaryngology clinic. The aim of this study was to determine the accuracy of an online artificial intelligence classifier, the Online Sequential Extreme Learning Machine (OSELM), in detecting voice pathology. In this study, a Malaysian Voice Pathology Database (MVPD), which is the first Malaysian voice database, was created and tested.Methods: The study included 382 participants (252 normal voices and 130 dysphonic voices) in the proposed database MVPD. Complete data were obtained for both groups, including voice samples, laryngostroboscopy videos, and acoustic analysis. The diagnoses of patients with dysphonia were obtained. Each voice sample was anonymized using a code that was specific to each individual and stored in the MVPD. These voice samples were used to train and test the proposed OSELM algorithm. The performance of OSELM was evaluated and compared with other classifiers in terms of the accuracy, sensitivity, and specificity of detecting and differentiating dysphonic voices.Results: The accuracy, sensitivity, and specificity of OSELM in detecting normal and dysphonic voices were 90%, 98%, and 73%, respectively. The classifier differentiated between structural and non-structural vocal fold pathology with accuracy, sensitivity, and specificity of 84%, 89%, and 88%, respectively, while it differentiated between malignant and benign lesions with an accuracy, sensitivity, and specificity of 92%, 100%, and 58%, respectively. Compared to other classifiers, OSELM showed superior accuracy and sensitivity in detecting dysphonic voices, differentiating structural versus non-structural vocal fold pathology, and between malignant and benign voice pathology.Conclusion: The OSELM algorithm exhibited the highest accuracy and sensitivity compared to other classifiers in detecting voice pathology, classifying between malignant and benign lesions, and differentiating between structural and non-structural vocal pathology. Hence, it is a promising artificial intelligence that supports an online application to be used as a screening tool to encourage people to seek medical consultation early for a definitive diagnosis of voice pathology.","PeriodicalId":16615,"journal":{"name":"Journal of Otolaryngology - Head & Neck Surgery","volume":"52 1","pages":"62"},"PeriodicalIF":2.6000,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10512596/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Otolaryngology - Head & Neck Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40463-023-00661-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: A multidimensional voice quality assessment is recommended for all patients with dysphonia, which requires a patient visit to the otolaryngology clinic. The aim of this study was to determine the accuracy of an online artificial intelligence classifier, the Online Sequential Extreme Learning Machine (OSELM), in detecting voice pathology. In this study, a Malaysian Voice Pathology Database (MVPD), which is the first Malaysian voice database, was created and tested.

Methods: The study included 382 participants (252 normal voices and 130 dysphonic voices) in the proposed database MVPD. Complete data were obtained for both groups, including voice samples, laryngostroboscopy videos, and acoustic analysis. The diagnoses of patients with dysphonia were obtained. Each voice sample was anonymized using a code that was specific to each individual and stored in the MVPD. These voice samples were used to train and test the proposed OSELM algorithm. The performance of OSELM was evaluated and compared with other classifiers in terms of the accuracy, sensitivity, and specificity of detecting and differentiating dysphonic voices.

Results: The accuracy, sensitivity, and specificity of OSELM in detecting normal and dysphonic voices were 90%, 98%, and 73%, respectively. The classifier differentiated between structural and non-structural vocal fold pathology with accuracy, sensitivity, and specificity of 84%, 89%, and 88%, respectively, while it differentiated between malignant and benign lesions with an accuracy, sensitivity, and specificity of 92%, 100%, and 58%, respectively. Compared to other classifiers, OSELM showed superior accuracy and sensitivity in detecting dysphonic voices, differentiating structural versus non-structural vocal fold pathology, and between malignant and benign voice pathology.

Conclusion: The OSELM algorithm exhibited the highest accuracy and sensitivity compared to other classifiers in detecting voice pathology, classifying between malignant and benign lesions, and differentiating between structural and non-structural vocal pathology. Hence, it is a promising artificial intelligence that supports an online application to be used as a screening tool to encourage people to seek medical consultation early for a definitive diagnosis of voice pathology.

Abstract Image

查看原文本刊更多论文

在线序列极限学习机使用马来西亚语音病理学数据库检测语音病理学的准确性。

背景：建议对所有发音困难的患者进行多维语音质量评估，这需要患者去耳鼻喉科诊所就诊。本研究的目的是确定在线人工智能分类器——在线序列极限学习机（OSELM）在检测语音病理学方面的准确性。在本研究中，创建并测试了马来西亚语音病理学数据库（MVPD），这是第一个马来西亚语音数据库。方法：该研究包括382名参与者（252名正常语音和130名发音困难语音），纳入拟议的MVPD数据库。获得了两组的完整数据，包括语音样本、喉镜视频和声学分析。获得了发音困难患者的诊断。每个语音样本都使用特定于每个个体的代码进行匿名化，并存储在MVPD中。这些语音样本被用于训练和测试所提出的OSELM算法。评估了OSELM的性能，并将其与其他分类器在检测和区分发音困难的准确性、敏感性和特异性方面进行了比较。结果：OSELM检测正常和发音困难的准确率、灵敏度和特异性分别为90%、98%和73%。该分类器区分结构性和非结构性声带病变，准确率、灵敏度和特异性分别为84%、89%和88%，而区分恶性和良性病变，准确度、灵敏度和特异度分别为92%、100%和58%。与其他分类器相比，OSELM在检测发音困难、区分结构性与非结构性声带病理以及恶性与良性声带病理方面表现出优越的准确性和敏感性。结论：与其他分类器相比，OSELM算法在检测语音病理、区分恶性和良性病变以及区分结构和非结构语音病理方面表现出最高的准确性和灵敏度。因此，它是一种很有前途的人工智能，支持将在线应用程序用作筛查工具，鼓励人们尽早寻求医疗咨询，以最终诊断语音病理。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Otolaryngology - Head & Neck Surgery 医学-耳鼻喉科学

CiteScore

6.50

自引率

2.90%

发文量

审稿时长

6 weeks

期刊介绍： Journal of Otolaryngology-Head & Neck Surgery is an open access, peer-reviewed journal publishing on all aspects and sub-specialties of otolaryngology-head & neck surgery, including pediatric and geriatric otolaryngology, rhinology & anterior skull base surgery, otology/neurotology, facial plastic & reconstructive surgery, head & neck oncology, and maxillofacial rehabilitation, as well as a broad range of related topics.