基于saarbr cken语音数据库的语音病理分类超参数优化机器学习模型。

IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
Pervin Gulsen, Abdulkadir Gulsen, Mustafa Alci
{"title":"基于saarbr<s:1> cken语音数据库的语音病理分类超参数优化机器学习模型。","authors":"Pervin Gulsen, Abdulkadir Gulsen, Mustafa Alci","doi":"10.1016/j.jvoice.2024.12.009","DOIUrl":null,"url":null,"abstract":"<p><p>Early diagnosis and referral are crucial in the treatment of voice disorders. Contemporary investigations have indicated the efficacy of voice pathology detection systems in significantly contributing to the evaluation of voice disorders, facilitating early diagnosis of such pathologies. These systems leverage machine learning methodologies, widely applied across diverse domains, and exhibit particular potential in the realm of voice pathology classification. However, machine learning models and performance metrics employed in these studies vary significantly, making it challenging to determine the optimal model for voice pathology classification. In this study, healthy and pathological voices were classified with state-of-the-art machine learning models, and the performance results of the models were compared. The voice samples employed in our research were sourced from the Saarbrücken Voice Database, a reputable German database. Feature extraction from voice signals was conducted using the Mel Frequency Cepstral Coefficients method. To assess and enhance the models' performance adequately, we employed hyperparameter optimization and implemented a 10-fold cross-validation approach. The outcomes revealed that the support vector machine model exhibited the highest accuracy, achieving 99.19% and 99.50% accuracies in the classification of male and female voice pathologies, respectively.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Models With Hyperparameter Optimization for Voice Pathology Classification on Saarbrücken Voice Database.\",\"authors\":\"Pervin Gulsen, Abdulkadir Gulsen, Mustafa Alci\",\"doi\":\"10.1016/j.jvoice.2024.12.009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Early diagnosis and referral are crucial in the treatment of voice disorders. Contemporary investigations have indicated the efficacy of voice pathology detection systems in significantly contributing to the evaluation of voice disorders, facilitating early diagnosis of such pathologies. These systems leverage machine learning methodologies, widely applied across diverse domains, and exhibit particular potential in the realm of voice pathology classification. However, machine learning models and performance metrics employed in these studies vary significantly, making it challenging to determine the optimal model for voice pathology classification. In this study, healthy and pathological voices were classified with state-of-the-art machine learning models, and the performance results of the models were compared. The voice samples employed in our research were sourced from the Saarbrücken Voice Database, a reputable German database. Feature extraction from voice signals was conducted using the Mel Frequency Cepstral Coefficients method. To assess and enhance the models' performance adequately, we employed hyperparameter optimization and implemented a 10-fold cross-validation approach. The outcomes revealed that the support vector machine model exhibited the highest accuracy, achieving 99.19% and 99.50% accuracies in the classification of male and female voice pathologies, respectively.</p>\",\"PeriodicalId\":49954,\"journal\":{\"name\":\"Journal of Voice\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-01-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Voice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jvoice.2024.12.009\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2024.12.009","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

早期诊断和转诊对嗓音障碍的治疗至关重要。当前的研究表明,语音病理检测系统在评估语音障碍、促进此类病理的早期诊断方面具有重要作用。这些系统利用机器学习方法,广泛应用于各个领域,并在语音病理分类领域表现出特别的潜力。然而,这些研究中使用的机器学习模型和性能指标差异很大,这使得确定语音病理分类的最佳模型具有挑战性。在本研究中,使用最先进的机器学习模型对健康和病理声音进行分类,并比较模型的性能结果。在我们的研究中使用的声音样本来自saarbr cken声音数据库,一个著名的德国数据库。采用Mel频率倒谱系数法对语音信号进行特征提取。为了充分评估和增强模型的性能,我们采用了超参数优化并实施了10倍交叉验证方法。结果表明,支持向量机模型在男声和女声病理分类中准确率最高,分别达到99.19%和99.50%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning Models With Hyperparameter Optimization for Voice Pathology Classification on Saarbrücken Voice Database.

Early diagnosis and referral are crucial in the treatment of voice disorders. Contemporary investigations have indicated the efficacy of voice pathology detection systems in significantly contributing to the evaluation of voice disorders, facilitating early diagnosis of such pathologies. These systems leverage machine learning methodologies, widely applied across diverse domains, and exhibit particular potential in the realm of voice pathology classification. However, machine learning models and performance metrics employed in these studies vary significantly, making it challenging to determine the optimal model for voice pathology classification. In this study, healthy and pathological voices were classified with state-of-the-art machine learning models, and the performance results of the models were compared. The voice samples employed in our research were sourced from the Saarbrücken Voice Database, a reputable German database. Feature extraction from voice signals was conducted using the Mel Frequency Cepstral Coefficients method. To assess and enhance the models' performance adequately, we employed hyperparameter optimization and implemented a 10-fold cross-validation approach. The outcomes revealed that the support vector machine model exhibited the highest accuracy, achieving 99.19% and 99.50% accuracies in the classification of male and female voice pathologies, respectively.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Voice
Journal of Voice 医学-耳鼻喉科学
CiteScore
4.00
自引率
13.60%
发文量
395
审稿时长
59 days
期刊介绍: The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信