Gabriel José Pellisser Dalalana, Rodrigo Capobianco Guido, Eduardo Sperle Honorato, Ivan Nunes da Silva
{"title":"Application of Wavelet Analysis and Paraconsistent Feature Extraction in the Classification of Voice Pathologies.","authors":"Gabriel José Pellisser Dalalana, Rodrigo Capobianco Guido, Eduardo Sperle Honorato, Ivan Nunes da Silva","doi":"10.1016/j.jvoice.2025.01.021","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study explores the application of wavelet analysis and paraconsistent logic for the classification of voice pathologies. The primary objective is to develop a methodology combining signal decomposition techniques and intelligent classification to distinguish between healthy and pathological voice samples.</p><p><strong>Methods: </strong>Voice signals from the Saarbruecken Voice Database were preprocessed and decomposed using the discrete-time wavelet packet transform across multiple levels. Features such as energy, entropy, and zero-crossing rate (ZCR) were extracted for classification using support vector machines. Additionally, a paraconsistent logic framework was implemented to handle uncertainty and class overlap, enhancing classification. Six wavelet families were analyzed, including Haar, Daubechies, Symlets, Coiflets, Beylkin, and Vaidyanathan, to identify the most suitable filters for each pathology.</p><p><strong>Results: </strong>The proposed method achieved high classification accuracy, surpassing several state-of-the-art approaches. The best-performing filters varied by pathology, with Sym32, Beylkin18, and Vaidyanathan24 excelling for dysphonia, Daub4, Daub12, Sym8, and Coif6 for Reinke's edema, and Haar, Sym32, and Coif6 for recurrent laryngeal nerve paralysis. Energy and ZCR proved particularly effective as features, while entropy exhibited limited performance in this context.</p><p><strong>Conclusions: </strong>The integration of wavelet-based signal analysis and paraconsistent logic offers a powerful approach for voice pathology classification. This methodology not only improves classification accuracy but also provides a computationally efficient framework suitable for clinical applications. Future work will focus on expanding datasets and developing real-time diagnostic tools.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2025.01.021","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: This study explores the application of wavelet analysis and paraconsistent logic for the classification of voice pathologies. The primary objective is to develop a methodology combining signal decomposition techniques and intelligent classification to distinguish between healthy and pathological voice samples.
Methods: Voice signals from the Saarbruecken Voice Database were preprocessed and decomposed using the discrete-time wavelet packet transform across multiple levels. Features such as energy, entropy, and zero-crossing rate (ZCR) were extracted for classification using support vector machines. Additionally, a paraconsistent logic framework was implemented to handle uncertainty and class overlap, enhancing classification. Six wavelet families were analyzed, including Haar, Daubechies, Symlets, Coiflets, Beylkin, and Vaidyanathan, to identify the most suitable filters for each pathology.
Results: The proposed method achieved high classification accuracy, surpassing several state-of-the-art approaches. The best-performing filters varied by pathology, with Sym32, Beylkin18, and Vaidyanathan24 excelling for dysphonia, Daub4, Daub12, Sym8, and Coif6 for Reinke's edema, and Haar, Sym32, and Coif6 for recurrent laryngeal nerve paralysis. Energy and ZCR proved particularly effective as features, while entropy exhibited limited performance in this context.
Conclusions: The integration of wavelet-based signal analysis and paraconsistent logic offers a powerful approach for voice pathology classification. This methodology not only improves classification accuracy but also provides a computationally efficient framework suitable for clinical applications. Future work will focus on expanding datasets and developing real-time diagnostic tools.
期刊介绍:
The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.